Search
Menu
Home
Sources
About
Contacts
Apache ORC
Apache
ORC
is a
free and open-source
column-oriented
data storage
format
of the
Apache Hadoop
ecosystem
. It is similar to the other columnar-storage
file formats
available in the
Hadoop
ecosystem such as
RCFile
and
Parquet
. It is
compatible
with most of the
data processing
frameworks
in the Hadoop
environment
.
In
February 2013
, the
Optimized
Row
Columnar
file format
was
announced
by
Hortonworks
in
collaboration
with
Facebook
.
A
month
later, the
Apache Parquet
format was announced, developed by
Cloudera
and
Twitter
.