What is a .ORC file?
ORC (Optimized Row Columnar) is a columnar big-data format from the Hadoop world.
- Did you know
- ORC was created to store Hive data tables compactly and query them fast.
- ORC was announced by Hortonworks together with Facebook to overcome the limits of the earlier RCFile and speed up Apache Hive queries.
- Unlike RCFile, which treated each column as an opaque blob, ORC retains the table’s type information and writes one file per task.
- What Analyser reads
- Identify and read developer and data-serialisation files: dependency lockfiles (npm/Yarn/pnpm/Cargo/Poetry/Bundler/Composer - locked-package count), binary serialisations (MessagePack, CBOR, BSON, raw Protobuf messages and descriptor sets), Python pickles with a security note, NumPy .npz and Java jar/war/ear archives, IDL schemas (FlatBuffers/Thrift/Cap n Proto/HCL), MATLAB MAT-files, Redis RDB dumps and columnar big-data containers (Apache Arrow/Feather, Parquet, ORC). The JSON supersets JSON5/JSONC/Hjson now open in a full viewer - see Notebooks & data above.
- Depth of analysis
- .ORC is an identification-grade format: Analyser recognises it from its bytes and decodes the header metadata it carries, rather than opening it in a full viewer. Formats that do get a full viewer are marked "Full" on the formats page.
- Open a .ORC file
- Drag a .ORC file onto the Analyser home page (or tap to pick one). It is identified entirely in your browser - nothing is uploaded, there is no account, and it works offline once installed.