Samples are organised by feature in separate packages. Each sample has a specific README file in its corresponding package. The easiest way to launch a sample is to open up a unit test in your IDE and ...
ORC is a self-describing type-aware columnar file format designed for Hadoop workloads. It is optimized for large streaming reads, but with integrated support for finding required rows quickly.