Skip to content

hrmeetsingh/ParquetReader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build Status

Parquet Reader

Boilerplate java code for reading Apache Parquet file without installing and configuring libraries like Apache Spark/pandas.

Helps understand

  • Basic organization of data and schema inside a parquet file
  • Use of core Apache libraries only

Work in progress

  • Can be packaged to a jar (similar to parquet-tools, but leaner) for quick inspection of parquet files
  • Exception and edge case handling