How to Read Parquet File in Java Using Parquet-Avro

There seems to be lack of documentation/tutorial in the web about reading Parquet file in Java. I am currently exploring this, https://github.com/apache/parquet-mr and tried this https://github.com/apache/parquet-mr/tree/master/parquet-avro but I am not sure how to use it.

I also cannot find any comprehensive documentations and tutorials about using parquet-mr.

I tried to follow some codes in this test, https://github.com/apache/parquet-mr/blob/master/parquet-avro/src/test/java/org/apache/parquet/avro/TestReadWrite.java, but it seems the examples involve Hadoop which I don’t have any idea about.

How can I read the parquet file without having to think about (or include) Hadoop-related classes?

Leave a Comment