I started 6 months ago when I wanted to look inside datasets from Hugging Face. I was not satisfied with the existing libraries, so, naturally, I built my own from scratch.<p>Parquet is a very complicated format. It has 22 data types, 9 encodings, and 8 compression codecs. Previous Parquet JS libraries were abandoned because of this complexity. However, I can confidently say that Hyparquet is now the most conformant Parquet parser in existence. It can open more files than PyArrow and DuckDB. I dare you to find a file that Hyparquet can’t open!<p>In addition to supporting all Parquet files, Hyparquet supports efficient streaming of Parquet data over the network, even cross-domain using CORS. This means you can stream files straight from S3 with no backend.<p>Hyparquet is open source under the MIT license.<p>You can launch a local Parquet file viewer by running "npx hyperparam"
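<p>To illustrate how streaming a Parquet file without a backend can work in principle, here is a minimal sketch of the first step a reader might take: fetching only the last 8 bytes of the file with an HTTP range request. Per the Parquet format, a file ends with a 4-byte little-endian metadata length followed by the magic bytes "PAR1". The function names below are hypothetical and are not Hyparquet's API. The sketch assumes the server honors Range requests and that its CORS configuration allows the Range header.

```typescript
// Hypothetical helpers for illustration only; not part of Hyparquet's API.

/**
 * Parse the last 8 bytes of a Parquet file.
 * Layout: 4-byte little-endian metadata length, then the magic "PAR1".
 * Returns the metadata length in bytes.
 */
function parseFooterTail(tail: Uint8Array): number {
  if (tail.length !== 8) {
    throw new Error(`expected 8 bytes, got ${tail.length}`)
  }
  const magic = new TextDecoder().decode(tail.subarray(4, 8))
  if (magic !== 'PAR1') {
    throw new Error('not a parquet file: bad magic bytes')
  }
  const view = new DataView(tail.buffer, tail.byteOffset, tail.byteLength)
  return view.getUint32(0, true) // true = little-endian
}

/**
 * Fetch only the last 8 bytes of a remote file using a suffix Range request,
 * then return the metadata length. Assumes the server answers with 206.
 */
async function fetchMetadataLength(url: string): Promise<number> {
  const res = await fetch(url, { headers: { Range: 'bytes=-8' } })
  if (res.status !== 206) {
    throw new Error(`range request not honored: HTTP ${res.status}`)
  }
  return parseFooterTail(new Uint8Array(await res.arrayBuffer()))
}
```

With the metadata length known, a reader could issue a second range request for just the metadata, and then further range requests for only the column chunks it needs, rather than downloading the whole file.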