Show HN: Hyparquet 1.0 – Apache Parquet Parser for the Browser

6 points by platypii 11 months ago
I started 6 months ago when I wanted to look inside datasets from Hugging Face. I was not satisfied with existing libraries. So, naturally, I built my own library from scratch.

Parquet is a very complicated format. It has 22 data types, 9 encodings, and 8 compression codecs. Previous Parquet JS libraries were abandoned due to the complexity. However, I can confidently say that Hyparquet is now the most conformant Parquet parser in existence. It can open more files than PyArrow and DuckDB. I dare you to find a file that Hyparquet can’t open!

In addition to supporting all Parquet files, Hyparquet supports efficient streaming of Parquet data over the network, even cross-domain using CORS. This means you can stream files straight from S3 with no backend.

Hyparquet is open source under the MIT license.

You can launch a local Parquet file viewer by running "npx hyperparam".
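
For readers wondering how a browser can read Parquet over the network without downloading the whole file: a Parquet file ends with its footer metadata, followed by a 4-byte little-endian metadata length and the magic bytes `PAR1`. A client can therefore request only the tail of the file and then only the byte ranges it needs. The sketch below is an illustration of that idea, not Hyparquet's actual code; the names `locateFooter` and `rangeHeader` are hypothetical, and the file size is assumed to be known already (for example from a prior HEAD request).

```typescript
// Hypothetical helper types and functions for illustration only;
// these are not part of Hyparquet's API.
interface FooterLocation {
  metadataStart: number;  // byte offset where the footer metadata begins
  metadataLength: number; // footer metadata length in bytes
}

// "PAR1" as ASCII bytes, the magic that ends every Parquet file
const MAGIC = [0x50, 0x41, 0x52, 0x31];

// Given the last bytes of a file (at least 8) and the total file size,
// work out where the footer metadata lives.
function locateFooter(tail: Uint8Array, fileSize: number): FooterLocation {
  if (tail.length < 8) throw new Error("need at least the last 8 bytes");
  for (let i = 0; i < 4; i++) {
    if (tail[tail.length - 4 + i] !== MAGIC[i]) {
      throw new Error("missing PAR1 magic: not a Parquet file");
    }
  }
  const view = new DataView(tail.buffer, tail.byteOffset, tail.byteLength);
  // 4-byte little-endian length sits just before the trailing magic
  const metadataLength = view.getUint32(tail.length - 8, true);
  const metadataStart = fileSize - 8 - metadataLength;
  // The file also starts with a 4-byte magic, so metadata cannot begin before offset 4
  if (metadataStart < 4) throw new Error("metadata length exceeds file size");
  return { metadataStart, metadataLength };
}

// Build the value of an HTTP Range header; the end offset is inclusive.
function rangeHeader(start: number, length: number): string {
  return `bytes=${start}-${start + length - 1}`;
}
```

In use, the string from `rangeHeader(fileSize - 8, 8)` would be sent as the `Range` header to fetch the tail, and a second request with `rangeHeader(metadataStart, metadataLength)` would fetch the metadata itself. For cross-domain reads, the storage bucket has to allow such requests through its CORS configuration.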

2 comments

doppenhe 11 months ago
Could HyParquet's approach be extended to other data formats beyond Parquet?
doppenhe 11 months ago
Nice, thanks for sharing.