TechEcho

CSVKit: CSV utilities that includes csvsql, csvgrep, csvstat, and more

103 points by ConceitedCode almost 11 years ago

9 comments

kbenson almost 11 years ago
I like fsql[1], as it allows me to combine multiple CSV, TSV, LTSV, JSON and YAML files as SQL tables and query against them. Very useful.

[1]: https://metacpan.org/pod/distribution/App-fsql/bin/fsql
rwmj almost 11 years ago
csvtool:

https://forge.ocamlcore.org/plugins/scmgit/cgi-bin/gitweb.cgi?p=csv/csv.git;a=blob;f=examples/csvtool.ml;h=950e9aa6705c45f1315b285d055033e42b5d9ea4;hb=HEAD#l512

You will already have this program in your Linux distro, since I wrote it about a decade ago and it has been maintained by a small team ever since. It also handles all the quirks of Excel CSV files and is used extensively in production.
smackfu almost 11 years ago
No, everyone must think they can start by splitting on commas and descending slowly into madness. That is the way of the programmer.
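
To make the point above concrete, here is a minimal sketch comparing a naive comma split with Python's standard `csv` module. The sample row is made up for illustration; it contains a quoted comma and an escaped quote, two of the cases that tend to break hand-rolled splitting.

```python
import csv
import io

# Hypothetical sample: a quoted field containing a comma and an escaped quote.
text = 'id,name,note\r\n1,"Smith, Jane","said ""hi"""\r\n'

# Naive approach: split each line on commas.
naive_rows = [line.split(",") for line in text.splitlines()]

# Parser-based approach: csv.reader understands quoting and escaped quotes.
parsed_rows = list(csv.reader(io.StringIO(text, newline="")))

print(naive_rows[1])   # four fragments, with stray quote characters left in
print(parsed_rows[1])  # three clean fields
```

The naive split cuts the name in two and leaves the quote characters in place, while the parser returns the three fields the row was meant to hold.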
denimboy almost 11 years ago
Also OpenRefine (formerly Google Refine)

    http://openrefine.org/

is like a GUI version of csvkit.

It can do external lookups and fuzzy matching, and it supports its own expression language, GREL, as well as Jython.
pessimizer almost 11 years ago
I'm going to continue to throw it out there when anybody mentions csvkit, but I can't believe that more people haven't settled on http://csvfix.byethost5.com/csvfix15/csvfix.html
joncooper almost 11 years ago
Do any of y'all have an opinion on which of the tools mentioned here is the most robust to junky input?
Malarkey73 almost 11 years ago
I'm not totally sure why csvgrep, csvsort, csvcut and csvjoin are needed as replacements for grep, sort, cut or join?

The syntax doesn't seem noticeably clearer?

And, without testing, I presume csvkit in Python is a bit slower than the GNU coreutils in C?
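
One way to look at the question: line-oriented tools match against the whole line, while a CSV-aware filter matches against a single parsed column. The rough Python sketch below shows that distinction with made-up data and a hypothetical helper, `rows_where`. It is not csvkit's implementation, and it says nothing about relative speed.

```python
import csv
import io
import re

# Made-up data: "Paris" is the city in one row and part of the notes in another.
data = (
    "name,city,notes\n"
    "Ann,Paris,likes tea\n"
    'Bob,Berlin,"moved from Paris, France"\n'
)

def rows_where(text, column, pattern):
    """Return rows whose `column` value matches `pattern` (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(text))
    return [row for row in reader if re.search(pattern, row[column])]

# Line-level matching, roughly what a plain grep does: both data rows match.
line_hits = [line for line in data.splitlines()[1:] if re.search("Paris", line)]

# Column-level matching: only the row whose city column contains "Paris".
column_hits = rows_where(data, "city", "Paris")

print(len(line_hits), [row["name"] for row in column_hits])
```

The same reasoning applies to cutting, sorting and joining: once a field can contain the delimiter, a tool has to parse the row before it can address a column reliably.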
fsiefken almost 11 years ago
It would be nice to have a tool that can query markdown/org-mode tables... correlations, averages and sums would be nice too.
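
As a rough idea of what such a tool might do, here is a minimal Python sketch that parses a simple pipe table and computes a sum, an average and a Pearson correlation. The table, the helper names `parse_pipe_table` and `pearson`, and the column names are all made up for illustration. It assumes well-formed rows with no escaped `|` characters.

```python
import statistics

# Made-up Markdown table; org-mode tables are similar but use "+" in the separator.
table = """\
| item   | qty | price |
|--------|-----|-------|
| apple  | 3   | 0.50  |
| pear   | 5   | 0.80  |
| quince | 2   | 1.20  |
"""

def parse_pipe_table(text):
    """Parse a simple pipe table into a list of dicts (hypothetical helper)."""
    rows = []
    for line in text.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Skip separator lines such as |----|----| or |----+----|
        if all(set(c) <= set("-:+") for c in cells):
            continue
        rows.append(cells)
    header, body = rows[0], rows[1:]
    return [dict(zip(header, r)) for r in body]

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

records = parse_pipe_table(table)
qty = [float(r["qty"]) for r in records]
price = [float(r["price"]) for r in records]

total_qty = sum(qty)
mean_price = statistics.mean(price)
r = pearson(qty, price)
print(total_qty, mean_price, r)
```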
dinedal almost 11 years ago
I'd love to add similar features to textql[1]. I am currently working on a v2 that does a lot more, but right now it fills a gap not covered by CSVKit, which is the direct execution of SQL on flat files.

[1]: https://github.com/dinedal/textql
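
For readers unfamiliar with the idea of running SQL directly on a flat file, the sketch below shows one way it can be done with Python's standard `csv` and `sqlite3` modules. It loads the rows into an in-memory table and then queries it. The data, the table name `data` and the column names are made up, and this is only an illustration of the general technique, not a description of how textql or csvsql work internally.

```python
import csv
import io
import sqlite3

# Made-up CSV content standing in for a flat file on disk.
csv_text = "city,population\nOslo,700000\nBergen,285000\nTromso,77000\n"

reader = csv.reader(io.StringIO(csv_text))
header = next(reader)

conn = sqlite3.connect(":memory:")
# Column names come from the header row; quoting them guards against odd names.
columns = ", ".join('"{}"'.format(name.replace('"', '""')) for name in header)
conn.execute(f"CREATE TABLE data ({columns})")
placeholders = ", ".join("?" for _ in header)
conn.executemany(f"INSERT INTO data VALUES ({placeholders})", reader)

# Values are stored as text, so cast explicitly for numeric comparisons.
result = conn.execute(
    "SELECT city FROM data "
    "WHERE CAST(population AS INTEGER) > 100000 ORDER BY city"
).fetchall()
print(result)
conn.close()
```

Because every value arrives as text, a real tool would also need some form of type inference or explicit casting, which is where much of the complexity tends to live.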