TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

TXR: A Programming Language for Convenient Data Munging

91 pointsby hashxover 10 years ago

9 comments

rout39574over 10 years ago
I wish their page included something along the lines of &quot;Why do I care?&quot;<p>Maybe a few examples of &quot;data munging&quot; tasks which the authors view as poor fits for [language X] and how their stuff solves the problem better.<p>Maybe something like &quot;why is our language better than regexps in whatever language environment you already know?&quot;
评论 #8410205 未加载
评论 #8410979 未加载
nieveover 10 years ago
TXR looks rather like the CRM114 language that&#x27;s been used to implement some rather amazingly accurate text classifiers (some better than most people on their own mail), though a bit less bizarre and I think more accessible: <a href="http://crm114.sourceforge.net/docs/INTRO.txt" rel="nofollow">http:&#x2F;&#x2F;crm114.sourceforge.net&#x2F;docs&#x2F;INTRO.txt</a> CRM114 too treats pattern matching as the fundamental construct and has blazing performance for it and certain kinds of number crunching (it has to), but I don&#x27;t think it&#x27;s nearly as useful for the average hacker trying to munge a couple of text files. Still, worth a look both to users and possibly to language implementors. I&#x27;m definit
spullaraover 10 years ago
Reminds of a trick I do with mustache.java. Templates can not only be used to generate output, but because of the declarative nature of the mustache language they can be used to parse output back into data that in combination with the template would generate that output. Makes for pretty intuitive parsers. In my case all text that isn&#x27;t templating declarations are regexes.
评论 #8410118 未加载
dansoover 10 years ago
As I come to see more of my data-related work be consumed by data munging&#x2F;cleaning work, I&#x27;m convinced that a language&#x2F;framework devoted to data munging is at least as important as those devoted to data visualization.
slackstationover 10 years ago
It looks ugly and akward to type. It&#x27;s doesn&#x27;t seem like it would be a pleasure in which to write programs.
评论 #8410436 未加载
评论 #8410025 未加载
评论 #8410097 未加载
评论 #8411091 未加载
评论 #8410493 未加载
baneover 10 years ago
Cool ideas. I really like that it has support for grammars. What&#x27;s the performance like compared to Perl on similar tasks?
aureliusover 10 years ago
Kaz Kylheku is one of the kooks from comp.lang.lisp where lisp is the One True Language. The funny thing is that TXR is written in C!<p>Kaz: How come you didn&#x27;t write TXR in lisp?
评论 #8430269 未加载
hyp0over 10 years ago
At first I thought this was TXL, for source code transformation <a href="http://www.txl.ca/" rel="nofollow">http:&#x2F;&#x2F;www.txl.ca&#x2F;</a>
stdbrouwover 10 years ago
Hmm, this looks more like parsing than munging to me, but then I guess &quot;munging&quot; is not exactly scientific terminology.<p>My own take on easy data transformations, if you&#x27;ll allow me the plug: <a href="https://github.com/stdbrouw/refract" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;stdbrouw&#x2F;refract</a>