TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Haskell improves log processing 4x over Python

114 pointsby jmintzover 14 years ago

6 comments

andrewcookeover 14 years ago
The work sounds very cool (and they are hiring), but (only) a factor of 4 speedup over Python is (to repeat a phrase from elsewhere today) like boasting that you're the tallest midget ;o)
评论 #2137176 未加载
评论 #2137233 未加载
Peakerover 14 years ago
Sounds great. I'm a very big Haskell fan.<p>I'd love to point people to this when trying to convey some advantages of Haskell. To make it more compelling, can you expand some on the downsides and maybe obstacles you encountered?<p>The thing I'm unsure about, is how difficult it would be for (very) talented developers to just jump in. We have really talented developers, and everyone is super time-constrained, so many are wary of diving into a language as different as Haskell. Was it hard for your developers to figure Haskell out? Did your previous use of Scala help? How long did it take them to dive into Scala?
评论 #2137222 未加载
评论 #2138657 未加载
Locke1689over 14 years ago
The author is mostly write about the usage cases of Haskell, but simply "systems" is a bit misleading because there are certain performance characteristics of lazy programs which make them bad choices for some systems programs. Any type of real-time system, for example, can suffer unpredictable performance in critical sections, which is pretty undesirable.
评论 #2137651 未加载
评论 #2137578 未加载
评论 #2138328 未加载
ynnivover 14 years ago
Are the logs being read from disk? In my experience, python is highly optimized for reading (possibly compressed) files from disk. If your infrastructure keeps logs in memory, python will lose this advantage and compete on computational performance where Haskell has the advantage. This is important for those of us who grind logs on disk and might be considering a language switch.
评论 #2137291 未加载
评论 #2137281 未加载
kordlessover 14 years ago
I'd be interested in hearing more about how the author is using the resulting data set. Doing extractions at event generation time can be very useful if you know what you are after in advance, but not so good for adhoc analysis.<p>Any reason why you didn't use Hadoop for this, then run batch jobs to extract summaries?
评论 #2138360 未加载
aristusover 14 years ago
Awesome work. If you haven't heard about Tim Bray's WideFinder challenge, it was really interesting.<p><a href="http://tartarus.org/james/diary/2008/06/17/widefinder-final-results" rel="nofollow">http://tartarus.org/james/diary/2008/06/17/widefinder-final-...</a>