TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Good tool to visualize data?

20 pointsby hyuenover 15 years ago
Hi all, I am in the need to visualize some time series data. The problem I am having is that I want to be able to impose restrictions in an ad-hoc way, such as delimiting time, filtering out some categories, and then get some statistics on that data, such as the mean, a histogram, or other things.<p>What I am doing right now is a bunch of gnuplot scripts together with python/c programs to grab the data, but this is getting tedious.<p>Is there any open source tool/library that does something similar?<p>Thanks

10 comments

jjguyover 15 years ago
The suggestions below are solid:<p><pre><code> R Python + matplotlib </code></pre> I've used and been impressed by both. Starlight and Palantir may fit your needs, but those tend closer to enterprise applications -- I get the sense you're searching for something lighter.<p>Some more suggestions:<p><i>DAVIX</i> [D] A live CD distribution containing lots of visualization tools, from capture to parsing to presentation.<p><i>SecViz</i> and their "graph exchange" [0] -- lots of pictures of various datasets, usually with details of how it was produced.<p>Personally, I found most packages too restrictive and fell back on Processing: <a href="http://processing.org" rel="nofollow">http://processing.org</a> A project out of the MIT Media Lab, it's a generic graphics framework you can (among other things) produce 2D time series graphs in. It's still tedious, but tedium is traded for control.<p>Some of my projects in Processing:<p><a href="http://jjguy.com/som/" rel="nofollow">http://jjguy.com/som/</a> - Self Organizing Maps<p><a href="http://jjguy.com/life/" rel="nofollow">http://jjguy.com/life/</a> - Conway's Game of Life<p>I don't have a great time series write-up posted, but I've been working with Robert May's population model recently. [1] You can find the source at [2] and example output image at [3].<p>[D] <a href="http://secviz.org/content/the-davix-live-cd" rel="nofollow">http://secviz.org/content/the-davix-live-cd</a><p>[0] <a href="http://secviz.org/category/image-galleries/graph-exchange" rel="nofollow">http://secviz.org/category/image-galleries/graph-exchange</a><p>[1] <a href="http://en.wikipedia.org/wiki/Logistic_function#In_ecology:_modeling_population_growth" rel="nofollow">http://en.wikipedia.org/wiki/Logistic_function#In_ecology:_m...</a><p>[2] <a href="http://jjguy.com/populationModel.pde" rel="nofollow">http://jjguy.com/populationModel.pde</a><p>[3] <a href="http://jjguy.com/normal.png" rel="nofollow">http://jjguy.com/normal.png</a>
kalendaeover 15 years ago
R <a href="http://www.r-project.org/" rel="nofollow">http://www.r-project.org/</a> i would consider it to have a high learning curve tho.
评论 #930283 未加载
beambotover 15 years ago
Since you're already using Python, I'd highly recommend looking at Matplotlib (pylab). We've used it to great utility in the past.
apuover 15 years ago
I've been building up a javascript library to do this, as visualizizing results/data takes up more than half my research time.<p>Unfortunately, this library is nowhere close to ready to be released, but I can describe roughly the architecture, in case it's useful to you.<p>I take all my data and write a python script which will dump it to a .JSON file. This includes the raw data/results, names of different fields, groups of fields to toggle on/off together, and what kinds of visualizations I want to use with a given set of data (view as numbers, as bar graph, as line graph, as linear time-series data, as color-coded images, etc.)<p>Then I have a standard html + javascript file in which I simply load in this JSON file. Because I've prespecified the format of the JSON file (I.e., what fields it has and how the data is stored), I only need to customize a few functions to display results. Things like filtering data, searching and sorting I get "for free", since they're in the library.<p>The main advantages:<p>- Interactive browsing of data in various formats, all in the web browser, with no plugins etc. required.<p>- Can be viewed locally or across the network<p>- No need for a server to be running<p>- Processing done on each local computer, as opposed to on some server<p>- HTML 5 is now good enough (just barely) to offer all the kinds of interaction I require<p>- Most of the code is in my standard library for visualizations, and so the time to create a new visualization for a new set of data is quite small.<p>Drawbacks:<p>- HTML 5 is still slow<p>- Can't do very advanced stuff yet, without writing a lot of custom code. On the otherhand, using jQuery + jQuery UI, it's very easy to make things draggable, for example, to compare various things side-by-side or even on top of each other (with transparancy)<p>- No server, so can't "save" complicated settings or parameters (although cookies help).<p>Anyway, in case you end up building something of your own, hopefully some of these ideas might help.
araneaeover 15 years ago
Flot is super cool but probably a bit off-topic for your needs <a href="http://code.google.com/p/flot/" rel="nofollow">http://code.google.com/p/flot/</a><p>Every time I try to use R I just say "fuck it" and use Matlab. Matlab is pretty flexible w.r.t. manipulating data and there's a lot of tools to deal specifically with time series analysis <a href="http://www.mathworks.com/access/helpdesk/help/techdoc/data_analysis/brenonn.html" rel="nofollow">http://www.mathworks.com/access/helpdesk/help/techdoc/data_a...</a><p>The open source version of Matlab is Octave but they aren't really comparable. But if you're dead set on open source, that's what I'd recommend.<p>If you want the flexibility of Matlab and the prettiness and manipulation ability of flot, Mathematica produces some pretty elegant figures, but I hate the notebook interface. Yes, I know, you can use a command line interface with it but the UI for Matlab is actually useful and awesome (like being able to see the contents of all the objects you create.)
elblancoover 15 years ago
Depends on the kinda of visualization you are talking about.<p>I've worked extensively with tools like: Starlight <a href="http://www.futurepointsystms.com" rel="nofollow">http://www.futurepointsystms.com</a> Palantir (mentioned previously) <a href="http://www.palantirtech.com" rel="nofollow">http://www.palantirtech.com</a> I2's tools <a href="http://www.i2inc.com/" rel="nofollow">http://www.i2inc.com/</a> and others.<p>There's also scientific data visualization, for looking at things like heat dissipation in an engine...<p>But it sounds like you are looking for something like Matlab.
hotpocketsover 15 years ago
Octave? (sort of an open source matlab)
fizxover 15 years ago
Look into rrdtool.
hyperbovineover 15 years ago
Ggobi
keefeover 15 years ago
mathematica