TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Visprex – Open-source, in-browser data visualisation tool for CSV files

194 点作者 kengoa6 个月前
Hello HN. I&#x27;ve always found writing data visualisation scripts boring and repetitive in data science workflows earlier in my career, so I built this tool to automate it. The available methods are based on my experience in econometrics where histograms and scatterplots were the starting points to check data distributions.<p>The link is to the documentation and the app is freely available at <a href="https:&#x2F;&#x2F;visprex.com" rel="nofollow">https:&#x2F;&#x2F;visprex.com</a>, and if you&#x27;re curious about the implementation it&#x27;s open source at <a href="https:&#x2F;&#x2F;github.com&#x2F;visprex&#x2F;visprex">https:&#x2F;&#x2F;github.com&#x2F;visprex&#x2F;visprex</a>. I&#x27;d appreciate any comments and feedback!

9 条评论

doodlebugging6 个月前
Looks nice. I&#x27;ve had occasion to import&#x2F;export, edit, etc. thousands of CSV files from multiple software platforms over the years and this tool looks like a simple way for a user to determine whether there are issues in the CSV file that will cause problems on import to their application.<p>One question I immediately have is how this compares to a spreadsheet CSV import tool such as the one in Excel which is extremely flexible. It appears that this app requires a specific format (comma delimited, new line at end of each row) in order to work. I never tried to count the times that a CSV file that I had to work with required editing in order to facilitate import to Excel or other application because CSV is such a non-standard standard output that the only way one could know whether the import would be successful was to pop it into an editor, like Notepad++ and examine it before import. Notepad++ was a critical tool in the chain to force compliance for all the different applications I used. Each application allowed CSV import&#x2F;export but some accepted almost any delimiter while others were strict about file format and failing to understand the expected CSV format for each would definitely cause headaches as some input errors could leave a very subtle footprint that you may not catch until late in processing.<p>Anyway, it appears that your definition of CSV format is pretty strict so how do you propose that a user manage importation of files that do not fit your CSV definition? Notepad++ before import to verify compliance?<p>I also see one thing on the main page under &quot;Security&quot; that looks like it could be worded differently.<p>&gt;No tracking or analytics software is used for privacy<p>To me, this implies that no steps have been taken to manage user&#x2F;data privacy.<p>Perhaps a comma could be inserted so that it reads &quot;...used, for privacy.&quot; or maybe it should read:<p>For (user&#x2F;data) privacy, there is no tracking or analytics software.
评论 #42099366 未加载
paddy_m6 个月前
Nice work!<p>Do you have any plans for data cleaning?<p>I am working on a somewhat similar open source project. I intend to add heuristic data cleaning. With the UI I want to be able to toggle between different strategies quickly - strip characters from a column to treat it as numeric, if less than 2% or 5% of values have a character, fill na with mean, interpret dates in different formats - drop if the date doesn&#x27;t parse. The idea bing that if it&#x27;s really quick to change between different strategies, you can create more opinionated strategies to get to the right answer faster.<p>Happy to collaborate and talk tables with anyone who&#x27;s interested.
评论 #42099634 未加载
评论 #42097918 未加载
teddyh6 个月前
I loaded a CSV with one date&#x2F;time column and one numerical column. I then selected “Scatter Plot”, but got the message “Not enough numerical columns found. Load a CSV file with at least 2 numerical columns in the Datasets tab.” I would have thought that a date&#x2F;time column would count?
评论 #42097449 未加载
TripleChecker6 个月前
Are you planning to add xlsx support?<p>Also a few typos you might want to review: <a href="https:&#x2F;&#x2F;triplechecker.com&#x2F;s&#x2F;823563&#x2F;docs.visprex.com" rel="nofollow">https:&#x2F;&#x2F;triplechecker.com&#x2F;s&#x2F;823563&#x2F;docs.visprex.com</a>
评论 #42100641 未加载
mosselman6 个月前
Cool! Does anyone know of any javascript libraries that I could use to get this type of distribution visualisation from tabular data? Something I can run on my site that is.
评论 #42099609 未加载
parsimo20106 个月前
I like this a lot- I am going to show it to my students!<p>They seem to hate learning R, and while this doesn’t prevent them from having to build a model, this will speed up the exploration steps.
评论 #42099641 未加载
rrr_oh_man6 个月前
Very cool stuff!<p>Maybe bar &#x2F; beeswarm charts would be useful?<p>I was missing the possibility to show differences by category, eg mpg by make in the cars dataset.
评论 #42097467 未加载
jeffreygoesto6 个月前
I typically fire up GnuPlot and there CSV loading and a plot are one line each. What does Visprex do more or better?
评论 #42099335 未加载
imfing6 个月前
cool project!<p>Visualizing tabular data often presented some challenges, as I had to rely on tools like Google Sheets or Colab + Pandas for quick cleaning and wrangling before exploring different visualizations.<p>I think having more client-side data cleaning capabilities would make it even more powerful
评论 #42099657 未加载