Efficient Tabular Storage

62 points by ah- over 9 years ago

6 comments

SloopJon over 9 years ago
It looks like the NYCTaxi dataset is here:
http://www.andresmh.com/nyctaxitrips/
Some background on this data:
http://chriswhong.com/data-visualization/taxitechblog1/
And data for 2014 directly from the city:
https://data.cityofnewyork.us/view/gn7m-em8n
TheGuyWhoCodes over 9 years ago
Vertica has all those performance enhancements. It's a great DB, but I can't recommend it because of pricing :(
owlish over 9 years ago
How do databases like MySQL store data efficiently for querying? It seems like something like protobuf would do well here, though you'd need to generate code for each dataset.
Little_Peter over 9 years ago
How is it different from HDF5 (h5py and pytables)?
Ashim_Usmani over 9 years ago
Great share
Sprint over 9 years ago
This is super interesting, thanks!