科技回声

15 条评论

Excellent work! I have a very small (marketing-related) nitpick:Would you consider hosting your project on github instead of SourceForge? I don't say this to be trendy, github just has a much more pleasing interface in my opinion. It's also faster, subjectively at least.

评论 #3515786 未加载

评论 #3515725 未加载

mrich超过 13 年前

Congratulations, well done!Our team has built an in-memory DB (hybrid, both column and row store available, also using advanced compression). We have investigated GPUs for scans, we found that while they can outperform CPUs, the bottleneck is the transfer of the data to the card. Since the card's memory is only in the GB's, and our databases are typically much larger this is kind of limiting, since you have to copy data back and forth.Do you know what the maximum amount of GPU memory for a blade-like system currently is?

评论 #3513391 未加载

评论 #3515970 未加载

评论 #3516083 未加载

beagle3超过 13 年前

Super cool, and thanks for sharing!Are you familiar with APL/K/Q/J ? These implement column store databases which a programming language significantly more expressive and usable. K and J are also significantly faster than any other SQL database -- it would be interesting to compare K's ultra-optimized in-memory processing to the GPU -- I'm not at all sure GPU is going to win here in practice.See <a href="https://github.com/kevinlawler/kona" rel="nofollow">https://github.com/kevinlawler/kona</a> for an open source K implementation (the wiki has docs). Same language, newer version (with database integration) and english language instead of ascii chars is called Q see <a href="http://kx.com/q/d/" rel="nofollow">http://kx.com/q/d/</a> (kdb1, q1 if you like longer, kdb, q if you like terser), and <a href="http://code.kx.com/wiki/Main_Page" rel="nofollow">http://code.kx.com/wiki/Main_Page</a>

评论 #3517112 未加载

评论 #3514448 未加载

corysama超过 13 年前

Very, very cool. I'm always happy to see more ways to utilize the tremendous performance/price of GPUs.So, what are the pros and cons of this approach? For a given hardware and energy budget, what kinds of task would be better suited to this kind of GPU implementation? Conversely, what kind of tasks would still be faster to run on a Core i7?

评论 #3513521 未加载

joshu超过 13 年前

now this is the kind of thing i come to HN for... but why oh why sourceforge?

评论 #3513629 未加载

foobarbazetc超过 13 年前

This is awesome. :)Someone did something similar for PostgreSQL here:<a href="http://wiki.postgresql.org/wiki/PGStrom" rel="nofollow">http://wiki.postgresql.org/wiki/PGStrom</a>

chubs超过 13 年前

That looks absolutely brilliant. So is the database all stored in the GPU memory? Or in main memory? Or does your engine handle storing to disk as well?Pretty novel use of a GPU though, and if it's quick, it could be really useful.

评论 #3513289 未加载

grundprinzip超过 13 年前

Looks like a very interesting approach to me given the recent developments.Still I have a few questions regarding your numbers. Can you specify how much data you actually stream from disk? Because when doing some back of the envelope calculations (8 columns out of a 1.8B rows assuming 4b each) you would need to stream ~300MB/s from disk. Which seems very unlikely for your setup (except you have a super fast SATA drive, or SSD).Now, with only 4GB RAM, your are constantly filling your RAM and transporting data to GPU. Plus you need to compress the data somewhere (on GPU?).Do I understand the manual.txt correctly, that you can only achieve the performance when the data is written compressed to disk before? (While sorting it?)I may be wrong, but the group by and join looked like versions requiring sorted data.Can you please give a little more details on that?

评论 #3517383 未加载

ericflo超过 13 年前

Interesting. I wonder how a GPU could best be put to use in a database engine. Joins, I suppose? Would be cool to read more about this implementation, however the readme is a bit sparse.

pavelkaroukin超过 13 年前

antonmks, did you consider using OpenCL instead? If yes - why CUDA outweighed it?

评论 #3517246 未加载

alecco超过 13 年前

Column store, compare it with MonetDB (open source) and similar. This design is suitable for read-mostly databases. It's a trade-off, faster reads and much better compression. But the price is writes usually become an order of magnitude slower. It isn't good for transactional work (OLTP).<a href="http://en.wikipedia.org/wiki/Column-oriented_DBMS" rel="nofollow">http://en.wikipedia.org/wiki/Column-oriented_DBMS</a>Also the author talks (other comments) about transfer rate when the real issue with GPUS is latency. Most GPUs don't have decent RAM caches and it's very hard to cover the latency.Also: <a href="http://reddit.com/r/programming/comments/oxq6a/a_database_engine_that_runs_on_a_gpu_outperforms/c3kvts6" rel="nofollow">http://reddit.com/r/programming/comments/oxq6a/a_database_en...</a>

toblender超过 13 年前

Very nice. I'm glad people are using their GPU's for something more productive than farming bitcoins.

2AM超过 13 年前

nvidia gpus only, right? does it work with ati as well? CUDA is probably more advanced, so it's no surprise.

jrmg超过 13 年前

I love this kind of novel thinking. I'd definitely like to read more detailed articles on how this works.

评论 #3513345 未加载

TobbenTM超过 13 年前

What is the point of using "å" instead of "a"?

评论 #3513297 未加载

评论 #3513315 未加载

评论 #3513484 未加载

15 条评论

breckinloggins超过 13 年前

评论 #3515786 未加载

评论 #3515725 未加载

mrich超过 13 年前

评论 #3513391 未加载

评论 #3515970 未加载

评论 #3516083 未加载

beagle3超过 13 年前

评论 #3517112 未加载

评论 #3514448 未加载

corysama超过 13 年前

评论 #3513521 未加载

joshu超过 13 年前

now this is the kind of thing i come to HN for... but why oh why sourceforge?

评论 #3513629 未加载

foobarbazetc超过 13 年前

This is awesome. :)Someone did something similar for PostgreSQL here:<a href="http://wiki.postgresql.org/wiki/PGStrom" rel="nofollow">http://wiki.postgresql.org/wiki/PGStrom</a>

chubs超过 13 年前

评论 #3513289 未加载

grundprinzip超过 13 年前

评论 #3517383 未加载

ericflo超过 13 年前

Interesting. I wonder how a GPU could best be put to use in a database engine. Joins, I suppose? Would be cool to read more about this implementation, however the readme is a bit sparse.

pavelkaroukin超过 13 年前

antonmks, did you consider using OpenCL instead? If yes - why CUDA outweighed it?

评论 #3517246 未加载

alecco超过 13 年前

toblender超过 13 年前

Very nice. I'm glad people are using their GPU's for something more productive than farming bitcoins.

2AM超过 13 年前

nvidia gpus only, right? does it work with ati as well? CUDA is probably more advanced, so it's no surprise.

jrmg超过 13 年前

I love this kind of novel thinking. I'd definitely like to read more detailed articles on how this works.

Show HN: my database engine for GPU

15 条评论

Show HN: my database engine for GPU

15 条评论