The core concept here, that you ship computation to the data rather than the other way around, is what made Google so impressive when it launched. Lots of algorithms do well in that model. Back when I was at NetApp I designed a system where the "smart storage" essentially labeled blocks with an MD5 hash when you went to store them. That let you rapidly determine whether you already had the block stored and could safely toss the one being written[1]. Really fast de-duplication and good storage compression.

At Blekko they took this concept to the next logical step and built a storage array out of triply replicated blocks (called 'buckets') that were distributed by their hash id. You could then write templated Perl code that operated in parallel over hundreds (or thousands) of buckets and produced a composite result. It always surprised me that IBM didn't care about that system when they acquired Blekko; it was pretty cool. Implemented on these Samsung drives, it would make for a killer data science appliance. That design almost writes itself.

Also in the storage space there was the CMU "Active Disk" architecture[2], which was supposed to replace RAID. There was a startup spin-off from this work, but I cannot recall its name anymore, sigh.

These days it would be useful to design a simulator for systems like this and derive a calculus for analyzing their performance relative to other architectures. Probably a master's thesis, and maybe a PhD or two, in that work.

[1] Yes, MD5 hash collisions are a thing, but not for identical-length documents (aka an 8K block), and yes, NetApp got a patent issued for it.

[2] https://www.pdl.cmu.edu/PDL-FTP/Active/ActiveDisksBerkeley98.pdf
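The hash-keyed dedup idea is simple enough to sketch. Here's a minimal in-memory model of it in Python (names are mine, not NetApp's; the real thing lives in the controller's write path):

```python
import hashlib

BLOCK_SIZE = 8192  # 8K blocks, as in the NetApp example

class DedupStore:
    """Toy content-addressed block store: blocks are keyed by their MD5
    digest, so a duplicate write just bumps a refcount instead of storing
    a second copy."""

    def __init__(self):
        self.blocks = {}    # digest -> block bytes
        self.refcount = {}  # digest -> number of logical references

    def write(self, block: bytes) -> str:
        assert len(block) == BLOCK_SIZE
        digest = hashlib.md5(block).hexdigest()
        if digest not in self.blocks:
            self.blocks[digest] = block  # first copy: actually store it
        self.refcount[digest] = self.refcount.get(digest, 0) + 1
        return digest  # caller keeps the digest as the block's address

    def read(self, digest: str) -> bytes:
        return self.blocks[digest]
```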
I think putting something like SQLite on the actual storage device could be a super-efficient way to directly express your intent to the durable storage system and bypass mountains of virtual bullshit.

The optimization opportunities are pretty obvious to me. Imagine if SQLite journaling were aware of how long the supercapacitor in the SSD would last, potentially even with real-time monitoring of device variables. You could keep your entire WAL in DRAM on the drive as long as the drive has enough stored energy to flush it to NAND on external power loss.
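To make that concrete, here's a hypothetical sketch of the policy in Python. Every telemetry field here is an assumption; no shipping NVMe interface exposes supercapacitor state this way:

```python
from dataclasses import dataclass

@dataclass
class DriveTelemetry:
    stored_energy_joules: float  # energy left in the supercap (assumed field)
    flush_power_watts: float     # power drawn while dumping DRAM to NAND (assumed)
    nand_write_bw: float         # bytes/sec sustainable during that dump (assumed)

def wal_safe_in_dram(t: DriveTelemetry, wal_bytes: int) -> bool:
    # Energy budget -> seconds of guaranteed flush time -> bytes flushable
    # on power loss. The WAL can stay in device DRAM only while it fits
    # inside that budget; past it, the host must force a NAND write.
    flush_seconds = t.stored_energy_joules / t.flush_power_watts
    return wal_bytes <= flush_seconds * t.nand_write_bw
```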
I've been strongly interested in computational fabrics for at least 15 years... this looks interesting, but very, very locked down.

It is my understanding that FPGA vendors have fought the open source community every step of the way. I would hate to see the future of computing locked up in a new spiffy prison.
Storage is starting to get extremely exciting again. KV SSDs, this, and Intel's Optane are opening up a lot of new avenues for extremely high-performance storage.
Sorry if I missed it, but I'm not seeing it: what's the bandwidth here? i.e., the time to read, process and write back the whole contents of the disk (using just the FPGA)?
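As a back-of-envelope with assumed numbers (the article doesn't pin these down): a 4 TB drive whose FPGA can stream NAND at roughly 3 GB/s would need about 44 minutes for a full read-process-write pass.

```python
capacity_bytes = 4e12  # assumed: 4 TB drive
stream_bw = 3e9        # assumed: bytes/sec the FPGA can stream to/from NAND
full_pass_seconds = 2 * capacity_bytes / stream_bw  # read it all, write it all
print(f"{full_pass_seconds:.0f} s = {full_pass_seconds / 60:.0f} min")  # ~2667 s = ~44 min
```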
Can anyone quantify the advantages this yields in terms of latency and bandwidth, compared to plugging a regular SSD into an external FPGA (via PCIe or whatever interface)?
I'm still waiting to become skilled enough, or invested enough in a project, to merit dedicated super-fast SSD storage or some kind of exotic storage appliance!
I wonder how hard it would be to port the server-side code of FoundationDB to one of these devices; architecturally FDB seems well suited to this (at least until predicates show up), as it is already extremely constrained in what it expects of the storage nodes: they basically provide just (time-bounded) versioned KV access.
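The contract the storage nodes implement really is that small. A rough sketch of version-bounded KV reads (names are mine, not FDB's actual server interface):

```python
from collections import defaultdict

class VersionedKV:
    """Sketch of an FDB-style storage-node contract: writes carry a
    version, reads return the newest value at or below a read version."""

    def __init__(self):
        # key -> [(version, value), ...], appended in increasing version order
        self.history = defaultdict(list)

    def set(self, key: bytes, value: bytes, version: int) -> None:
        self.history[key].append((version, value))

    def get(self, key: bytes, read_version: int):
        best = None
        for version, value in self.history[key]:
            if version > read_version:
                break  # history is sorted; nothing later can qualify
            best = value
        return best
```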
Looks like this is a Xilinx KU15P: not shabby, but about half the size of the 3-die monstrosities in the AWS FPGA instances you can rent for ~$1.50 an hour. So it's useful for disk stuff closely coupled to the drive, but maybe not as a general compute resource (depending on the actual price, of course).
They really should extend KVS for this: it's going to be very difficult to leverage if the XSS interface sits underneath the filesystem (as shown in the diagram), especially for an RDBMS, where the database is (usually) a single big flat file as far as the filesystem is concerned.
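For illustration, an invented KV-level interface of the kind being asked for here; the command set is hypothetical, not Samsung's actual KV API. The point of pushing the filter down is that only matching values ever cross the host bus:

```python
class KVComputeDrive:
    """Hypothetical KV SSD with on-device filtering, purely a sketch."""

    def __init__(self):
        self._store = {}  # stand-in for the NAND-backed key-value store

    def put(self, key: bytes, value: bytes) -> None:
        self._store[key] = value

    def scan_filter(self, prefix: bytes, predicate) -> dict:
        # Conceptually runs on the drive's FPGA: the host receives only
        # the matching key/value pairs, never the full scan.
        return {k: v for k, v in self._store.items()
                if k.startswith(prefix) and predicate(v)}
```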
Reminds me of Micron's Automata Processor: https://www.cs.virginia.edu/~skadron/Papers/wang_APoverview_CODES16.pdf
So one can look at the chip beside the storage as a CPU offload built inside the drive, instead of a coprocessor on the motherboard. I'm not seeing a huge use case here beyond the narrowest ones, like decryption and compression.
So no mention of SYCL support... Only offering ~C in 2020 is an insult to computer science.

Unrelated: when will Nvidia allow seamlessly offloading Java, or another GC-based language, to the GPU?
https://developer.nvidia.com/blog/grcuda-a-polyglot-language-binding-for-cuda-in-graalvm/
GrCUDA seems promising, but it would only allow interoperability with Java on the CPU, not offloading Java to the GPU, right?
Such advances would make GPU computing orders of magnitude more developer-friendly, and therefore much more mainstream.