TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Xilinx-Samsung SmartSSD Computational Storage Drive Launched

116 pointsby blopeurover 4 years ago

18 comments

ChuckMcMover 4 years ago
The core concept that you ship computation to data rather than the other way around is what made Google so impressive when it launched. There are lots of algorithms that do well in that model. Back when I was at NetApp I did a design of a system where the &quot;smart storage&quot; essentially labeled blocks with an MD5 hash when you went to store them. That allowed you to rapidly determine if you already had the block stored and could safely toss the one being written[1]. Really fast de-duplication and good storage compression.<p>At Blekko they had taken this concept to the next logical step and built a storage array out of triply replicated blocks (called &#x27;buckets&#x27;) that were distributed by their hashid. You could then write templated perl code that operated in parallel over hundreds (or thousands) of buckets and giving a composite result. It always surprised me that IBM didn&#x27;t care about that system when they acquired Blekko, it was pretty cool. If you implemented it in these Samsung drives it would make for a killer data science appliance. That design almost writes itself.<p>Also in the storage space, there was the CMU &quot;Active disk&quot; architecture[2] which was supposed to replace RAID. There was a startup spin-off from this work but I cannot recall its name anymore, sigh.<p>These days it would useful to design a simulator for systems like this and derive a calculus for analyzing their performance with respect to other architectures. Probably a masters thesis and maybe a PhD or two in that work.<p>[1] Yes MD5 hash collisions are a thing but not for identical length documents (aka an 8K block), and yes NetApp got a patent issued for it.<p>[2] <a href="https:&#x2F;&#x2F;www.pdl.cmu.edu&#x2F;PDL-FTP&#x2F;Active&#x2F;ActiveDisksBerkeley98.pdf" rel="nofollow">https:&#x2F;&#x2F;www.pdl.cmu.edu&#x2F;PDL-FTP&#x2F;Active&#x2F;ActiveDisksBerkeley98...</a>
评论 #25063784 未加载
评论 #25066425 未加载
评论 #25065958 未加载
评论 #25068359 未加载
bob1029over 4 years ago
I think putting something like SQLite on the actual storage device could be a super efficient way to directly express your intent to the actual durable storage system and bypass mountains of virtual bullshit.<p>The optimization opportunities are pretty obvious to me. Imagine if SQLite journaling was aware of how long the supercapacitor in the SSD would last, potentially even with real-time monitoring of device variables. You could have your entire WAL sitting in DRAM on the drive as long as it has enough stored energy to flush to NAND upon external power loss.
评论 #25063635 未加载
评论 #25063291 未加载
mikewarotover 4 years ago
I&#x27;ve been strongly interested in computational fabrics for at least 15 years... this looks interesting, but very, very locked down.<p>It is my understanding that FPGA vendors have fought the open source community every step of the way. I would hate to see the future of computing locked up in a new spiffy prison.
评论 #25063435 未加载
评论 #25062056 未加载
jdsullyover 4 years ago
Storage is starting to get extremely exciting again. The KV SSD&#x27;s, this, and Intel&#x27;s Optane are opening up a lot of new avenues for extremely high performance storage.
评论 #25063587 未加载
评论 #25062127 未加载
generalizationsover 4 years ago
Sorry if I missed it, but I&#x27;m not seeing it: what&#x27;s the bandwidth here? i.e., the time to read, process and write back the whole contents of the disk (using just the FPGA)?
评论 #25066409 未加载
rkagererover 4 years ago
Can anyone quantify the advantages this yields in terms of latency and bandwidth, compared to plugging a regular SSD into an external FPGA (via PCIe or whatever interface)?
dave_4_bagelsover 4 years ago
I&#x27;m still waiting to become skilled enough or end up invested in a project enough to merit dedicated super fast SSD storage, or some kind of exotic storage appliance!
Severianover 4 years ago
So I&#x27;m thinking Deduplication on drive will be the big thing here. Think XFS or ReFS block cloning but without server side processing.
评论 #25063484 未加载
jmpmanover 4 years ago
Too bad IBM killed off Netezza. If the cloud vendors started offering this widely, it would have given them another round of relevance.
foobiekrover 4 years ago
I wonder how hard it would be to port the server-side code of FoundationDB to one of these devices; architecturally FDB seems well suidted to this (at least until predicates show up) as it is already extremely constrained as to the expectations on the storage nodes; they basically provide just (time-bounded) versioned KV access.
Taniwhaover 4 years ago
Looks like this is a Xylinx KU15P - not shabby, but about 1&#x2F;2 the size of the 3-die monstrosities that are in the AWS FPGA instances you can rent for ~$1.50 an hour - so useful for disk stuff closely coupled to the drive, but maybe not as a general compute resource (depending on actual price of course)
gfodyover 4 years ago
They really should extend KVS for this - it&#x27;s going to be very difficult to leverage if the XSS interface is underneath the filesystem (as shown in the diagram) especially for RDBMS where the database is (usually) a single big flat file as far as the filesystem is concerned.
avmichover 4 years ago
Reminds me of Micron&#x27;s Automata Processor: <a href="https:&#x2F;&#x2F;www.cs.virginia.edu&#x2F;~skadron&#x2F;Papers&#x2F;wang_APoverview_CODES16.pdf" rel="nofollow">https:&#x2F;&#x2F;www.cs.virginia.edu&#x2F;~skadron&#x2F;Papers&#x2F;wang_APoverview_...</a>
m3kw9over 4 years ago
So one can look at the chip on beside the storage as an cpu offload built inside the drive, instead of a coprocessor on the motherboard. I’m not seeing a huge use case here except the narrowest of uses like decryption, compression.
评论 #25062015 未加载
评论 #25063574 未加载
评论 #25062624 未加载
mrfusionover 4 years ago
Would anyone want to do an explain-like-I’m-in-high-school on this?
评论 #25063013 未加载
评论 #25062224 未加载
评论 #25062033 未加载
ze_mover 4 years ago
That reminds me of calculating the mandelbrot set on the commodore 1541 floppy drive, back in the days.
sitkackover 4 years ago
This is super exciting. It makes no mention of on device bandwidth.
评论 #25066558 未加载
The_rationalistover 4 years ago
So no mention of syCL support... Only offering ~C in 2020 is an insult to computer science.<p>Unrelated: when will Nvidia allow to seamlessly offload Java or another GC based language to the GPU? <a href="https:&#x2F;&#x2F;developer.nvidia.com&#x2F;blog&#x2F;grcuda-a-polyglot-language-binding-for-cuda-in-graalvm&#x2F;" rel="nofollow">https:&#x2F;&#x2F;developer.nvidia.com&#x2F;blog&#x2F;grcuda-a-polyglot-language...</a> GrCuda seems promising but it would only allow interoperability with Java on the CPU, not offload Java to the GPU, right? Such advances would make gpu computing order of magnitudes more developper friendly and therefore much more mainstream.
评论 #25061709 未加载
评论 #25062258 未加载
评论 #25066991 未加载
评论 #25063550 未加载