Even as an ML-focused graphics-less GPU, this is great. If this can be prototyped on an FPGA, it would be even better. Using block RAM for shared memory and built-in PCIe and DDR IP blocks should help speed things up considerably.<p>It unfortunately wouldn't be very cost-effective for training ML models, but it would take things a step closer to actual tape-out (if some organization has the $$$ for it).
Worth noting this is targeting ML applications, so I don't think you'll be able to display even a text console with it for the foreseeable future.<p>But I love that this is even in the realm of possibility! There's no reason we couldn't, in principle, have a small open-source GPU taping out on the free Skywater shuttle, and I am here for it!
Perhaps also see the (OpenPOWER-based) Libre-SOC effort <a href="https://libre-soc.org/" rel="nofollow">https://libre-soc.org/</a>
> Internal GPU Core ISA loosely compliant with RISC-V ISA. Where RISC-V conflicts with designing for a GPU setting, we break with RISC-V.<p>Very amateur question: I thought RISC-V added the vector extension (RVV) precisely so you could use it directly for GPU/TPU-style chips without fragmenting the ecosystem?
Neat! Looks like it's very much in its early stages (no concurrent execution/threads yet), but it's great to see FOSS digital design work in an industry dominated by huge players.
I'm not really optimistic about the hardware side or the tape-out goal. The author seems to have only a basic grasp of it.<p>For instance, the integer multiplier design is overengineered yet naive, and far from the state of the art (no pipelining, no adder compression). I would suggest the author look into Wallace tree multipliers.<p>At this stage, though, it would be preferable to use the native Verilog multiply operator or a DSP macro when targeting an FPGA for prototyping, and to focus on the SIMT architecture and the pipelining. Arithmetic unit design is a science in its own right.<p>Still, it's a beautiful project!
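For anyone unfamiliar with the idea, here's a rough Python model of the Wallace tree trick (purely illustrative; real designs are RTL, and the function name and bit width here are my own):

```python
def wallace_multiply(a: int, b: int, bits: int = 8) -> int:
    """Toy model of Wallace-tree multiplication (illustrative only)."""
    # One shifted partial-product row per set bit of b.
    rows = [a << i for i in range(bits) if (b >> i) & 1]
    # Carry-save reduction: each 3:2 compressor (a full adder applied
    # bitwise) turns three rows into a sum row and a carry row with no
    # carry propagation, so every layer stays shallow and fast in hardware.
    while len(rows) > 2:
        groups = len(rows) // 3 * 3
        next_rows = []
        for i in range(0, groups, 3):
            x, y, z = rows[i], rows[i + 1], rows[i + 2]
            next_rows.append(x ^ y ^ z)                           # sum bits
            next_rows.append(((x & y) | (y & z) | (x & z)) << 1)  # carry bits
        next_rows.extend(rows[groups:])  # leftover rows pass through
        rows = next_rows
    # A single carry-propagate add finishes the multiply.
    return sum(rows)
```

The hardware win is that the reduction layers have no carry chains at all; only the final two-row add needs a fast carry-propagate adder.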
I wonder whether it would be a good idea to implement a Vulkan driver for such a GPU by emulating TMUs and ROPs in software. It might not even matter much, since modern rendering pipelines are increasingly compute-reliant anyway (UE5's Nanite barely uses the hardware rasterizer, and the latest idTech uses software rasterization as well). The only problem I see is with raytracing, since it is quite reliant on fixed-function units.
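To make the compute-rasterization point concrete, here's a minimal edge-function rasterizer sketch in Python. It's the kind of kernel a compute-only GPU could run in place of fixed-function hardware (the triangle, grid size, and function names are made up for illustration):

```python
def edge(ax, ay, bx, by, px, py):
    # Signed-area test: >= 0 means (px, py) is on the inside of edge a->b
    # for a counterclockwise-wound triangle.
    return (bx - ax) * (py - ay) - (by - ay) * (px - ax)

def rasterize(tri, width=8, height=8):
    """Return the pixels covered by a counterclockwise triangle."""
    (ax, ay), (bx, by), (cx, cy) = tri
    covered = []
    for y in range(height):
        for x in range(width):
            px, py = x + 0.5, y + 0.5  # sample at pixel centers
            if (edge(ax, ay, bx, by, px, py) >= 0 and
                edge(bx, by, cx, cy, px, py) >= 0 and
                edge(cx, cy, ax, ay, px, py) >= 0):
                covered.append((x, y))
    return covered
```

Per-pixel coverage tests like this are embarrassingly parallel, which is exactly why Nanite-style pipelines can afford to do them in compute shaders.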
From the planning document:<p>> Branching: Done<p>> Single instruction multiple thread (SIMT): Planned<p>I guess we should be supportive, and it is impressive how far they got on the software side, but boy, is the author in for a surprise.
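For context on why the jump from plain branching to SIMT is so big: once the lanes of a warp diverge at a branch, the hardware has to execute both paths under an active-lane mask and then reconverge, rather than just redirecting a single program counter. A toy Python sketch of the idea (the names and example kernel are hypothetical):

```python
def simt_branch(values):
    """Each lane computes x*2 if x is even else x+1, SIMT-style."""
    mask = [x % 2 == 0 for x in values]  # per-lane branch outcome
    results = list(values)
    # Pass 1: run the "even" path with the other lanes masked off.
    for lane, active in enumerate(mask):
        if active:
            results[lane] = values[lane] * 2
    # Pass 2: run the "odd" path with the first group masked off.
    for lane, active in enumerate(mask):
        if not active:
            results[lane] = values[lane] + 1
    # Reconvergence point: all lanes active again.
    return results
```

Getting the mask stack and reconvergence points right (especially with nested branches and loops) is where most of the hardware complexity hides.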