Some context for those who aren't in the loop: ONNX is an open, standardized format for AI models, and ONNX Runtime (<a href="https://onnxruntime.ai/" rel="nofollow">https://onnxruntime.ai/</a>) is the cross-platform engine for running them. Nowadays it's extremely easy to export models to ONNX, especially language models, since tools like Hugging Face transformers have dedicated export workflows for it.<p>ONNX Runtime support in the browser was lacking and limited to the CPU, but with a WebGPU backend it may now finally be feasible to run models in the browser on a GPU, which opens up interesting opportunities. That said, from this PR it looks like only a few operators are implemented so far, so no browser-based GPT yet.
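To make that concrete, here's a minimal sketch of what running a model through onnxruntime-web looks like once you request the WebGPU execution provider. The model path, the 'input'/'output' names, and the shape below are placeholder assumptions, and whether WebGPU actually kicks in depends on which operators the backend has implemented:<p>
    import * as ort from 'onnxruntime-web';

    // Ask for the WebGPU execution provider, falling back to the
    // WASM (CPU) backend where the browser or model isn't supported.
    const session = await ort.InferenceSession.create('model.onnx', {
      executionProviders: ['webgpu', 'wasm'],
    });

    // 'input' and the [1, 3, 224, 224] shape are placeholders -- use
    // whatever names and dims your exported model actually declares.
    const data = new Float32Array(1 * 3 * 224 * 224);
    const feeds = { input: new ort.Tensor('float32', data, [1, 3, 224, 224]) };

    const results = await session.run(feeds);
    console.log(results.output.data);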
A great option, but there's also wonnx, which seems more complete and mature.
And as a bonus, it's implemented in Rust (if you're into that).<p><a href="https://github.com/webonnx/wonnx">https://github.com/webonnx/wonnx</a>
A pretty cool library that uses ONNX is transformers.js [1], and they're already working on adding WebGPU support [2].<p>[1] <a href="https://xenova.github.io/transformers.js/" rel="nofollow">https://xenova.github.io/transformers.js/</a><p>[2] <a href="https://twitter.com/xenovacom/status/1650634015060156420" rel="nofollow">https://twitter.com/xenovacom/status/1650634015060156420</a>
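For anyone who hasn't tried it, transformers.js boils the whole thing down to a pipeline call (currently on the CPU/WASM backend until that WebGPU work lands; the task and sample output here are just illustrative):<p>
    import { pipeline } from '@xenova/transformers';

    // Downloads an ONNX-converted model from the Hugging Face hub on first use.
    const classifier = await pipeline('sentiment-analysis');

    const result = await classifier('ONNX in the browser is finally practical.');
    console.log(result); // e.g. [{ label: 'POSITIVE', score: 0.99 }]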
As an aside, I love ONNX, and it's the main reason I'm sticking with PyTorch. I was able to develop and train an RL model in Python, then convert it to ONNX and call it from C# production code.<p>It still took a lot of effort, but the final version is very performant and reliable.
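The same pattern works from pretty much any runtime with ONNX Runtime bindings. The parent used C#, but as an illustration of the same portability, here's roughly what the consuming side looks like in Node with onnxruntime-node; the model file and the 'obs'/'action' names and [1, 8] shape are hypothetical and should match whatever you passed to torch.onnx.export on the Python side:<p>
    import * as ort from 'onnxruntime-node';

    // Load the model exported from PyTorch (via torch.onnx.export).
    const session = await ort.InferenceSession.create('policy.onnx');

    // 'obs', 'action', and the [1, 8] shape are made-up example names --
    // they must match the input/output names declared at export time.
    const obs = new ort.Tensor('float32', new Float32Array(8), [1, 8]);
    const { action } = await session.run({ obs });
    console.log(action.data);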