Triton Fork for Windows Support

21 points by lnyan 7 months ago

3 comments

Scene_Cast2 7 months ago
This is pretty great. PyTorch uses Triton as the backend for torch.compile (the big feature of PyTorch 2.0, and the necessary part for making Flex Attention in the about-to-be-released 2.5 usably fast).

Triton's team doesn't support Windows and, worse yet, does not accept community PRs to enable any sort of support.

Here's the GitHub issue: https://github.com/triton-lang/triton/issues/1640

And here's the performance comparison of Flex Attention with and without torch.compile (tl;dr: it's 3x slower than a standard MHA when not compiled): https://github.com/rasbt/LLMs-from-scratch/blob/76e9a9ec02a1a060aac61608598fdd50cc7d52bd/ch03/02_bonus_efficient-multihead-attention/mha-implementations.ipynb

EDIT: after taking a look at the repo, the only thing changed in the "46 commits ahead of [official triton]" is the README. Somewhat sketchy.
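For context on why the uncompiled path is so much slower: eager-mode attention materializes the full (seq, seq) score matrix before applying any mask or modification, while torch.compile can fuse the modification into a single kernel. A minimal NumPy sketch of that eager pattern (this is not PyTorch's actual FlexAttention API; `attention`, `causal`, and `score_mod` are illustrative names):

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v, score_mod=None):
    """Eager scaled dot-product attention with an optional score hook.

    The full (seq, seq) score matrix is materialized in memory before
    score_mod runs -- the cost a compiled, fused kernel avoids.
    """
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)   # (seq, seq) materialized
    if score_mod is not None:
        scores = score_mod(scores)
    return softmax(scores) @ v

def causal(scores):
    # Causal masking expressed as a score modification.
    seq = scores.shape[-1]
    mask = np.tril(np.ones((seq, seq), dtype=bool))
    return np.where(mask, scores, -np.inf)

rng = np.random.default_rng(0)
q = rng.standard_normal((4, 8))
k = rng.standard_normal((4, 8))
v = rng.standard_normal((4, 8))
out = attention(q, k, v, score_mod=causal)
# Position 0 can only attend to itself, so its output equals v[0].
assert np.allclose(out[0], v[0])
```

The sketch only shows the shape of the computation; the real FlexAttention additionally handles batching, heads, and block-sparse masks.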
lostmsu 7 months ago
Microsoft should help this project set up CI infrastructure with the necessary GPUs.
yjftsjthsd-h 7 months ago
"Triton" here is apparently a programming language, which upstream describes as:

> This is the development repository of Triton, a language and compiler for writing highly efficient custom Deep-Learning primitives. The aim of Triton is to provide an open-source environment to write fast code at higher productivity than CUDA, but also with higher flexibility than other existing DSLs.

So if you clicked in expecting the illumos-based virtualization platform, this isn't that. Though

> This is the basis for torchao, which crucially changes some large models from "can't run" to "can run" on consumer GPUs. That's easier than supporting them in other quantization frameworks, or letting the consumers use Linux or WSL

does sound neat on its own merits.
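For readers new to this Triton: its programming model is SPMD over blocks, where each "program" instance (identified by `tl.program_id`) loads, computes, and stores one tile of the data. A GPU-free sketch of that model using plain NumPy in place of `triton.language` (`add_kernel`, `launch`, and `BLOCK` are illustrative names, not Triton's API):

```python
import numpy as np

BLOCK = 4  # elements handled by each program instance

def add_kernel(x, y, out, pid):
    """One Triton-style program instance operating on a block.

    In real Triton this body would be decorated with @triton.jit and use
    tl.program_id / tl.load / tl.store; NumPy indexing stands in here so
    the model can be shown without a GPU.
    """
    offs = pid * BLOCK + np.arange(BLOCK)
    mask = offs < len(x)                 # guard the ragged last block
    idx = offs[mask]
    out[idx] = x[idx] + y[idx]

def launch(x, y):
    # The "grid": one program per block, as in kernel[(grid,)](...) in Triton.
    out = np.empty_like(x)
    grid = -(-len(x) // BLOCK)           # ceiling division
    for pid in range(grid):
        add_kernel(x, y, out, pid)
    return out

x = np.arange(10.0)
y = np.arange(10.0)
assert np.array_equal(launch(x, y), x + y)
```

On a GPU the program instances run in parallel rather than in a Python loop, and the mask is what makes sizes that aren't a multiple of the block size safe.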