"Triton" here is apparently a programming language, which upstream describes as<p>> This is the development repository of Triton, a language and compiler for writing highly efficient custom Deep-Learning primitives. The aim of Triton is to provide an open-source environment to write fast code at higher productivity than CUDA, but also with higher flexibility than other existing DSLs.<p>So if you clicked in expecting the illumos-based virtualization platform, this isn't that. That said, this part<p>> This is the basis for torchao, which crucially changes some large models from "can't run" to "can run" on consumer GPUs. That's easier than supporting them in other quantization frameworks, or letting the consumers use Linux or WSL<p>does sound neat on its own merits.