TE
科技回声
首页
24小时热榜
最新
最佳
问答
展示
工作
中文
GitHub
Twitter
首页
TensorRT-LLM runtime now open-source
4 点
作者
mmoskal
2 个月前
1 comment
mmoskal
2 个月前
Previously, the "Executor" runtime was shipped as binary blobs. This is the bit that schedules requests and manages KV cache (similar to vLLM or SGLang server).