TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Home
Running LLMs with 3.3M Context Tokens on a Single GPU
14 points
by
Van_Chopiszt
7 months ago
1 comment
charlie_xxx
7 months ago
Their demo looks really cool: <a href="https://github.com/mit-han-lab/duo-attention">https://github.com/mit-han-lab/duo-attention</a>