TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Home
PowerInfer-2: Fast Large Language Model Inference on a Smartphone
1 points
by
27theo
11 months ago
1 comment
27theo
11 months ago
Arxiv Paper: <a href="https://arxiv.org/abs/2406.06282" rel="nofollow">https://arxiv.org/abs/2406.06282</a><p>Previous submission (paper link submitted): <a href="https://news.ycombinator.com/item?id=40646450">https://news.ycombinator.com/item?id=40646450</a>