TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Talaria: Interactively Optimizing Machine Learning Models for Efficient Inferenc

41 pointsby quantisan8 months ago

4 comments

jgoertler8 months ago
Hi, I’m Jochen, one of the authors.<p>We recently did a Show HN (<a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=41463916">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=41463916</a>) which did not get much traction, so I’m posting this again here:<p>We just released Mycelium, the library that powers Talaria’s graph viewer. You can check it out and play around with it here: <a href="https:&#x2F;&#x2F;apple.github.io&#x2F;ml-mycelium" rel="nofollow">https:&#x2F;&#x2F;apple.github.io&#x2F;ml-mycelium</a><p>I’m happy to answer any questions about Talaria or Mycelium!
SaBaAg8 months ago
Are inference metrics like latency and power measured live from device? To which devices can Talaria be applied?
efnx8 months ago
How does this compare to TVM?
评论 #41499425 未加载
bobosha8 months ago
Could you give us a tl;dr on this project? and how could I use something like this work for on-device applications, think &quot;smart home&quot; style applications?