TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: I built a full mulimodal LLM by merging multiple models into one

40 pointsby yoeven4 months ago

4 comments

kouteiheika4 months ago
I clicked expecting a single full multimodal LLM made by merging multiple existing models into one like the title suggests (which sounds very interesting), and I found... a library which is an LLM router&#x2F;calls a bunch of LLM web APIs and exposes that under a unified&#x2F;easy to use interface?<p>With all due respect, sorry, but this title is very misleading. I&#x27;d expect &quot;build an LLM&quot; to mean, well, actually building an LLM, and while it&#x27;s a very nice library it&#x27;s definitely <i>not</i> what the title suggests.
评论 #42908531 未加载
评论 #42908415 未加载
评论 #42908476 未加载
upghost4 months ago
I&#x27;ll jump in before the haterade engine wakes up -- great bit of engineering work here! I can&#x27;t imagine a better level of abstracting away the unnecessary stuff while still retaining that level of manual control.<p>The only thing I don&#x27;t see is setup for local&#x2F;in-house LLMs, but it&#x27;s easy enough to spoof OpenAI calls if necessary.
评论 #42908368 未加载
Peretus4 months ago
Whoa, great to see Yoeven&#x27;s work here. I learned about JigsawStack when I applied for a role there and was super impressed with what he&#x27;s built. We ended up having a call and he was able to tell me a bit more about what he&#x27;s working on.<p>He is a friendly and super down-to-earth guy who has made some remarkably good progress on building a platform that just works. For instance, easily connecting a fine-tuned LLM that knows how to scrape content to a translation LLM and wrapping that up in a platform with a really good developer experience.<p>If you&#x27;re interested in kind of thing, he also did a ShowHN last year on Dzero, a distributed SQLite database built on Cloudflare D1: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40563729">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40563729</a>
FilipSivak4 months ago
You clearly don&#x27;t understand what multimodal means. Multimodal is for example new gemini where you can input green car and get the very same car, only with red paint. Multimodal LLM can do the edit in the latent space, which is the key.<p>Very misleading title, and you won&#x27;t get away with it by using word &quot;mulimodal&quot; either.
评论 #42909318 未加载