TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Liquid: Language models are scalable and unified multi-modal generators

82 点作者 pr337h4m27 天前

5 条评论

Centigonal27 天前
I love the website for this paper! Each section asks a question, and immediately answers it with a figure and a few sentences of discussion. It's less tech-demo heavy than a lot of other paper websites (those are cool, too, in their own way), and instead focuses on characterizing multimodal model behavior in a nice, clean, disciplined way.
gwern27 天前
&gt; For the first time, Liquid uncovers a scaling law that performance drop unavoidably brought by the unified training of visual and language tasks diminishes as the model size increases...No prior work has explored whether LLMs retain the power-law scaling laws observed in language tasks when extended to visual generation tasks. We prove this alignment and further show that vision can be effectively learned by LLMs as a form of language.<p>Does this really show much that <a href="https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2301.03728#facebook" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2301.03728#facebook</a> (uncited) and other earlier work did not?
swyx27 天前
hmm this is a tough name - conflicts with Liquid AI <a href="https:&#x2F;&#x2F;hn.algolia.com&#x2F;?dateRange=all&amp;page=0&amp;prefix=true&amp;query=liquid%20ai%20models&amp;sort=byPopularity&amp;type=story" rel="nofollow">https:&#x2F;&#x2F;hn.algolia.com&#x2F;?dateRange=all&amp;page=0&amp;prefix=true&amp;que...</a>
评论 #43702454 未加载
Nijikokun27 天前
it performs well with composition, however it seems SD and SDXL excels in capability and quality when intermixed with pipelines and workflows, this doesn&#x27;t do much to talk about that comparison and whenever i see things like this i think about the overall workflow, like cool you do good composition but you don&#x27;t fit within the workflow or ecosystem that surrounds that tool and thus i have low expectations around adoption
marviel27 天前
The Synesthesia these models must experience has gotta be intense
评论 #43700727 未加载