JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

5 points by hardmaru 9 months ago

1 comment

magicalhippo 9 months ago

Initially I was assuming they were not including the Huffman encoding step, but no:

"The bytes in the files do not have consistent meanings and would depend on their context and the implicit Huffman tables. [...]"

"However, we observe that conventional, vanilla language modeling surprisingly conquers these challenges without special designs as training goes (e.g., JPEG-LM generates realistic images barely with any corrupted JPEG patches)."

That surprised me, but then I'm not in the field.