TechEcho

13 comments

Animats over 1 year ago

Extreme compression will be when you put in a movie and get a SORA prompt back that regenerates something close enough to the movie.

ToJans over 1 year ago

Ahhh, Sloot's digital coding system [1] is finally here ;).

[1] https://en.m.wikipedia.org/wiki/Sloot_Digital_Coding_System

userbinator over 1 year ago

How fast is this and how big is the decoder/encoder? The model weights are not accessible.

From the description, it looks like it's only being tested with 128x128 frames, which implies that the speed is very low.

IshKebab over 1 year ago

> It can be observed that our model outperforms them at low bitrates

It can? Maybe I'm misunderstanding the graphs, but it doesn't look like it to me?

holoduke over 1 year ago

Back in 2005 there was a colleague at my first job writing video format converter software. He was considered a genius and the stereotype of an introverted software developer. He claimed that one day an entire movie could be compressed onto a single floppy disk. Everybody laughed and thought he was weird. He might be right after all.

resolutebat over 1 year ago

Here's the research behind this: https://arxiv.org/html/2402.08934v1

As a casual non-scholar, non-AI person trying to parse this, though, it's infuriatingly convoluted. I was expecting a table of "given source file X, we got file size Y with quality loss Z", but while quality (SSIM/LPIPS) is compared to standard codecs like H.264, for the life of me I can't find any measure of how efficient the compression is here.

Applying AI to image compression has been tried before though, with distinctly mediocre results: some may recall the Xerox debacle about 10 years ago, when it turned out copiers were helpfully "optimizing" images by replacing digits with others in invoices, architectural drawings, etc.

https://www.theverge.com/2013/8/6/4594482/xerox-copiers-randomly-replacing-numbers-in-documents

sbalamurugan over 1 year ago

It’s uncanny how much of the current stuff has been predicted by the sitcom “Silicon Valley”.

LeoPanthera over 1 year ago

It's important to remember that any compression gains must include the size of the decompressor which, I assume, will include an enormous diffusion model.
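
A rough sketch of the arithmetic behind this point, in Python. Everything here is an assumption for illustration: the function name `effective_ratio` and all the sizes are hypothetical, and none of the numbers come from the paper. The idea is only that the decoder's size is paid once and amortized over however many videos it decodes.

```python
def effective_ratio(raw_bytes, compressed_bytes, decoder_bytes, num_videos=1):
    """Compression ratio when the decoder's size counts against the savings.

    Hypothetical helper: assumes every video has the same raw and compressed
    size, and that one copy of the decoder serves all of them.
    """
    if num_videos < 1:
        raise ValueError("num_videos must be at least 1")
    total_raw = raw_bytes * num_videos
    # The decoder is stored once, no matter how many videos it decodes.
    total_stored = compressed_bytes * num_videos + decoder_bytes
    return total_raw / total_stored


if __name__ == "__main__":
    # Made-up sizes: a 1 GB raw clip, a 1 MB compressed stream, a 4 GB model.
    raw, compressed, decoder = 10**9, 10**6, 4 * 10**9
    for n in (1, 100, 10_000):
        print(n, round(effective_ratio(raw, compressed, decoder, n), 2))
```

With a single video the stored total is dominated by the decoder, so the ratio can fall below 1; as the number of videos grows, the ratio approaches the plain raw-to-compressed ratio.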

smerik over 1 year ago

Does anyone remember the Sloot Digital Coding System (https://en.wikipedia.org/wiki/Sloot_Digital_Coding_System)?

zaptrem over 1 year ago

Can you share example videos?

hulitu over 1 year ago

> Extreme video compression with prediction using pre-trained diffusion models

Is this more extreme than YouTube?

mjevans over 1 year ago

I wonder how a speed-focused variation would compare on quality against H.264, H.265, and AV1.

hoseja over 1 year ago

Middle-out.