Video Vectorization

319 点作者 xanthine将近 5 年前

33 条评论

dharma1将近 5 年前

This why Flash was such a nice format for some things (not going into the pitfalls of Flash - but in some ways it was fantastic).I guess these days we have animated SVG, and <a href="https://lottiefiles.com/" rel="nofollow">https://lottiefiles.com/</a> is getting some traction - but these require you to export in a specific format of course, you can't just convert/trace a bitmap movie with these. And SVG or Lottie aren't designed for longer/streaming vector animations, and they don't carry synchronised or streaming audio - Flash did all of those things.Vectorisation of bitmap images does have some artifacts, as is evident in the Simpsons demo on this website - when possible, you should export in a vector movie format directly from the vector animation software.It is kind of depressing that we don't have an open standard for vector movies (with sound), over a decade after Flash was killed. Sometimes it feels like technology stops or moves backwards.

评论 #23900199 未加载

评论 #23901478 未加载

teddyh将近 5 年前

Isn’t this basically the same thing which was famously used to achieve full motion video in the 1992 Amiga demo “State of the Art”¹ and improved one year later in the followup “9 Fingers”²?1. <a href="https://www.pouet.net/prod.php?which=99" rel="nofollow">https://www.pouet.net/prod.php?which=99</a> <a href="https://www.youtube.com/watch?v=J2r7-ygXOzo" rel="nofollow">https://www.youtube.com/watch?v=J2r7-ygXOzo</a>2. <a href="https://www.pouet.net/prod.php?which=100" rel="nofollow">https://www.pouet.net/prod.php?which=100</a> <a href="https://www.youtube.com/watch?v=tGetanBEKK8" rel="nofollow">https://www.youtube.com/watch?v=tGetanBEKK8</a>

评论 #23895894 未加载

评论 #23895753 未加载

评论 #23896040 未加载

评论 #23895781 未加载

emmanueloga_将近 5 年前

It just boggles my mind that the front page of a page that offers a "patented vector-transcoder converts video to a vector format, reducing bitrates" doesn't include such video on its front page.

评论 #23895807 未加载

评论 #23896324 未加载

black_puppydog将近 5 年前

This reminds me of a paper from 2005 by Daniel Sýkora et al. [1] which tries something very similar, with the specific use case of animation video. The authors describe it best in the abstract I think:> Video Codec for Classical Cartoon Animations with Hardware Accelerated Playback> We introduce a novel approach to video compression which is suitable for traditional outline-based cartoon animations. In this case the dynamic foreground consists of several homogeneous regions and the background is static textural image. For this drawing style we show how to recover hybrid representation where the background is stored as a single bitmap and the foreground as a sequence of vector images.The idea of using prior knowledge about the nature of the content to decide on an encoding scheme makes intuitive sense to me, though I'm not a codec person so I don't know how feasible it would be to make these ideas into the hardware-accelerated codecs we know from other methods.Of course, these methods would make the most sense when used directly by the animation studios during export, not as an afterthought. But I'll take what we can get.By the way, the corpus of Sýkora's works [2] is really really impressive in my opinion. He gave a talk in my institute while I was researching methods around neural style, and his take on parametric models, paired with the quality (and speed!) of his results, really left a mark, if not to say they made me seriously question wtf I was doing there. His work is strictly tailored to a professional animation / video production setting, so it seems extremely applicable compared to the toy-like nature of neural style methods. That is not to say he doesn't know about those. His team's recent papers actually fruitfully combine the two.[1]: <a href="https://link.springer.com/chapter/10.1007/11595755_6" rel="nofollow">https://link.springer.com/chapter/10.1007/11595755_6</a>[2]: <a href="https://dcgi.fel.cvut.cz/home/sykorad/" rel="nofollow">https://dcgi.fel.cvut.cz/home/sykorad/</a>

评论 #23897107 未加载

gardaani将近 5 年前

Domain specific video formats could be the future. They can reduce bandwidth requirements and offer features not available in pixel videos.One good example is <a href="https://asciinema.org" rel="nofollow">https://asciinema.org</a> , which plays back terminal sessions. The text in the "video" is selectable!

评论 #23900148 未加载

bkm将近 5 年前

One overlooked usecase is RDP and VNC-like concepts. Right now Appetize (App streaming service) uses raster graphics. Vector would allow high-FPS streaming of Apps, making running Apps in the cloud a realistic option.

评论 #23896126 未加载

评论 #23895834 未加载

评论 #23895854 未加载

londons_explore将近 5 年前

Is there market demand for this?In a world where videos are still sent around as multi-megabyte gif files, and audio clips are still distributed with a random slideshow on YouTube, I think a lot of users aren't so bothered about efficiency - they just want the simplest thing that works.Lots of music is most efficiently stored as MIDI, yet how many songs on iTunes are midi?For video, raster is king because it works for everything.

评论 #23897148 未加载

评论 #23896338 未加载

评论 #23895811 未加载

评论 #23896023 未加载

评论 #23902506 未加载

评论 #23895547 未加载

评论 #23895525 未加载

评论 #23895794 未加载

评论 #23895616 未加载

royjacobs将近 5 年前

Reminds me of the Spaceballs demos State of the Art and Nine Fingers [1]. Released in 1993 iirc.[1] <a href="https://www.youtube.com/watch?v=PPoYzwib7JQ" rel="nofollow">https://www.youtube.com/watch?v=PPoYzwib7JQ</a>

acd将近 5 年前

I think your technology would be useful for restoring old animated videos. Plus it would be useful for The catchy intro animations used on startups to demonstrate their technology.Also would be nice to use when you have bad internet connection speed to watch e-learning material animations.For e-learning you may need a hybrid M3u like Playlist approach with video for the presenter and vector graphics for the screen casts.Manga videos would probably also compress well.Children animated videos.What if you would reduce the color space vectorize of ordinary video to for example 8 colors and smooth out the noise to make large flat surfaces could you compress it with vectors?

评论 #23895867 未加载

laserpistus将近 5 年前

We are using a similar concept in production. Rendered videos for online education within programming. It is a limited usecase of course, but brings the same benefits: 1/100 bandwidth; crisp rendered text/content instead of rasterized; Easy editing; and the content is searchable / indexable.Using html gives us some additional benefits in that we can combine rendered with rasterized content. We also get access to a lot more advanced functionality through the browser context.

评论 #23902424 未加载

vhiremath4将近 5 年前

Woah this is incredible. Even with the increased CPU demands, this is a very large bandwidth savings.>In practice however, DRM, streaming , analytics and ad placement also require javascript logic to function in web runtimes, so in real word settings web-video playback can and does use a non-trivial amount of CPU time.I'm a little skeptical of this claim. No idea how much CPU is used for DRM, but I can't imagine it's on the order of multiple percentage points.

评论 #23895600 未加载

评论 #23897416 未加载

评论 #23895982 未加载

gruez将近 5 年前

>Our first vectorized proof of concept for animations is a 17 second clip of the Simpsons located here. Keep in mind, our technology is still at a very early stage, and this is much optimization work left to be done.><a href="https://files.vectorly.io/demo/v0-2-simpsons-250kbps/index.html" rel="nofollow">https://files.vectorly.io/demo/v0-2-simpsons-250kbps/index.h...</a>There isn't a raster version to compare to, but that looks noticeably worse than what I'd expect from a raster version. There's a lot of artifacting when there's motion, and the linework looks.... off.><a href="https://files.vectorly.io/demo/khan-20kbps/index.html" rel="nofollow">https://files.vectorly.io/demo/khan-20kbps/index.html</a>The khan academy one looks much better, although there's still some minor artifacting, eg. when the mouse comes close to the "O" in "O_2" changes a bit.

评论 #23898121 未加载

评论 #23899539 未加载

评论 #23898084 未加载

fefe23将近 5 年前

AFAIK MPEG-4 experimented with encoding 3d objects but it never took off. As usual for MPEG they did not specify how to get the 3d data from a scene but how to encode them, actually how to decode them, so that innovation could happen on the encoding side.The idea is so obvious that I would be astounded if this company gets anywhere. I'd wager many research teams already attempted this and were never heard from again.Also note that video compression is pretty impressive these days. A typical 2 hour 1080p movie compresses down to a handful of GiB. Compare that to a typical 1080p action game which is easily ten times that big, because storing all the meshes and textures takes a lot of space, it turns out.

评论 #23899895 未加载

cphoover将近 5 年前

Why use computer vision when likely most of these animations come from some software that can/could output a vectorized video format?You wouldn't have any conversion artifacts that way.

评论 #23897960 未加载

katmannthree将近 5 年前

It sounds cool but given that the demo looks like this [0] on my bog-standard Windows 10 PC with Chrome (and Firefox and Edge) I'm assuming they've still got some bugs to work out... If it's working for anyone here I'd love to see a screen capture of the proper rendering.[o]: <a href="https://i.imgur.com/YO42u2C.png" rel="nofollow">https://i.imgur.com/YO42u2C.png</a>

评论 #23895598 未加载

评论 #23897277 未加载

评论 #23896244 未加载

评论 #23898178 未加载

wodenokoto将近 5 年前

As a kid, I used to trace frames from the Simpsons in Macromedia Flash (later Adobe, later discontinued) as a way of creating high-res images, so reading this article really hit home for me!While the output of this algorithm (just like my traces) isn't as faithful to the source material as, say, H.264 is, the result looks great and has an amazing style.This might be a great target for mobile-first webtoons.

villgax将近 5 年前

Even their job postings on LinkedIn look like somebody anger-wrote it. For an Image Processing role, they've written no ML/DL engineers which is obvious but still, as if Computer Vision isn't any way linked to whatever it is they are trying to do with compression.

fredley将近 5 年前

I actually love the slightly 'off' aesthetic on the simpsons video, and I think there's some interesting creative space where this algorithm is deliberately de-tuned for interesting results.

okaleniuk将近 5 年前

It's awesome, but it also brings back memories of the flash-animation.

评论 #23896408 未加载

zcw100将近 5 年前

Sounds like <a href="https://github.com/fogleman/primitive" rel="nofollow">https://github.com/fogleman/primitive</a> but for video

sneak将近 5 年前

Patenting algorithms is supposed to be impossible.Companies that ignore this and patent the “system and method” for implementing algorithms are being jerks.

评论 #23896547 未加载

rhn_mk1将近 5 年前

Maybe this finally brings back the crispy scaling quality of Flash videos. They seem to be released in lossy raster formats these days.

tabtab将近 5 年前

If the vectors also relatively smoothly morph in time, then monitor-side interpolation (including motion smoothing) wouldn't be needed and directors would have more control over how interpolation is done. They've complained about monitors trying to do too much. It seems pixels are becoming obsolete for video and movies.

karteum将近 5 年前

In terms of stream format, it seems that BIFS / MPEG 4 part 11 originaly aimed at the same purpose (probably in a more efficient manner than textual SVG), isn't it ? <a href="https://en.wikipedia.org/wiki/MPEG-4_Part_11" rel="nofollow">https://en.wikipedia.org/wiki/MPEG-4_Part_11</a>

chmod775将近 5 年前

While this is pretty cool, this naive approach will fail spectacularly for animation that isn't just vector graphics, which is most animation.This might have a future as part of a regular video codec, being used when there's mostly vector graphics on screen (or just for those areas that are vector graphics).

bane将近 5 年前

I imagine is a posterization preprocessing step would make this simpler and we could have very low bandwidth "video". If this could be done in real time, it would dramatically lower the bandwidth required for two-way video chat.

xanthine将近 5 年前

Their android SDK (no release version yet) is available at their Github repo, and so are their bulk upload tools (for talking to their servers using, I guess, a pay to use API).

nautical将近 5 年前

If I understand this correctly, will it also allow to send updates to video on the fly ? Example : Change video from X to Y time to new vectors [V1 ..] ?

villgax将近 5 年前

Not ready yet, just hyping SVG videos for now

vslira将近 5 年前

That's really interesting, excited to follow the project and see more!

jcims将近 5 年前

Curious how ML would train on vectorized video instead of rasterized.

imtringued将近 5 年前

Can this technique be used to generate a CAD sketch from a photograph?

评论 #23896606 未加载

iworkfromhome将近 5 年前

This is really cool. So the future video inspiration that will be built using only the code, without shooting again. Because everything can only be made with code. Cool!

33 条评论

dharma1将近 5 年前

评论 #23900199 未加载

评论 #23901478 未加载

teddyh将近 5 年前

评论 #23895894 未加载

评论 #23895753 未加载

评论 #23896040 未加载

评论 #23895781 未加载

emmanueloga_将近 5 年前

It just boggles my mind that the front page of a page that offers a "patented vector-transcoder converts video to a vector format, reducing bitrates" doesn't include such video on its front page.

评论 #23895807 未加载

评论 #23896324 未加载

black_puppydog将近 5 年前

评论 #23897107 未加载

gardaani将近 5 年前

评论 #23900148 未加载

bkm将近 5 年前

评论 #23896126 未加载

评论 #23895834 未加载

评论 #23895854 未加载

londons_explore将近 5 年前

评论 #23897148 未加载

评论 #23896338 未加载

评论 #23895811 未加载

评论 #23896023 未加载

评论 #23902506 未加载

评论 #23895547 未加载

评论 #23895525 未加载

评论 #23895794 未加载

评论 #23895616 未加载

royjacobs将近 5 年前

acd将近 5 年前

评论 #23895867 未加载

laserpistus将近 5 年前

评论 #23902424 未加载

vhiremath4将近 5 年前

评论 #23895600 未加载

评论 #23897416 未加载

评论 #23895982 未加载

gruez将近 5 年前

评论 #23898121 未加载

评论 #23899539 未加载

评论 #23898084 未加载

fefe23将近 5 年前

评论 #23899895 未加载

cphoover将近 5 年前

Why use computer vision when likely most of these animations come from some software that can/could output a vectorized video format?You wouldn't have any conversion artifacts that way.

评论 #23897960 未加载

katmannthree将近 5 年前

评论 #23895598 未加载

评论 #23897277 未加载

评论 #23896244 未加载

评论 #23898178 未加载

wodenokoto将近 5 年前

villgax将近 5 年前

fredley将近 5 年前

I actually love the slightly 'off' aesthetic on the simpsons video, and I think there's some interesting creative space where this algorithm is deliberately de-tuned for interesting results.

okaleniuk将近 5 年前

It's awesome, but it also brings back memories of the flash-animation.

评论 #23896408 未加载

zcw100将近 5 年前

Sounds like <a href="https://github.com/fogleman/primitive" rel="nofollow">https://github.com/fogleman/primitive</a> but for video

sneak将近 5 年前

Patenting algorithms is supposed to be impossible.Companies that ignore this and patent the “system and method” for implementing algorithms are being jerks.

评论 #23896547 未加载

rhn_mk1将近 5 年前

Maybe this finally brings back the crispy scaling quality of Flash videos. They seem to be released in lossy raster formats these days.

tabtab将近 5 年前

karteum将近 5 年前

chmod775将近 5 年前

bane将近 5 年前

xanthine将近 5 年前

Their android SDK (no release version yet) is available at their Github repo, and so are their bulk upload tools (for talking to their servers using, I guess, a pay to use API).

nautical将近 5 年前

If I understand this correctly, will it also allow to send updates to video on the fly ? Example : Change video from X to Y time to new vectors [V1 ..] ?

villgax将近 5 年前

Not ready yet, just hyping SVG videos for now

vslira将近 5 年前

That's really interesting, excited to follow the project and see more!

jcims将近 5 年前

Curious how ML would train on vectorized video instead of rasterized.

imtringued将近 5 年前

Can this technique be used to generate a CAD sketch from a photograph?

评论 #23896606 未加载

iworkfromhome将近 5 年前

This is really cool. So the future video inspiration that will be built using only the code, without shooting again. Because everything can only be made with code. Cool!