To me, the key innovation here is the tight integration between network conditions and codec frame size. Standard codecs are configured with a specific bandwidth target and produce encoded frames that 'average' around that size. You <i>could</i> just re-initialize a codec at a lower bandwidth on the fly, but you would have to send an I-frame (a large, full frame) to kick off the new series of frames (as most video frames are just updates of a previous frame). Having a codec accept a bandwidth target per frame is a really good idea.
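To make that concrete, here is a minimal toy sketch (my own illustration, not Salsify's actual API) of the difference between a bitrate fixed at initialization time and a per-frame byte budget handed in by the transport:

```python
# Toy illustration (my own, not Salsify's actual API) of a per-frame byte
# budget: the transport picks the size target for *this* frame from its
# current view of the network, instead of the encoder being locked to one
# bitrate chosen when it was initialized.

class PerFrameEncoder:
    def __init__(self):
        self.quality = 0.8  # inter-frame state carried across calls

    def encode(self, raw_frame: bytes, target_bytes: int) -> bytes:
        # Stand-in for real rate control: back off quality until the
        # compressed frame fits the budget supplied by the transport.
        compressed = raw_frame[: int(len(raw_frame) * self.quality)]
        while len(compressed) > target_bytes and self.quality > 0.1:
            self.quality -= 0.1
            compressed = raw_frame[: int(len(raw_frame) * self.quality)]
        return compressed


encoder = PerFrameEncoder()
raw = bytes(100_000)                             # pretend raw video frame
sent = encoder.encode(raw, target_bytes=20_000)  # budget shrank mid-stream, no I-frame needed
print(len(sent))
```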
I'm fairly sure that Ben Orenstein and a friend are forming a company to commercialise this as a Screenhero replacement. Discussed on this podcast: <a href="http://artofproductpodcast.com/episode-39" rel="nofollow">http://artofproductpodcast.com/episode-39</a><p>Very interested to see what they cook up (and kinda envious I didn't have the idea / don't have the space in my life to have a crack at it myself---it sounds very interesting).
I've been taking the Financial Markets course by Robert Shiller, and when talking about inventions and new ideas he continually makes the point that "it's crazy to me that this didn't exist before". It's usually the sign of a really good invention when you have that thought. And that's the thought I'm having looking at how this combines the codec and transport protocol: "Why hasn't this been done before?" == "This is awesome!"
A bigger frustration I experience is that some streaming seems to just "give up": stalling and never resuming. I know the connection and server are okay because I can usually force it to resume manually, e.g. by doing a page refresh, so is it just bad server architecture or a codec problem?
Um ... from the paper ...<p>"6.1 Limitations of Salsify<p>No audio. Salsify does not encode or transmit audio."<p>Claiming that you beat a bunch of codecs that have synchronized audio (even though they disable it) is kind of misleading ...
Slightly tangential...<p><i>Salsify is led by Sadjad Fouladi, a doctoral student in computer science at Stanford University, along with fellow Stanford students John Emmons, Emre Orbay, and Riad S. Wahby, as well as Catherine Wu, a junior at Saratoga High School in Saratoga, California. The project is advised by Keith Winstein, an assistant professor of computer science.<p>Salsify was funded by the National Science Foundation and the Defense Advanced Research Projects Agency (DARPA). Salsify has also received support from Google, Huawei, VMware, Dropbox, Facebook, and the Stanford Platform Lab.</i><p>Financially supported by the government and tech juggernauts, and executed by top-tier doctoral students + a high school student + a top-tier university professor.<p>Assuming this could be game-changing innovation to further advance worldwide communication, it's refreshing to see the positive externalities of a combination of capitalistic (F500 tech co's) and socialistic (university, government) systems executed by a seemingly diverse set of actors.
(Disclaimer: This comment is my personal opinion, not that of my employer.)<p>Really exciting work.<p>Encoding multiple versions of a video and picking a smaller one in response to congestion already happens for video-on-demand (think YouTube and Netflix videos) in DASH. That said, with VOD you can encode the video slower than real-time.<p>I can't imagine this ever making it into Skype/FaceTime/Hangouts/Duo. The big corps will probably continue to focus on "more internet" (fiber optic, zero rating, wi-fi hotspots, and internet traffic management practices).
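For contrast, the VOD approach looks roughly like this (illustrative numbers and names, not any real player's API): the ladder of bitrates is encoded offline, and the client just picks a rung per segment from its throughput estimate:

```python
# Rough sketch of DASH-style adaptive bitrate for VOD (illustrative names,
# not a real player's API): representations are encoded ahead of time, and
# the client picks the highest one that fits its measured throughput.

BITRATE_LADDER_KBPS = [250, 500, 1000, 2500, 5000]  # pre-encoded versions

def pick_representation(estimated_throughput_kbps: float,
                        safety_factor: float = 0.8) -> int:
    """Highest pre-encoded bitrate that still leaves some headroom."""
    budget = estimated_throughput_kbps * safety_factor
    candidates = [b for b in BITRATE_LADDER_KBPS if b <= budget]
    return candidates[-1] if candidates else BITRATE_LADDER_KBPS[0]

print(pick_representation(3500))  # -> 2500
```

That works because the encodes already exist on disk; a real-time call has no such luxury, which is where Salsify's per-frame approach comes in.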
It's 2018 and I still have many dropped calls and other weird stuff when I talk with people on my mobile. FaceTime Audio is often a good alternative but still not perfect. So, I really hope the audio version of this will be commercialized soon.
Unfortunately this would only apply to one-on-one, low-latency video chats. For streaming to an audience, which generally puts a distribution network between the viewers and the video source to help handle load and geographic distribution, the CDN itself has no influence on video encoding. The CDN would need to jump in and do this back-and-forth negotiation and delivery of lower-quality frames, which it is not currently suited for. I'd love to see it come about, but it's not just the codecs we need to look at for adoption beyond point-to-point video calls.
There's a vegetable named "salsify", very yummy. <a href="https://duckduckgo.com/?q=salsify+vegetable&t=ffab&ia=recipes" rel="nofollow">https://duckduckgo.com/?q=salsify+vegetable&t=ffab&ia=recipe...</a>
Barely related to this, but looking at the results (section 5.2) I'm amazed at how much worse T-Mobile is for latency. AT&T and Verizon both give about 2 s of delay for Hangouts, while T-Mobile gives 7 s of delay.
> What would you say to tomorrow’s codec implementers?<p>> Standardize an interface to export and import the encoder’s and decoder’s internal state between frames!<p>Can't this be achieved using sandboxing/emulation/VM techniques?
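For reference, here's roughly what such an interface would enable (a hypothetical sketch with made-up names; the paper only calls for the capability). A VM or process snapshot would capture the same information, just along with a lot of state you don't care about:

```python
# Hypothetical sketch (made-up names) of an encoder whose inter-frame state
# can be exported and re-imported. With that, a sender can try encoding the
# same frame at two quality levels from the same starting state and transmit
# only the one that suits the network.

import copy

class StatefulEncoder:
    def __init__(self):
        self.state = {"reference_frame": None}

    def export_state(self) -> dict:
        return copy.deepcopy(self.state)

    def import_state(self, state: dict) -> None:
        self.state = copy.deepcopy(state)

    def encode(self, frame: bytes, quality: float) -> bytes:
        self.state["reference_frame"] = frame      # update inter-frame state
        return frame[: int(len(frame) * quality)]  # stand-in for compression


enc = StatefulEncoder()
saved = enc.export_state()
high = enc.encode(b"x" * 1000, quality=0.9)  # first attempt, maybe too big
enc.import_state(saved)                      # roll back to the saved state
low = enc.encode(b"x" * 1000, quality=0.4)   # re-encode the same frame smaller
```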
Another recent discussion was <a href="https://news.ycombinator.com/item?id=16802079" rel="nofollow">https://news.ycombinator.com/item?id=16802079</a>.
Kudos for making things accessible. However, joint source-channel coding is not news, especially at the level of scalable video coding (probably 20-year-old research by this point). In academia this isn't as exciting as it sounds to industry.
"Is this a startup company?<p>No.<p>Are you sure? Your website looks like a startup company’s.<p>It's just the HTML template! They all look like this. [...]"<p>Brilliant