Show HN: Edit videos faster by automatically removing silences

171 pointsby shahahmedover 3 years ago

Our team is filled with technologists and creators, and when we record and edit videos, 80% of the time is spent chopping up the video, removing silences, and picking the right takes. So we decided to build a tool that did that for you — or at least get you there most of the way!Our initial implementation is somewhat naïve and uses a user configurable silence threshold that just reads in volume levels. In the future, we’d like to use a frequency-based approach that focuses on the human voice. We’re also open to ideas, so let us know if you have any!

25 comments

Jeddover 3 years ago

I watched the demonstration videos on the landing page, and the effect wasn't as bad as I often see on some youtube videos, but I think that's because the subject was sitting at a desk, rather than standing, moving around much more.Anecdata - heavily jump-cut edits on youtube instantly inspire me to find an alternative source of the same information, as the breathless, rapid-fire, jerky-video, sensation is irksome. Evidently I'm in the minority, and I'm okay with that.

评论 #30200356 未加载

评论 #30200405 未加载

dceddiaover 3 years ago

Very nice! As someone who wrote a native tool to do pretty much this (Recut / getrecut.com) it's super impressive to see it done in a web browser. The editor feels very fast and fluid.Doing it natively was hard enough, and recently I've been rewriting Recut with Rust + Electron so I have an idea of how much work it was to get it working well :sweat-smile: Keep up the good work y'all!

评论 #30203028 未加载

评论 #30202080 未加载

评论 #30201019 未加载

betimslover 3 years ago

This is basically an attention killing machine and I'm not talking about the method, the method is OK and not novel. I have found that these time cuts completely ruin my ability to concentrate and after it happens 2-3 times in a short period, I lose interest in the video because it is so annoying.

评论 #30206051 未加载

评论 #30205078 未加载

engineerthrwawyover 3 years ago

Honestly the constant jump cut style of editing feels so unnatural to me... I'd rather watch someone takes pauses and not be taken out of the moment.... I'm sure this has applications particularly in advertising/marketing but it's not without outs issues.

评论 #30204192 未加载

mimimi31over 3 years ago

Reminds me of the tool presented in [1], which also shows some interesting applications. Apparently an improved version is now being sold on its own platform [2], but the original Python script is still available on Github [3].[1] <a href="https://www.youtube.com/watch?v=DQ8orIurGxw" rel="nofollow">https://www.youtube.com/watch?v=DQ8orIurGxw</a>[2] <a href="https://jumpcutter.com" rel="nofollow">https://jumpcutter.com</a>[3] <a href="https://github.com/carykh/jumpcutter" rel="nofollow">https://github.com/carykh/jumpcutter</a>

评论 #30201291 未加载

powrtochover 3 years ago

Whoa. For years I've been seeing YouTube videos that seem to just teleport choppily around what I assumed were silences. I always assumed there was a standard tool that everyone used to do this. I can do the equivalent thing to a podcast episode in like 5 seconds in Logic. The notion that people have been doing this by hand is staggering, but kudos to you for finally coming along and filling this niche.

评论 #30200914 未加载

评论 #30199845 未加载

评论 #30200002 未加载

评论 #30199997 未加载

laseanover 3 years ago

I get the use case. Not sure there's enough value here as a cloud-based SaaS product. I use a similar product called Recut (<a href="https://getrecut.com/" rel="nofollow">https://getrecut.com/</a>). $99 one-time fee (no subscription). Easy to open the edited project in NLEs. macOS only. Unclear that Kapwing is a better option when Recut would be breakeven at seven months and you can don't need to round trip videos in the cloud.

评论 #30200701 未加载

评论 #30200450 未加载

评论 #30204576 未加载

greggman3over 3 years ago

I don't know if any other apps do this but there are plenty of Japanese youtube channels that do thisThis one in particular is the one where I was introduced to the style but it certainly wasn't the first to do this.<a href="https://www.youtube.com/c/%E6%9C%89%E9%9A%A3%E5%A0%82%E3%81%97%E3%81%8B%E7%9F%A5%E3%82%89%E3%81%AA%E3%81%84%E4%B8%96%E7%95%8C" rel="nofollow">https://www.youtube.com/c/%E6%9C%89%E9%9A%A3%E5%A0%82%E3%81%...</a>To be clear, they aren't just cutting out silence, they are cutting them to the point that it's a style and the speech pattern, particularly of the Owl character, sounds unnatural, which I'm assuming is what they were going for. It's that style of nearly cutting phrases together faster than they would be naturally spoken is the thing I'm saying is a trend in some Japanese videos.Note: The channel itself is run by a stationary store chain from Tokyo. The funny part is the Owl character often makes fun of what they're showing as in "why would anyone buy this?" or "That's way too expensive" which is funny for a channel run by a store selling most of things they're showing off.

评论 #30201598 未加载

thegoleffectover 3 years ago

Congrats on the launch Shah! I can tell you stayed up late giddy for this launch :D. As another peer building for video creators, I am delighted to see more efficiency features like this released.This approach was the one I tried first also (I also tried the frequency one fwiw, which has its own, worse drawbacks). But using loudness runs into issues if the source loudness isn't (relatively) even across the entire source media. Using a single sensitivity setting like this would be a problem if:* recording gain is set to automatic, and there are sudden changes in noise floor like wind (if recorded in 24-bit or lower)* crew adjusts gain partway through recording (big no-no but happens)* talent/host moves in and out of microphone sweet spot* talent/host adjusts themselves in a squeaky chair during silence or transition-to-silence (or coughs, or breaths loudly, or ambulance goes by...)If you apply the edit w/ a single sensitivity and something like the above is true, it would cut in the wrong place. Unfortunately, you would have to watch the entire show, skipping to boundaries with your full attention to know that ever got a cut wrong.

评论 #30202063 未加载

评论 #30201707 未加载

shanecoughlanover 3 years ago

Weird. I edit a lot of videos but this is basically not something I have ever needed. The natural cadence of speakers is more valuable to me.

评论 #30202114 未加载

评论 #30201468 未加载

fragsworthover 3 years ago

If it's the type of video where cutting silence automatically is viable, I wish they would just write out the transcript.You can read what they're saying in a fraction of the time it takes to watch a video, and often internalize it better.

评论 #30201300 未加载

dukeofdoomover 3 years ago

I use final cut to edit videos. Two things I haven't figured out how to do is to automatically that I would love an automatic solution to:1. Cross fade the audio between clips, without crossfading the video, if the sound is available past the cut point. I think its pretty much 90% of the time that I would want something like that.2. I have clips from a gopro that had a selfie stick that made a loud clicky sound from rattling that peaks above regular audio. Some way to lower the volume on that clicky noise without having to manually go through and do it for each click.

评论 #30202734 未加载

me_againover 3 years ago

I believe there is (or was) a product used by TV stations which would semi-automatically re-edit a movie to fit in a specific amount of time, mostly by removing the start of end of a cut where not much was happening on the audio or video. Assuming I'm not completely imagining that, anyone know what it's called?

SuboptimalEngover 3 years ago

I'm glad there you guys made this option available in the browser. For those that don't know, YouTubers can spend more than 50% of their time editing on just cutting/trimming down silences!Source: I am a YouTuber.I even tried to tackle this problem in 2021 by building Atomic Edits[0] - an electron desktop app that did the same thing! I built a working prototype, but eventually stopped working on it after realizing that lots of web-based video editors started offering this functionality. In retrospect, it was obvious that doing this in the browser with cloud saves was way better.Anyways, nice work! I'll check it out later.[0] <a href="https://www.github.com/SuboptimalEng/atomic-edits" rel="nofollow">https://www.github.com/SuboptimalEng/atomic-edits</a>

itakeover 3 years ago

Instead of cutting out the silence spots, why not speed them up? If the presenter is silent b/c they are drawing something on the board (like in a lecture), then the result will feel a choppy.

评论 #30200570 未加载

评论 #30206091 未加载

评论 #30198901 未加载

unfocussed_mikeover 3 years ago

Somewhere in the afterlife, Harold Pinter is cursing.

评论 #30200162 未加载

ricklamersover 3 years ago

On the about page you mentioned that you raised instead of continuing on the bootstrapped path. What made you pull the trigger?My first impression when I checked out the landing page and product was: this is a perfect bootstrapped SaaS product idea to grow to meaningful ARR with a small team of 2-3 + some hired help for things like customer support.

评论 #30200804 未加载

culebron21over 3 years ago

The tool seems useful, but listening to the edited audio gives a suffocating feeling, because the speaker makes no pauses to breathe. Absolutely terrible sensation, exactly like I had earlier with commercial news radio stations that were packed with advertizing and announces.

aphitover 3 years ago

This service seems very similar to another product I saw here months ago on Show HN called SavvyCut (<a href="https://www.savvycut.com/" rel="nofollow">https://www.savvycut.com/</a>).Can you comment on differences?

评论 #30228131 未加载

phren0logyover 3 years ago

How would you compare your offering Descript? The pricing appears similar.

评论 #30200370 未加载

评论 #30200125 未加载

throwaway81523over 3 years ago

I use ffmpeg for this. It has some silence removal options described in the man page. I've never gotten it to work really well, but for the stuff I do, it helps.

jenthovenover 3 years ago

Like this a lot, will save time on rough cutting. Did you use a library for volume detection?

评论 #30194896 未加载

rondrabkinover 3 years ago

love the creativity in finding new use cases for AI that ...really make sense.

pishpashover 3 years ago

Isn't this a basic function in Audacity (for audio)?

评论 #30202729 未加载

评论 #30201906 未加载

Cypherover 3 years ago

I already have a free plugin that does this :(

评论 #30228098 未加载