Our team is filled with technologists and creators, and when we record and edit videos, 80% of the time is spent chopping up the video, removing silences, and picking the right takes. So we decided to build a tool that did that for you — or at least get you there most of the way!<p>Our initial implementation is somewhat naïve and uses a user configurable silence threshold that just reads in volume levels. In the future, we’d like to use a frequency-based approach that focuses on the human voice. We’re also open to ideas, so let us know if you have any!
I watched the demonstration videos on the landing page, and the effect wasn't as bad as I often see on some youtube videos, but I think that's because the subject
was sitting at a desk, rather than standing, moving around much more.<p>Anecdata - heavily jump-cut edits on youtube instantly inspire me to find an alternative source of the same information, as the breathless, rapid-fire, jerky-video, sensation is irksome. Evidently I'm in the minority, and I'm okay with that.
Very nice! As someone who wrote a native tool to do pretty much this (Recut / getrecut.com) it's super impressive to see it done in a web browser. The editor feels very fast and fluid.<p>Doing it natively was hard enough, and recently I've been rewriting Recut with Rust + Electron so I have an idea of how much work it was to get it working well :sweat-smile: Keep up the good work y'all!
This is basically an attention killing machine and I'm not talking about the method, the method is OK and not novel. I have found that these time cuts completely ruin my ability to concentrate and after it happens 2-3 times in a short period, I lose interest in the video because it is so annoying.
Honestly the constant jump cut style of editing feels so unnatural to me... I'd rather watch someone takes pauses and not be taken out of the moment.... I'm sure this has applications particularly in advertising/marketing but it's not without outs issues.
Reminds me of the tool presented in [1], which also shows some interesting applications. Apparently an improved version is now being sold on its own platform [2], but the original Python script is still available on Github [3].<p>[1] <a href="https://www.youtube.com/watch?v=DQ8orIurGxw" rel="nofollow">https://www.youtube.com/watch?v=DQ8orIurGxw</a><p>[2] <a href="https://jumpcutter.com" rel="nofollow">https://jumpcutter.com</a><p>[3] <a href="https://github.com/carykh/jumpcutter" rel="nofollow">https://github.com/carykh/jumpcutter</a>
Whoa. For years I've been seeing YouTube videos that seem to just teleport choppily around what I assumed were silences. I always assumed there was a standard tool that everyone used to do this. I can do the equivalent thing to a podcast episode in like 5 seconds in Logic. The notion that people have been doing this by hand is staggering, but kudos to you for finally coming along and filling this niche.
I get the use case. Not sure there's enough value here as a cloud-based SaaS product. I use a similar product called Recut (<a href="https://getrecut.com/" rel="nofollow">https://getrecut.com/</a>). $99 one-time fee (no subscription). Easy to open the edited project in NLEs. macOS only. Unclear that Kapwing is a better option when Recut would be breakeven at seven months and you can don't need to round trip videos in the cloud.
I don't know if any other apps do this but there are plenty of Japanese youtube channels that do this<p>This one in particular is the one where I was introduced to the style but it certainly wasn't the first to do this.<p><a href="https://www.youtube.com/c/%E6%9C%89%E9%9A%A3%E5%A0%82%E3%81%97%E3%81%8B%E7%9F%A5%E3%82%89%E3%81%AA%E3%81%84%E4%B8%96%E7%95%8C" rel="nofollow">https://www.youtube.com/c/%E6%9C%89%E9%9A%A3%E5%A0%82%E3%81%...</a><p>To be clear, they aren't just cutting out silence, they are cutting them to the point that it's a style and the speech pattern, particularly of the Owl character, sounds unnatural, which I'm assuming is what they were going for. It's that style of nearly cutting phrases together faster than they would be naturally spoken is the thing I'm saying is a trend in some Japanese videos.<p>Note: The channel itself is run by a stationary store chain from Tokyo. The funny part is the Owl character often makes fun of what they're showing as in "why would anyone buy this?" or "That's way too expensive" which is funny for a channel run by a store selling most of things they're showing off.
Congrats on the launch Shah! I can tell you stayed up late giddy for this launch :D. As another peer building for video creators, I am delighted to see more efficiency features like this released.<p>This approach was the one I tried first also (I also tried the frequency one fwiw, which has its own, worse drawbacks). But using loudness runs into issues if the source loudness isn't (relatively) even across the entire source media. Using a single sensitivity setting like this would be a problem if:<p>* recording gain is set to automatic, and there are sudden changes in noise floor like wind (if recorded in 24-bit or lower)<p>* crew adjusts gain partway through recording (big no-no but happens)<p>* talent/host moves in and out of microphone sweet spot<p>* talent/host adjusts themselves in a squeaky chair during silence or transition-to-silence (or coughs, or breaths loudly, or ambulance goes by...)<p>If you apply the edit w/ a single sensitivity and something like the above is true, it would cut in the wrong place. Unfortunately, you would have to watch the entire show, skipping to boundaries with your full attention to know that ever got a cut wrong.
If it's the type of video where cutting silence automatically is viable, I wish they would just write out the transcript.<p>You can read what they're saying in a fraction of the time it takes to watch a video, and often internalize it better.
I use final cut to edit videos. Two things I haven't figured out how to do is to automatically that I would love an automatic solution to:<p>1. Cross fade the audio between clips, without crossfading the video, if the sound is available past the cut point. I think its pretty much 90% of the time that I would want something like that.<p>2. I have clips from a gopro that had a selfie stick that made a loud clicky sound from rattling that peaks above regular audio. Some way to lower the volume on that clicky noise without having to manually go through and do it for each click.
I believe there is (or was) a product used by TV stations which would semi-automatically re-edit a movie to fit in a specific amount of time, mostly by removing the start of end of a cut where not much was happening on the audio or video. Assuming I'm not completely imagining that, anyone know what it's called?
I'm glad there you guys made this option available in the browser. For those that don't know, YouTubers can spend more than 50% of their time editing on just cutting/trimming down silences!<p>Source: I am a YouTuber.<p>I even tried to tackle this problem in 2021 by building Atomic Edits[0] - an electron desktop app that did the same thing! I built a working prototype, but eventually stopped working on it after realizing that lots of web-based video editors started offering this functionality. In retrospect, it was obvious that doing this in the browser with cloud saves was way better.<p>Anyways, nice work! I'll check it out later.<p>[0] <a href="https://www.github.com/SuboptimalEng/atomic-edits" rel="nofollow">https://www.github.com/SuboptimalEng/atomic-edits</a>
Instead of cutting out the silence spots, why not speed them up? If the presenter is silent b/c they are drawing something on the board (like in a lecture), then the result will feel a choppy.
On the about page you mentioned that you raised instead of continuing on the bootstrapped path. What made you pull the trigger?<p>My first impression when I checked out the landing page and product was: this is a perfect bootstrapped SaaS product idea to grow to meaningful ARR with a small team of 2-3 + some hired help for things like customer support.
The tool seems useful, but listening to the edited audio gives a suffocating feeling, because the speaker makes no pauses to breathe. Absolutely terrible sensation, exactly like I had earlier with commercial news radio stations that were packed with advertizing and announces.
This service seems very similar to another product I saw here months ago on Show HN called SavvyCut (<a href="https://www.savvycut.com/" rel="nofollow">https://www.savvycut.com/</a>).<p>Can you comment on differences?
I use ffmpeg for this. It has some silence removal options described in the man page. I've never gotten it to work really well, but for the stuff I do, it helps.