Audapolis: Edit audio files by transcript, not waveform

291 points by mavsman 10 months ago

20 comments

vunderba 10 months ago

I remember when Adobe demoed this idea of editing waveforms through the recognized text back in 2016, and it was pretty mind-blowing for the time.

https://youtu.be/I3l4XLZ59iw

EDIT: I could also definitely see Audapolis being useful if you could integrate it into a podcast's post-processing flow (volume normalization, de-essing) by recognizing certain verbal tics, such as "ummmm...", and automatically removing them from the audio.

bluelightning2k 10 months ago

A genuinely free alternative to Descript sounds very useful.

I've always liked the idea of Descript and was considering building something similar before it came out. The problem is that my use case is a couple of videos a year, so it doesn't fit with an expensive monthly subscription.

hammeiam 10 months ago

I've spent some of my free time over the past couple of months working on something similar. It's in a decent state, but I need help from somebody who understands the .fcpxml format so you can export your edits to DaVinci and FCP.

Take a look at https://matcha.video

petarb 10 months ago

This is awesome to see as an open source project.

This functionality is one of my favorites when editing videos in Descript. It’s so much easier than chopping up waveforms in Audacity.

corn13read2 10 months ago

This is pretty dated and doesn't support Whisper, which is currently the de facto speech recognition model.

raymond_goo 10 months ago

Demo Video: https://pajowu.de/audapolis_intro.mp4

Machado117 10 months ago

The other day I was using the Voice Memos app on iOS 18 and was surprised to find that it also supports editing the recording by transcript.

alsetmusic 10 months ago

One of the hosts of a podcast that I listen to has had positive things to say about Descript.[0] Just mentioning it because he's been talking about it for a few years, so I expect it's had a good amount of feature development over time.

[0] descript.com/

pryelluw 10 months ago

If the maintainer is reading, having a demo video would be nice.

leetrout 10 months ago

Hindenburg also added this capability.

> Hindenburg’s manuscript feature gives you a complete overview of your audio. You can select the text just as you would in a text document and watch as your edits are made in real-time. If you need to export your text in a specific format, no problem. Hindenburg supports the most common text and transcription export formats.

https://hindenburg.com/

emadda 10 months ago

Nice, are there plans to notarize the Mac app?

I built something similar here: https://bigwav.app

geekodour 10 months ago

This looks great! Will try it out. I built a similar but very scrappy tool for the same use case last year; I probably wouldn't have built it if I had found this.

[0] https://github.com/geekodour/wscribe-editor

jdprgm 10 months ago

This really needs a video demo or at least a more in-depth text description of the features. Will download later to try, but I'm curious: does this just do simple hard cuts on the audio based on the text, or is there any AI magic for blending sentence timing, if that makes sense?

A number of comments turned me onto Descript. I made a similar comment on another audio thread recently: it drives me absolutely insane how all audio tools with any AI are web-based monthly SaaS instead of an offline, private, GPU-based upfront purchase.

generalizations 10 months ago

Combine this with the tech to generate new audio matching the speaker's voice profile, and you've really got something cool.

jiehong 10 months ago

That’s awesome!

Is 1 emoji for each commit title a new trend?

j45 10 months ago

This is exciting to see, though it seems the last release was a year ago.

Can anyone clarify if this project is active?

StarterPro 10 months ago

Call me a jerk, but anyone who is editing audio seriously probably wants the waveform, no?

frakkingcylons 10 months ago

Somewhat off-topic: I saw the funding note at the bottom. It’s pretty cool that the German government is giving some funding to projects like this. I wonder how much the US is doing in that regard, like whether there’s a list of projects that tax dollars go towards.

iainctduncan 10 months ago

IMHO you should really change the headline on this. I'm an audio person, and my first thought was "that's stupid, words are awful at describing sound". But then I looked, and editing transcriptions of voice recordings by word is actually a great idea. That was not the impression the headline gave me, FWIW!

MForster 10 months ago

And here I was expecting that I could edit the text and the app would change the audio file to say what I had typed...
