TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Audapolis: Edit audio files by transcript, not waveform

291 点作者 mavsman10 个月前

20 条评论

vunderba10 个月前
I remember when Adobe demoed this idea of being able to edit waveforms by the recognized text back in 2016 and it was pretty mind blowing for the time.<p><a href="https:&#x2F;&#x2F;youtu.be&#x2F;I3l4XLZ59iw" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;I3l4XLZ59iw</a><p><i>EDIT: I could also definitely see Audapolis being useful if you could integrate it into a podcast&#x27;s post processing flow (volume normalization, de-essing) by recognizing certain verbal tics and automatically removing them from the audio such as &quot;ummmm...&quot;, etc.</i>
评论 #41047318 未加载
评论 #41038654 未加载
bluelightning2k10 个月前
A genuinely free alternative to Descript sounds very useful.<p>I&#x27;ve always liked the idea of Descript and was considering building something similar before it came out. The problem is my use case is a couple of videos a year so doesn&#x27;t fit with an expensive monthly subscription
评论 #41061336 未加载
hammeiam10 个月前
I&#x27;ve spent some of my free time over the past couple of months working on something similar. It&#x27;s in a decent state but I need help from somebody who understands the .fcpxml format so you can export your edits to Davinci and FCP.<p>Take a look at <a href="https:&#x2F;&#x2F;matcha.video" rel="nofollow">https:&#x2F;&#x2F;matcha.video</a>
评论 #41061213 未加载
评论 #41049122 未加载
petarb10 个月前
This is awesome to see as an open source project.<p>This functionality is some of my favorite when editing videos in Descript. It’s so much easier than chopping up waveforms in Audacity
corn13read210 个月前
This is pretty dated and doesn&#x27;t support whisper which is the de-facto speech recognition model currently
评论 #41049052 未加载
raymond_goo10 个月前
Demo Video: <a href="https:&#x2F;&#x2F;pajowu.de&#x2F;audapolis_intro.mp4" rel="nofollow">https:&#x2F;&#x2F;pajowu.de&#x2F;audapolis_intro.mp4</a>
Machado11710 个月前
The other day I was using the voice memos app on iOS 18 and was surprised to find that it also supports editing the recording by transcript
评论 #41045066 未加载
alsetmusic10 个月前
One of the hosts of a podcast that I listen to has had positive things to say about DeScript.[0] Just mentioning it because he&#x27;s been talking about it for a few years so I expect its had a good amount of feature development over time.<p>[0] descript.com&#x2F;
评论 #41037865 未加载
pryelluw10 个月前
If the maintainer is reading, having a demo video would be nice.
评论 #41044310 未加载
leetrout10 个月前
Hindenburg also added this capability.<p>&gt; Hindenburg’s manuscript feature gives you a complete overview of your audio. You can select the text just as you would in a text document and watch as your edits are made in real-time. If you need to export your text in a specific format, no problem. Hindenburg supports the most common text and transcription export formats.<p><a href="https:&#x2F;&#x2F;hindenburg.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;hindenburg.com&#x2F;</a>
emadda10 个月前
Nice, are there plans to notarize the mac app?<p>I built something similar here: <a href="https:&#x2F;&#x2F;bigwav.app" rel="nofollow">https:&#x2F;&#x2F;bigwav.app</a>
评论 #41050171 未加载
geekodour10 个月前
this looks great! will try out. I built a similar but very scrappy tool for the same usecase last year, I&#x27;d probably not build it if i found this.<p>[0] <a href="https:&#x2F;&#x2F;github.com&#x2F;geekodour&#x2F;wscribe-editor">https:&#x2F;&#x2F;github.com&#x2F;geekodour&#x2F;wscribe-editor</a>
jdprgm10 个月前
This really needs a video demo or at least a more in depth text description of the features. Will download later to try but curious does this just do simple hard cuts on audio text or is there any ai magic for blending sentence timing if that makes sense?<p>A number of comments turned me onto Descript -- made a similar comment on another audio thread recently: drives me absolutely insane how all audio tools with any AI are web based monthly saas instead of offline private gpu upfront purchase.
评论 #41040078 未加载
generalizations10 个月前
Combine this with the tech to generate new audio matching the speaker&#x27;s voice profile, and you&#x27;ve really got something cool.
评论 #41044338 未加载
jiehong10 个月前
That’s awesome!<p>Is 1 emoji for each commit title a new trend?
评论 #41037024 未加载
评论 #41037044 未加载
j4510 个月前
This is exciting to see - it seems the last release of was a year ago.<p>Can anyone clarify if this project is active?
StarterPro10 个月前
Call me a jerk, but anyone who is editing audio seriously, probably wants the waveform, no?
评论 #41047772 未加载
frakkingcylons10 个月前
Somewhat off-topic: I saw the funding note at the bottom - it’s pretty cool that the German government is giving some funding to projects like this. I wonder how much the US is doing in that regard, like if there’s a list of projects that tax dollars goes towards.
评论 #41040511 未加载
iainctduncan10 个月前
IMHO you should really change the headline on this. I&#x27;m an audio person, and my first thought was &quot;that&#x27;s stupid, words are awful at describing sound&quot;. But then I looked, and editing <i>transcriptions of voice recordings</i> by word is actually a great idea. That was not the impression the headline gave me, FWIW!
评论 #41037741 未加载
评论 #41038254 未加载
评论 #41038367 未加载
MForster10 个月前
And here I was expecting that I could edit the text and the app would change the audio file to say what I had typed...
评论 #41039616 未加载