TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Using ML to silence Lex Fridman voice on podcasts?

4 点作者 choletentent大约 3 年前
I&#x27;m a big fan of Lex Friedman. I&#x27;ve listen to most of his podcasts. But recently, he&#x27;s talking to much before thinking, without adding any value to the conversation or pushing his ideas that are, at least, very controversial. Nothing against him personally. But I know his opinions on things already. I don&#x27;t want listen him preaching them over and over.<p>I noticed that his comments add so little to the conversation, that if I could trim his voice out of the podcast, that would increase the quality of it.<p>I thought there would be some automated way of doing it using ML. I have some experience with CNN on images, but I&#x27;ve never dealt with audio before. Any recommendations?

2 条评论

apohn大约 3 年前
What about classifying who is speaking and just muting audio when Lex is speaking? First, extract a lot of samples (e.g 2-5 seconds of Audio) from a lot of his podcasts and label them as 1&#x2F;0 or Lex&#x2F;Other person speaking.<p>Take those samples and convert them to a frequency spectrum. For each sample, average (or use max, min, whatever) the values over the time sample. Take bins of values (e.g. 100hz, 120 hz, 140hz), and filter out all values outside of the human speaking range.<p>What you then have is a training set that is a set of features that are the amplitude of each frequency, and a target of 1 (Lex is speaking) or 0 (Somebody else is speaking).<p>Use your ML or Deep Learning Algo of choice to see if you can get useful results out of it.
alex14fr大约 3 年前
Lex Fridman is an AI expert, you should ask him.
评论 #30415995 未加载
评论 #30416194 未加载