Ask HN: Most accurate ML speech-to-text API?

2 points by lumens, almost 5 years ago
I'm building a project that relies on at least pretty-good transcription with timestamps for each word and ideally speaker diarization.

Right now I'm using Google Cloud's Speech-to-Text, but the accuracy is underwhelming when transcribing a Zoom call (50%ish).

Am I likely to fare much better with Azure/AWS? What about Symbl.ai?
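
For comparing engines on the same recording, it can help to pin down what "accuracy" means. One common measure is word error rate (WER): the word-level edit distance between a hand-checked reference transcript and the engine's output, divided by the number of reference words. Below is a minimal sketch, assuming simple lowercase and whitespace tokenization; the function name `word_error_rate` is illustrative and does not come from any vendor SDK.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    # Assumption: lowercase + whitespace splitting is enough normalization.
    # Real evaluations usually also strip punctuation and normalize numbers.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

Running each candidate API over the same short, manually transcribed clip and comparing WER gives a like-for-like number. Note that WER can exceed 1.0 when the hypothesis contains many inserted words.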

2 comments

taf2, almost 5 years ago
Which model are you using on the Zoom calls? Also, are you using the enhanced model or just the default? There are a lot of factors with any engine.
mdrabla, almost 5 years ago
While sometimes more expensive, I've found GCP the best option (from an accuracy standpoint) for STT diarization
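
To make the model and enhanced-vs-default question concrete, here is a rough sketch of a request body for Google Cloud Speech-to-Text's v1 REST API that asks for per-word timestamps and speaker diarization. The helper name `build_recognize_request`, the bucket URI, the default speaker counts, and the choice of the `video` model are assumptions for illustration. Model and enhanced availability vary by language, so check the field values against the current documentation before relying on them.

```python
import json


def build_recognize_request(
    gcs_uri: str,
    language_code: str = "en-US",
    model: str = "video",       # assumption: "phone_call" may suit 8 kHz telephony better
    use_enhanced: bool = True,  # request the enhanced variant of the chosen model
    min_speakers: int = 2,      # illustrative defaults, not recommendations
    max_speakers: int = 6,
) -> dict:
    """Build a JSON-serializable body for a v1 speech recognize request."""
    config = {
        "languageCode": language_code,
        "model": model,
        "useEnhanced": use_enhanced,
        # Start and end time offsets for every recognized word.
        "enableWordTimeOffsets": True,
        "enableAutomaticPunctuation": True,
        # Speaker labels are attached to the individual words in the response.
        "diarizationConfig": {
            "enableSpeakerDiarization": True,
            "minSpeakerCount": min_speakers,
            "maxSpeakerCount": max_speakers,
        },
    }
    return {"config": config, "audio": {"uri": gcs_uri}}


if __name__ == "__main__":
    # Hypothetical bucket and object name.
    body = build_recognize_request("gs://example-bucket/zoom-call.wav")
    print(json.dumps(body, indent=2))
```

This only builds the payload. Sending it still requires authentication and, for recordings longer than about a minute, the long-running recognize endpoint rather than the synchronous one.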