Updates to Cloud Speech-to-Text and general availability of Cloud Text-to-Speech

76 points by rayshan over 6 years ago

9 comments

danShumway over 6 years ago
Google's speech-to-text is powerful, but I'd be pretty skeptical about tying a project to it given how services like Maps have been handled recently. There are companies like Mozilla trying to build more open solutions, but to the best of my knowledge (please correct me if I'm wrong) any pre-trained services Mozilla offers will also still involve you connecting to their servers.

Maybe I'm just paranoid, but I just can't imagine using a speech-to-text system for anything serious that I can't self-host. It feels like we've seen example after example of why this is a bad idea -- to the point that when I hear a company like Google talk about a locked-down cloud platform as "making AI accessible to everyone" it feels almost dishonest.

Especially once we start talking about text-to-speech. We can already do a lot of that locally; we should be pretty hesitant about coupling new text-to-speech techniques to strategies that require us to move logic away from local devices and onto the cloud.
oulipo over 6 years ago
If you want to build open-source, 100% on-device and private-by-design voice assistants which can run on a Raspberry Pi, you can take a look at what we are building at https://snips.ai (disclaimer: I'm a co-founder).

We want to make it possible to have embedded assistants in all your objects which preserve people's privacy, and to do this with open source: https://medium.com/snips-ai/an-introduction-to-snips-nlu-the-open-source-library-behind-snips-embedded-voice-platform-b12b1a60a41a

Take a look at our blog to get started in 1h: https://medium.com/snips-ai/voice-controlled-lights-with-a-raspberry-pi-and-snips-822e53d7ede6

It also integrates with popular home automation platforms like Home Assistant and Jeedom.
zawerf over 6 years ago
Anyone know how this relates to the Web Speech API[1]?

Will they ship it with Chrome to replace the existing speech synthesis API? (I believe right now it just uses whatever voices are available to the device or OS, but Chrome can fall back to a server-side voice.)

[1] https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API

[2] https://developer.mozilla.org/en-US/docs/Web/API/SpeechSynthesis
andrewstuart over 6 years ago
On my machine the demo page at https://cloud.google.com/text-to-speech/ doesn't work.

I tried to get Google to fix this a long time ago, and it seemed to work for a while after being offline for weeks.
TheChaplain over 6 years ago
Does anyone know how this compares to Dragon NaturallySpeaking?
pasta over 6 years ago
A friend works for a newspaper. He records interviews.

We tried all the software we could find to turn the recordings (Dutch) into text, but nothing gives a helpful result.

I know that recording-to-text is different from speech-to-text, but even when I use OK Google, most of the time the results are horrible.

So after all these years I am still a little skeptical.
ezoe over 6 years ago
I really want a free software implementation of text-to-speech and speech-to-text that runs on a local computer without a network connection.

I don't trust those cloud-based solutions.
joshmn over 6 years ago
Speech-to-text is great. I'm using it to transcribe voicemails in a product I'm building.
_wmd over 6 years ago
I'd love to read what this page has to say, but somehow it managed to load with some click-grabbing Gawker-type theme? Half expecting a "100 Surprising Cloud Facts, And Number 12 Will Shock You" link to appear in that inexcusable waste of space along the bottom. https://i.imgur.com/Uk1udNo.jpg