TechEcho

6 comments

coryfkleinabout 7 years ago

This is the audio equivalent of the Face2Face algorithm that takes one person's face and places it onto the character in a video, matching the latter subject's expressions.This means we now live in a world where you can create a recording of Donald Trump saying, "I colluded with the Russians to rig the election," and not only have the voice sound like Trump but also bring along his personal expressive style so that it becomes indistinguishable from Trump himself.Would love to see these two combined - make an audio-video recording of an actor confessing to election fraud, then use Face2Face to swap in Trump's face and use Tacotron to swap in his voice.

评论 #16695397 未加载

modelessabout 7 years ago

Note that this is separate from the other front page post about Google Cloud TTS powered by WaveNet. That's a product, while this is exciting new research (which will hopefully become part of a product).

visargaabout 7 years ago

This technology has been around for a year but we only got a few samples. I'm very excited. I use TTS to read back all the text I consume on PC.This web demo allows you to enter your own text:<a href="https://cloud.google.com/text-to-speech/" rel="nofollow">https://cloud.google.com/text-to-speech/</a>(select US American and Wavenet)

评论 #16692331 未加载

polishTarabout 7 years ago

<a href="https://google.github.io/tacotron/publications/global_style_tokens/demos/gstwn/gstwn_vs_g_2.wav" rel="nofollow">https://google.github.io/tacotron/publications/global_style_...</a>Ha!

评论 #16694011 未加载

评论 #16693190 未加载

John_KZabout 7 years ago

Well, I guess it's time for authenticated phonecalls.

aaronharnlyabout 7 years ago

Congratulations Daisy! This work is really impressive (and quite fun).

6 comments

coryfkleinabout 7 years ago

评论 #16695397 未加载

modelessabout 7 years ago

visargaabout 7 years ago

评论 #16692331 未加载

polishTarabout 7 years ago

<a href="https://google.github.io/tacotron/publications/global_style_tokens/demos/gstwn/gstwn_vs_g_2.wav" rel="nofollow">https://google.github.io/tacotron/publications/global_style_...</a>Ha!

评论 #16694011 未加载

评论 #16693190 未加载

John_KZabout 7 years ago

Well, I guess it's time for authenticated phonecalls.

aaronharnlyabout 7 years ago

Congratulations Daisy! This work is really impressive (and quite fun).

Expressive Speech Synthesis with Tacotron

6 comments

Expressive Speech Synthesis with Tacotron

6 comments