TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Whisper Jax: Transcribe a 1 hour of audio in under 15 seconds

36 pointsby mysteryboxabout 2 years ago

4 comments

jax_user123about 2 years ago
Can Whisper-JAX also translate audio streams in real time from X -&gt; Y, both of which are non-English languages?<p>Additionally I have tried Whisper JAX on JupyterHub and for some reason it does not transcribe&#x2F;translate under 10 seconds for me. In fact, if an audio is 5 minutes long, it would still take 3-4 minutes for transcribing it, although I have followed the similar steps as the ones in the Kaggle notebook, posted by the Author themself. Any ideas&#x2F;suggestions why this would be happenind would be really helpful.<p>Thank you!
sebnunabout 2 years ago
I&#x27;ve been looking for faster implementations of Whisper, the main drawback with Whisper Jax is that the performance comes from running on Google TPUs, which are much more expensive than GPUs.<p>On &quot;normal&quot; GPUs the fastest implementation I&#x27;ve found is <a href="https:&#x2F;&#x2F;github.com&#x2F;guillaumekln&#x2F;faster-whisper">https:&#x2F;&#x2F;github.com&#x2F;guillaumekln&#x2F;faster-whisper</a>. Whisper.cpp works faster on a CPU, especially on Apple Silicon, but still nowhere near the performance you could get on a GPU (understandably).<p>How does Whisper Jax compares to faster-whisper on a GPU?
评论 #35666443 未加载
评论 #35668088 未加载
toomuchtodoabout 2 years ago
Could this be used on device to generate closed captions of video on the fly?
mysteryboxabout 2 years ago
Whisper JAX is an optimised implementation of the Whisper model by OpenAI. It runs on JAX with a TPU v4-8 in the backend. Compared to PyTorch on an A100 GPU, it is over 70x faster, making it the fastest Whisper API available.
评论 #35664111 未加载