TechEcho

12 comments

lxeabout 2 years ago

For "actually serverless" voice chat, check out <a href="https://whisper.ggerganov.com/" rel="nofollow">https://whisper.ggerganov.com/</a>

评论 #35705134 未加载

评论 #35702904 未加载

评论 #35704832 未加载

评论 #35703373 未加载

评论 #35702630 未加载

IronWolveabout 2 years ago

Been using textgen and downloading tons of models, the models are all over the place. The problems of accuracy and short term memory are major issues that people are trying to implement work arounds.Check out textgen, it has voice in/out, graphics in/out, memory plugin, api, plugins, etc, all running locally.<a href="https://github.com/oobabooga/text-generation-webui">https://github.com/oobabooga/text-generation-webui</a>

评论 #35703794 未加载

评论 #35703803 未加载

wongarsuabout 2 years ago

That's a pretty cool showcase of modal [1]. From a marketing perspective I have to congratulate, this is a really well done way to get people to check out your platform.1: <a href="https://modal.com/" rel="nofollow">https://modal.com/</a>

评论 #35702671 未加载

评论 #35702276 未加载

评论 #35702058 未加载

评论 #35702172 未加载

forgingaheadabout 2 years ago

Nice to see Tortoise being used - I still think it's the best TTS system out there now. Generation time is slow, but quality is incredible. I wonder if the code can be optimised to speed up the generation, but I don't think the author is maintaining it any longer.[0][0]<a href="https://github.com/neonbjb/tortoise-tts">https://github.com/neonbjb/tortoise-tts</a>

评论 #35703849 未加载

评论 #35703904 未加载

评论 #35703285 未加载

评论 #35702833 未加载

评论 #35705314 未加载

tasty_freezeabout 2 years ago

I pitched this on a recently thread, but it was 12+ hours after it was posted, so I'll try again here.What I really want is a program to waste the time of phone calls making unsolicited sales pitches.It would do voice to text, run a simple language model to generate responses, then synthesize the voice back. It doesn't need to be a sophisticated model, not much more sophisticated than the classic "Eliza" program. A few years back someone did this with a canned loop of vague responses and it fooled the sales people for surprisingly long:<a href="https://www.youtube.com/watch?v=XSoOrlh5i1k">https://www.youtube.com/watch?v=XSoOrlh5i1k</a>It seems like it could all run locally for low latency. Probably the most important part to get right would be a TTS system that isn't immediately pegged as a robot.

评论 #35707174 未加载

评论 #35706695 未加载

评论 #35707844 未加载

sramamabout 2 years ago

Very cool - the demo was simple, functional and clear.It was a bit laggy, but for a free demo from an open source project, I should be the one being shamed!Well done.

juliennakacheabout 2 years ago

Interesting. How does local development work or remote debugging work if the entire production toolchain is abstracted away with proprietary software?

评论 #35703949 未加载

yewenjieabout 2 years ago

Does anybody know if there is an easy-to-deploy setup for tortoise-tts only?

评论 #35703752 未加载

syntaxingabout 2 years ago

Off tangent but can someone at Apple please just replace the Siri word recognition with whisper. We can finally have multi language support and not dogshit recognition.

loudmaxabout 2 years ago

Sorry if this is off-topic, but those are some really good book recommendations in the demonstration image! If those are coming from Vicuna, that speaks well of it.

testernewsabout 2 years ago

How about a self hosted version?

teacpdeabout 2 years ago

Why is it serverless? It clearly has an API server.

评论 #35702705 未加载

评论 #35702618 未加载

评论 #35702624 未加载

评论 #35702718 未加载

评论 #35702619 未加载

12 comments

lxeabout 2 years ago

For "actually serverless" voice chat, check out <a href="https://whisper.ggerganov.com/" rel="nofollow">https://whisper.ggerganov.com/</a>

评论 #35705134 未加载

评论 #35702904 未加载

评论 #35704832 未加载

评论 #35703373 未加载

评论 #35702630 未加载

IronWolveabout 2 years ago

评论 #35703794 未加载

评论 #35703803 未加载

wongarsuabout 2 years ago

评论 #35702671 未加载

评论 #35702276 未加载

评论 #35702058 未加载

评论 #35702172 未加载

forgingaheadabout 2 years ago

评论 #35703849 未加载

评论 #35703904 未加载

评论 #35703285 未加载

评论 #35702833 未加载

评论 #35705314 未加载

tasty_freezeabout 2 years ago

评论 #35707174 未加载

评论 #35706695 未加载

评论 #35707844 未加载

sramamabout 2 years ago

Very cool - the demo was simple, functional and clear.It was a bit laggy, but for a free demo from an open source project, I should be the one being shamed!Well done.

juliennakacheabout 2 years ago

Interesting. How does local development work or remote debugging work if the entire production toolchain is abstracted away with proprietary software?

评论 #35703949 未加载

yewenjieabout 2 years ago

Does anybody know if there is an easy-to-deploy setup for tortoise-tts only?

评论 #35703752 未加载

syntaxingabout 2 years ago

Off tangent but can someone at Apple please just replace the Siri word recognition with whisper. We can finally have multi language support and not dogshit recognition.

loudmaxabout 2 years ago

Sorry if this is off-topic, but those are some really good book recommendations in the demonstration image! If those are coming from Vicuna, that speaks well of it.

QuiLLMan: Voice chat with Vicuna-13B

12 comments

QuiLLMan: Voice chat with Vicuna-13B

12 comments