TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

A new state-of-the-art open source chatbot

36 pointsby olibawabout 5 years ago

2 comments

drusepthabout 5 years ago
According to the &quot;Get the code&quot; link [1], it looks like these models need pretty huge GPUs to even interact with the pre-trained models. Is that abnormal? I was under the impression that training the model is generally what takes the beefy GPU, and then using that model requires more consumer-adjacent hardware. A P100 GPU is $3000 [2].<p>[1] <a href="https:&#x2F;&#x2F;parl.ai&#x2F;projects&#x2F;blender&#x2F;" rel="nofollow">https:&#x2F;&#x2F;parl.ai&#x2F;projects&#x2F;blender&#x2F;</a><p>[2] <a href="https:&#x2F;&#x2F;www.amazon.com&#x2F;dp&#x2F;B06WV7HFWV&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.amazon.com&#x2F;dp&#x2F;B06WV7HFWV&#x2F;</a>
评论 #23035487 未加载
评论 #23035547 未加载
shermanmccoyabout 5 years ago
Boiling it all down, when prompted, these models just regurgitate a similar sentence to what is observed in the training data for loosely that same input, using some glorified curve fitting. This does not necessarily imply the model understands the meaning of what it is spitting out. So the uninitiated will be really impressed with this kind of toy.<p>The researchers here appear to have placed particular emphasis on cleaning up what the model is spitting out, but I think it&#x27;s lipstick on a pig. The area begging for more research is parsing out the meaning of anything but the most simple sentence.
评论 #23036476 未加载
评论 #23026594 未加载