Open source implementation for LLaMA-based ChatGPT

289 points by georgehill, over 2 years ago

21 comments

amrb, over 2 years ago
I would care more about the LLaMA architecture when I can get my hands on it. Honestly, this project is more interesting, and it is lightning fast even on a 2060 laptop: https://github.com/BlinkDL/RWKV-LM
shmatt, over 2 years ago
I don't really understand the benchmarking aspect researchers are touting. The public never cared about LLMs until they had a proper conversation with one. You can beat GPT3 at any benchmark you'd like, but if you can't get people that "feeling" when chatting with your model, is it worth anything?

In the future there's going to have to be a way to benchmark the "human-ness" or "intrigue" or "feistiness" of a model to show us if it's getting better at what we want.
SakiWatanabe, over 2 years ago
What is the purpose of this? The model from Meta is not available to the public. Neither this open source "LLaMA-based ChatGPT" nor the "open source" LLaMA can be downloaded or actually used by the public, because that would require the actual trained model.
agolio, over 2 years ago
I'm as much a Meta hater as anyone (their policies have consistently disappointed me in almost every aspect of their business), but I must say I am happy with their stance on this LLaMA project, which seems to mark a turn for the better.

If they follow through on their promise of making the weights available and sharing the source code, that is a big step in the right direction for democratising this technology.
georgehill, over 2 years ago
For anyone wondering what LLaMA is, here are some useful links.

https://ai.facebook.com/blog/large-language-model-llama-meta-ai

https://news.ycombinator.com/item?id=34925944
davidy123, over 2 years ago
I am very far from an expert on this, but I think domain-specific conversational AI would be much more useful than these large models. It's fun to ask an AI to compose a fresh 600bpm hip hop song about the relationship between materials science and the breeding habits of mosquitoes, but an open-source medical AI, application support AI, or many other applications would be much more practical, if they could be accurate enough. And especially if they could run "standalone." They could also consult with each other, as a network of specialized AIs. Is work inching closer to more specific, more accurate applications? Or is this just a big gimmick/distraction phase around a maybe not so great idea of AI?
levesque, over 2 years ago
In what way is this a ChatGPT implementation or equivalent? Seems like a chatbot based on a different backend, therefore it has absolutely zero link to ChatGPT.
didntreadarticl, over 2 years ago
Have we got any details on the benchmarks that show LLaMA's 13B architecture outperforming GPT-3? Because that seems kind of fantastical. Is it just a product of a very specific benchmark, or does it reflect real-world performance?
visarga, over 2 years ago
This whole debate - if a 13B model can really be as good as GPT3 - would have been settled if we had a live demo. I am not sure their licence allows running public demos, even if you get the weights.
rnosov, over 2 years ago
Looks like they are making a ChatGPT clone that would be possible to run on a single GPU. HN dream come true!
voytec, over 2 years ago
Fake title riding on ChatGPT popularity. I think that it should be updated to something like:

Open source implementation for LLaMA-based chat bot*

Open source implementation for LLaMA-based ChatGPT alternative*
holtkam2, over 2 years ago
I don't have a decent GPU at my disposal... has anyone tried to run LLaMA on an EC2 GPU instance? If so, which instance type? (I don't wanna overpay)
jstsch, over 2 years ago
Are LLaMA's weights generally available/floating around yet?
bethecloud, over 2 years ago
Open-Assistant from LAION is in the process of creating an OSS RLHF dataset for a personal assistant; it may be useful for this project.
threevox, over 2 years ago
Can someone leak the weights please
vivegi, over 2 years ago
This obsession with *locking up* model weights behind a gate-keeping *application form* and calling it *open source* is weird. I don't know who the high priests are trying to fool.

If your model is really that good, unleash it into the open so that others can truly evaluate it, warts and all, and help improve it by identifying the flaws.
louis030195, over 2 years ago
The difference between invention & innovation is that innovation is when you ship your product to the masses. Can I query LLaMA in a line of code? No.
gersh, over 2 years ago
Is the trained model available to download anywhere?
rvz, over 2 years ago
> LLaMA is creating a lot of excitement because it is smaller than GPT-3 but has better performance. For example, LLaMA's 13B architecture outperforms GPT-3 despite being 10 times smaller.

Exactly. Best part is that it is open-source.

That is worth getting excited about. Not an AI SaaS API owned by a so-called pseudo-non-profit company which struggles with API uptime and availability, just like GitHub.

This is the 'revolution' you are looking for that changes everything. Not ChatGPT.
karmasimida, over 2 years ago
Is it any good?
Jack5500, over 2 years ago
This seems like a great first step to a truly open source LLM