Open source implementation for LLaMA-based ChatGPT

289 点作者 georgehill超过 2 年前

21 条评论

amrb超过 2 年前

I would care more about LLaMA architecture when I can get hands on, honestly this project is more interesting and lighting fast on even a 2060 laptop <a href="https://github.com/BlinkDL/RWKV-LM">https://github.com/BlinkDL/RWKV-LM</a>

评论 #34960139 未加载

评论 #34963015 未加载

评论 #34961372 未加载

评论 #34961318 未加载

评论 #34958777 未加载

评论 #34958816 未加载

评论 #34958388 未加载

shmatt超过 2 年前

I don't really understand the benchmarking aspect researchers are touting. The public never cared about LLMs until they had a proper conversation with one. You can beat GPT3 at any benchmark you'd like, but if you can't get people that "feeling" when chatting with your model, is it worth anything?In the future there's going to have to be a way to benchmark the "human-ness" or "intrigue" or "feistiness" of a model to show us if its getting better at what we want

评论 #34959257 未加载

评论 #34957318 未加载

评论 #34957266 未加载

评论 #34959780 未加载

评论 #34957569 未加载

SakiWatanabe超过 2 年前

What is the purpose of this? The model from meta is not available to public. Neither this open source "LLaMA-based ChatGPT" nor the "open source" LLaMA can be downloaded or actually used by public because it would required the actual trained model.

评论 #34958641 未加载

agolio超过 2 年前

I'm as much a META hater as anyone - their policies have consistently disappointed me in almost every aspect of their business - but their stance on this LLaMA project I must say I am happy with and seems to mark a turn for the better.If they follow through on their promise of making the weights available and share source code that is a big step in the right direction for democratising this technology

评论 #34958126 未加载

评论 #34957566 未加载

评论 #34964264 未加载

georgehill超过 2 年前

For anyone wondering what LLaMA is, here are some useful links.<a href="https://ai.facebook.com/blog/large-language-model-llama-meta-ai" rel="nofollow">https://ai.facebook.com/blog/large-language-model-llama-meta...</a><a href="https://news.ycombinator.com/item?id=34925944" rel="nofollow">https://news.ycombinator.com/item?id=34925944</a>

评论 #34958709 未加载

davidy123超过 2 年前

I am very far from an expert on this, but I think domain specific conversational AI would be much more useful than these large models. It's fun to ask an AI to compose a fresh 600bpm hip hop song about the relationship between materials science and the breeding habits of mosquitoes, but an open-source medical AI, application support AI, or many other applications would be much more practical, if they could be accurate enough. And especially if they could run "standalone." They could also consult with each other, as a network of specialized AI. Is work inching closer to more specific, more accurate applications? Or is this just a big gimmick/distraction phase around a maybe not so great idea of AI?

评论 #34958272 未加载

评论 #34957978 未加载

评论 #34958341 未加载

评论 #34958424 未加载

评论 #34957926 未加载

评论 #34958866 未加载

评论 #34958583 未加载

levesque超过 2 年前

In what way is this a ChatGPT implementation or equivalent? Seems like a chatbot based on a different backend, therefore it has absolutely zero link to ChatGPT.

评论 #34957052 未加载

评论 #34957045 未加载

didntreadarticl超过 2 年前

Have we got any details on the benchmarks that show LLaMa's 13B architecture outperforming GPT-3? Because that seems kindsof fantastical. Is it just a product of a very specific benchmark or does it reflect real world performance?

评论 #34957587 未加载

评论 #34957085 未加载

visarga超过 2 年前

This whole debate - if a 13B model can really be as good as GPT3 - would have been settled if we had a live demo. I am not sure their licence allows running public demos, even if you get the weights.

rnosov超过 2 年前

Looks like they are making ChatGPT clone that would be possible to run a single GPU. HN dream come true!

评论 #34956979 未加载

评论 #34965432 未加载

评论 #34958686 未加载

voytec超过 2 年前

Fake title riding on ChatGPT popularity. I think that it should be updated to something like:Open source implementation for LLaMA-based chat bot*Open source implementation for LLaMA-based ChatGPT alternative*

评论 #34957413 未加载

holtkam2超过 2 年前

I don't have a decent gpu at my disposal... has anyone tried to run LLaMA on an EC2 GPU instance? If so, which instance type? (I don't wanna overpay)

评论 #34960201 未加载

jstsch超过 2 年前

Are LLaMA's weights generally available/floating around yet?

评论 #34957058 未加载

评论 #34957374 未加载

bethecloud超过 2 年前

open-assistant from LAION is in the process of creating an OSS RLHF dataset for a personal assistant, may be useful for this project

threevox超过 2 年前

Can someone leak the weights please

vivegi超过 2 年前

This obsession with locking up model weights behind a gate-keeping application form and calling it open source is weird. I don't know who the high priests are trying to fool.If your model is really that good, unleash it into the open so that others can truly evaluate it-warts and all-and help improve it by identifying the flaws.

评论 #35006564 未加载

louis030195超过 2 年前

The difference between invention & innovation is that innovation is when you ship your product to the masses. Can I query llama in a line of code? No

gersh超过 2 年前

Is the trained model available to download anywhere?

评论 #34957750 未加载

rvz超过 2 年前

> LLaMA is creating a lot of excitement because it is smaller than GPT-3 but has better performance. For example, LLaMA's 13B architecture outperforms GPT-3 despite being 10 times smaller.Exactly. Best part is that it is open-source.That is worth getting excited about. Not a AI SaaS API owned by a so-called pseudo-non profit company which struggles on API uptime and availablity, just like GitHub.This is the 'revolution' you are looking for that changes everything. Not ChatGPT.

评论 #34957336 未加载

评论 #34958291 未加载

karmasimida超过 2 年前

Is it of any good?

Jack5500超过 2 年前

This seems like a great first step to a truly open source LLM

评论 #34962341 未加载

评论 #34958851 未加载