It keeps saying the phrase “model you can run locally”, but despite days of trying, I failed to compile any of the GitHub repos associated with these models.<p>None of the Python dependencies are pinned to specific versions, and “something” happened to the CUDA compatibility of one of them about a month ago. The original developers “got lucky,” but now nobody else can compile this stuff.<p>After years of using only C# and Rust, both of which have sane package managers with semantic versioning, lock files, reproducible builds, and even SHA checksums, the Python package ecosystem looks ridiculously immature, even childish.<p>Seriously, can anyone here build a Docker image for running these models on CUDA? I think right now it’s borderline impossible, but I’d be happy to be corrected…
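For what it’s worth, a minimal sanity check along these lines (a rough sketch, assuming PyTorch is the dependency whose CUDA build broke) at least tells you whether the installed wheel matches your driver before you burn another day rebuilding:<p><pre><code>  # Rough sketch: does the installed PyTorch wheel actually see the GPU?
  # Assumes PyTorch is the dependency in question; adjust for your setup.
  import torch

  print("torch version:", torch.__version__)
  print("built against CUDA:", torch.version.cuda)  # None => CPU-only wheel
  if torch.cuda.is_available():
      print("driver sees:", torch.cuda.get_device_name(0))
  else:
      print("CUDA not available: wheel/driver mismatch or CPU-only build")
</code></pre>
It doesn’t fix the missing lock files, but combined with hash-pinned requirements (pip-tools’ pip-compile --generate-hashes, installed with pip install --require-hashes) it gets you part of the way toward reproducibility.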
<p><pre><code> > Our system thinks you might be a robot!
We're really sorry about this, but it's getting harder and harder to tell the difference between humans and bots these days.
</code></pre>
Yeah, fuck you too. Come on, really, why put this in front of a _blog post_? Is it that hard to keep up with the bot requests when serving a static page?
Most places that recommend llama.cpp for Mac fail to mention <a href="https://github.com/jankais3r/LLaMA_MPS">https://github.com/jankais3r/LLaMA_MPS</a>, which runs unquantized 7B and 13B models on the M1/M2 GPU directly. It's slightly slower (not by a lot), and uses significantly less energy. To me, the win of not having to quantize while also not melting a hole in my lap is huge; I wish more people knew about it.
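For anyone wondering what “on the GPU directly” means here: as I understand it, the repo runs the stock unquantized weights through PyTorch’s Metal (MPS) backend. A minimal sketch of that mechanism (not the repo’s actual code):<p><pre><code>  # Minimal sketch of PyTorch's MPS backend, the mechanism LLaMA_MPS relies on.
  # Not the repo's actual code.
  import torch

  device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")
  x = torch.randn(1, 4096, device=device)  # tensor lives on the M1/M2 GPU
  w = torch.randn(4096, 4096, device=device)
  y = x @ w                                # matmul runs on the Metal backend
  print(y.device)                          # -> mps:0 on Apple Silicon
</code></pre>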
I'm running Vicuna (a LLaMA variant) on my iPhone right now. <a href="https://twitter.com/simonw/status/1652358994214928384" rel="nofollow">https://twitter.com/simonw/status/1652358994214928384</a><p>The same team that built that iPhone app - MLC - also got Vicuna running directly in a web browser using Web GPU: <a href="https://simonwillison.net/2023/Apr/16/web-llm/" rel="nofollow">https://simonwillison.net/2023/Apr/16/web-llm/</a>
There is also CodyCapybara (a 7B model fine-tuned on code competitions), the “uncensored” Vicuna, OpenAssistant 13B (which is said to be very good), various non-English tunes, medalpaca... the release pace is maddening.
I'll never understand why everyone is spending so much time on a model you cannot use commercially (at all).<p>And beyond that, given the license, most of us can't even use the model for research or personal use.