This looks great!

If you're looking to do the same with open-source code, you could likely run Ollama and a UI (see the sketch below):

https://github.com/jmorganca/ollama
+
https://github.com/ollama-webui/ollama-webui
I really like LM Studio and had it open when I came across this post. LM Studio is an interesting mixture of:

- A local model runtime
- A model catalog
- A UI to chat with the models easily
- An OpenAI-compatible API

And it has several plugins, such as one for RAG (using ChromaDB), among others.

Personally, I think the positioning is very interesting. They're well placed to take advantage of new capabilities in the open-source ecosystem.

It's still unfortunate that it is not itself open-source.
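To illustrate that OpenAI-compatible API, here is a minimal sketch in Python; it assumes the local server is running on LM Studio's default port (1234), and the model name and prompt are placeholders:

```
# Minimal sketch: chat with a locally loaded model through
# LM Studio's OpenAI-compatible server endpoint.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:1234/v1",
    api_key="not-needed",  # the local server ignores the key
)

response = client.chat.completions.create(
    model="local-model",  # LM Studio serves whichever model is loaded
    messages=[{"role": "user", "content": "Summarize GGUF in one sentence."}],
    temperature=0.7,
)
print(response.choices[0].message.content)
```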
Curious about this, I just downloaded it. I want to try uncensored models.

I have a question. Looking for the most popular "uncensored" model, I found "TheBloke/Luna-AI-Llama2-Uncensored-GGML", but it has 14 files to download, each between 2 and 7 GB. I just downloaded the first one: https://imgur.com/a/DE2byOB

I tried the model and it works: https://imgur.com/a/2vtPcui

Should I download all 14 files to get better results?

Also, when asked how to make a bomb, it looks like at least this model isn't "uncensored": https://imgur.com/a/iYz7VYQ
This app could use some simple UI improvements:

- The chat input field shows its normal "write here" state even when no chat is actually selected. I thought my keyboard was broken until I discovered that.
- I didn't find a way to enable CUDA acceleration before loading a model; I only managed to set GPU-offloaded layers and use "relaunch to apply".
- Some HuggingFace models are simply not listed, and there's no indication why. I guess the models are actually curated, yet it's somehow presented as a HuggingFace browser?
- Scrolling in the accordion parts of the interface seems to respond only to the mouse wheel. Mine has a damaged wheel, and I couldn't find a way to reliably navigate to the bottom drawers.

That said, I really liked the server tab, which made initial debugging very easy.
I don't mean this as a criticism; I'm just curious, because I work in this space too: who is this for? What is the niche of people savvy enough to use this who can't run one of the many open-source local LLM tools? The screenshot suggests it exposes much of the complexity of configuration anyway. Is the value in the interface and in the management of conversations and models? It would be nice to see information, or even speculation, about the potential market segments of LLM users.
Amusing qualifications for the senior engineering roles they're hiring for:

"Deep understanding of what is a computer, what is computer software, and how the two relate."

Right after the senior ML role that requires people understand how to write "algorithms and programs."

Kinda hard to take those kinds of requirements seriously.
For my experiments with new self-hostable models on Linux, I've been using a script to download GGUF models from TheBloke on HuggingFace (currently, TheBloke's repository has 657 models in the GGUF format), which I feed to a simple program I wrote that invokes llama.cpp compiled with GPU support. The GGUF format and TheBloke are a blessing, because I'm able to check out new models basically on the day of their release (TheBloke is very fast) and without issues. However, the only frontend I have is the console.

Judging by their site, their setup is exactly the same as mine (which I implemented over a weekend), except that they also added a React-based UI on top. I wonder how they're planning to commercialize it, because it's pretty trivial to replicate, and there are already open-source UIs like oobabooga's text-generation-webui.
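That kind of workflow is easy to sketch. A minimal example using the huggingface_hub download API, assuming a llama.cpp binary compiled with GPU support in the working directory; the repo name, file name, layer count, and prompt are illustrative:

```
# Minimal sketch: fetch one GGUF quantization from a TheBloke repo
# and run it through a locally compiled llama.cpp binary.
import subprocess
from huggingface_hub import hf_hub_download

# Downloads into the local HF cache and returns the file path.
model_path = hf_hub_download(
    repo_id="TheBloke/Mistral-7B-Instruct-v0.1-GGUF",
    filename="mistral-7b-instruct-v0.1.Q4_K_M.gguf",
)

subprocess.run([
    "./llama-cli",    # llama.cpp's CLI (older builds name it ./main)
    "-m", model_path,
    "-ngl", "35",     # offload 35 layers to the GPU
    "-p", "Explain the GGUF format in two sentences.",
])
```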
This works, but I've noticed that my CPU usage goes up to about 30 percent, all in kernel time (Windows), after installing and opening this, even when it's not doing anything, and on two separate machines. I also hear the fan spinning fast on my laptop.

Killing the LM Studio process and re-opening it brought the ghost background usage down to about 5%.
<a href="https://github.com/enricoros/big-agi">https://github.com/enricoros/big-agi</a> seems better and is open source
The M1 is only three years old, and no one cares to support Intel Macs any more. There are surely a lot of them still out there. Are they really that much worse for running LLMs?
I'm late to the game on this, so I'll ask a stupid question.

As a contrived example, what happens if you feed the LoTR books, the Hobbit, the Silmarillion, and whatever else is germane into an LLM?

Is there a base, empty, "ignorant" LLM that is used as a seed?

Do you end up with a Middle Earth savant?

Just how does all this work?
LM Studio is great, if a bit daunting. If you're on a Mac and want a native open-source interface, try out FreeChat: https://www.freechat.run
This is what I use on Windows 10.

I have an HP Z440 with an E5-1630 v4 and 64 GB of quad-channel DDR4 RAM. I run LLMs on my CPU, and the 7-billion-parameter models spit out text faster than I can read it.

I wish it supported LMMs (multimodal models).
1. Mistral
2. Llama 2
3. Code Llama
4. Orca Mini
5. Vicuna

What can I do with any of these models that won't result in 50% hallucinations, code recommendations with APIs that don't exist, or essentially regurgitated, out-of-date StackOverflow answers (which it was trained on) for libraries whose versions/APIs have since changed?

Can somebody share one real use case they are using any of these models for?
If you just want to try something quick, you can try AskCyph LITE: https://askcyph.cypherchat.app. It runs AI models natively in the browser without requiring any installation.
Why is purple, or some shade of purple, the color of all AI products? For some reason, the landing pages of AI products immediately remind me of crypto products. This one doesn't have crypto vibes, but the color is purple. I don't get why.
Also, if you don't know what all the toggles are for, here is a simpler attempt by me: https://www.avapls.com/
LM Studio is great for running local LLMs and also supports an OpenAI-compatible API. If you need a more advanced UI/UX, you can use LM Studio with MindMac (https://mindmac.app); just check this video for details: https://www.youtube.com/watch?v=3KcVp5QQ1Ak
Newbie question: is this purely for hosting *text* language models? Is there something similar for image models? I.e., upload an image and have some local model provide detection/feedback on it.
I was working on something like this by myself, but ADHD deleted all my motivation. I'll definitely want to try this soon, especially since it's free!

I hope it supports my GPU (a 3060), though.
After the latest ChatGPT debacles and the poor performance I'm getting from GPT-4 Turbo, I'd really like a local version of GPT-4 or an equivalent. I'd even buy a new PC if I had to.
Considering that the code is closed-source and they can change the ToS at any time to send conversation data to their servers whenever they want, I would like to know: what would be the benefit of using this over ChatGPT?
Am I missing something here? I'm on a recent M2 machine, and every model I've downloaded fails to load immediately when I try to load it. Is there some way to get feedback on the reason for the failure, like a log file or something?

EDIT: The problem is that I'm on macOS 13.2 (Ventura). According to a message in Discord, the minimum version for some (most?) models is 13.6.
Is anyone using open-source models to actually get work done or to solve problems in their software architecture? So far I haven't found anything near the quality of GPT-4.