If you want to run Mixtral 8x7B locally you can use llama.cpp (directly, or via any of the libraries/interfaces built on it, such as text-generation-webui) with <a href="https://huggingface.co/TheBloke/Nous-Hermes-2-Mixtral-8x7B-SFT-GGUF" rel="nofollow">https://huggingface.co/TheBloke/Nous-Hermes-2-Mixtral-8x7B-S...</a>.<p>The smallest quantized version (2-bit) needs 20GB of RAM (which can be offloaded onto the VRAM of a decent 4090 GPU). The 4-bit quantized versions are the largest models that can just about fit on a 32GB system (29GB-31GB). The 6-bit (41GB) and 8-bit (52GB) models need a 64GB system. You would need multiple GPUs with shared memory if you wanted to offload the higher-precision models to VRAM.<p>I've experimented with the 7B and 13B models, but haven't tried these or other larger models yet.
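Those sizes follow roughly from parameter count times effective bits per weight. A back-of-the-envelope sketch (the effective-bpw figures are assumptions on my part — the k-quant formats keep some tensors at higher precision, so effective bpw runs above the nominal bit width):

```python
PARAMS_B = 46.7  # Mixtral 8x7B total parameter count, in billions

def gguf_size_gb(params_b: float, effective_bpw: float) -> float:
    """Rough GGUF file size: parameters * effective bits per weight / 8."""
    return params_b * effective_bpw / 8

# Effective bpw values below are approximations, not official numbers.
for name, bpw in [("Q2_K", 3.4), ("Q4_K_M", 4.9), ("Q6_K", 6.6), ("Q8_0", 8.5)]:
    print(f"{name}: ~{gguf_size_gb(PARAMS_B, bpw):.0f} GB")
```

Add a couple of GB on top for the KV cache and runtime overhead, which is why the "RAM needed" figures above run a bit higher than the raw file sizes.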
Kudos to Brave (for this and other privacy features):<p><i>Unlinkable subscription: If you sign up for Leo Premium, you’re issued unlinkable tokens that validate your subscription when using Leo. This means that Brave can never connect your purchase details with your usage of the product, an extra step that ensures your activity is private to you and only you. The email you used to create your account is unlinkable to your day-to-day use of Leo, making this a uniquely private credentialing experience.</i>
Interesting, I must have missed the first Leo announcement. I really like how privacy-conscious it is. They don't store any chat records, which is exactly what I want.
It's interesting that they made it so you can ask LLM queries right from the omnibar. I wonder if they'll eventually come up with some heuristic to determine whether a query should be sent directly to the LLM or to the default search provider.
If you have used GPT-4 and then use Mistral, it's like looking at a Retina display and then having to go back to a low-res screen. You're always thinking "but GPT-4 could do this though".
Does anyone know of a good Chrome extension for AI page summarization? I tried a bunch of the top Google search hits; they work fine but are really bloated with superfluous features.
Quick question: I have 24GB of VRAM and need to close everything to run Mixtral at 4-bit quant with bitsandbytes. Is there no way to run it at 3.5 bpw on Windows?
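As far as I know, bitsandbytes only offers 8-bit and 4-bit quantization, so there's no 3.5-bpw option there; fractional bit widths like that come from GGUF (llama.cpp) or EXL2 quants instead. If you stay on the bitsandbytes path, a config fragment along these lines (model name and flags assumed from the standard transformers API) squeezes out a bit of extra headroom:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",            # NF4 generally beats plain FP4 at the same size
    bnb_4bit_compute_dtype=torch.float16,
    bnb_4bit_use_double_quant=True,       # quantizes the quantization constants, ~0.4 bits/param saved
)
model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mixtral-8x7B-Instruct-v0.1",
    quantization_config=bnb,
    device_map="auto",  # spills layers to CPU RAM when VRAM runs out
)
```

With `device_map="auto"` the experts that don't fit in 24GB land in system RAM, which is slower but means you don't have to close everything first.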
It's nice using Brave because you have Chromium's better performance, without having to worry about Manifest V2 dying and taking adblocking down with it. I have uBlock Origin enabled, but it has barely caught anything that slipped past the browser filters.