Ok, since this runs fully privately, how can I add my own private data? For example, I have 20+ years of email archive that I'd like to have ingested.
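I'm picturing something along the lines of a local embed-and-retrieve pipeline, roughly like the sketch below (assuming llama-cpp-python's embedding API and a plain mbox export; the model path and mbox filename are placeholders, and a real setup would chunk messages and use a proper vector store):

```python
# Rough sketch of a local "ingest my mail" pipeline (not part of llama-gpt itself).
# Assumes llama-cpp-python is installed and a local quantized model file exists.
import mailbox

import numpy as np
from llama_cpp import Llama

llm = Llama(model_path="./models/llama-2-7b-chat.bin", embedding=True)  # placeholder path

def embed(text: str) -> np.ndarray:
    # Truncate long mails so they fit the context window of the embedding pass.
    out = llm.create_embedding(text[:2000])
    return np.array(out["data"][0]["embedding"])

# Index: one vector per message (a real setup would chunk and persist these).
docs, vecs = [], []
for msg in mailbox.mbox("archive.mbox"):  # placeholder export of the archive
    body = msg.get_payload(decode=True)
    if not body:
        continue  # skip multipart containers
    text = body.decode("utf-8", errors="ignore")
    docs.append(text)
    vecs.append(embed(text))

def search(query: str, k: int = 3):
    # Cosine similarity between the query and every indexed message.
    q = embed(query)
    sims = [float(q @ v / (np.linalg.norm(q) * np.linalg.norm(v) + 1e-9)) for v in vecs]
    return [docs[i] for i in np.argsort(sims)[::-1][:k]]

# The retrieved mails would then be pasted into the chat prompt as context.
```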
Very cool, this looks like a combination of chatbot-ui and llama-cpp-python? A similar project I've been using is <a href="https://github.com/serge-chat/serge">https://github.com/serge-chat/serge</a>. Nous-Hermes-Llama2-13b is my daily driver and scores high on coding evaluations (<a href="https://huggingface.co/spaces/mike-ravkine/can-ai-code-results" rel="nofollow noreferrer">https://huggingface.co/spaces/mike-ravkine/can-ai-code-resul...</a>).
Nice project! I couldn't find this in the README.md: can I run this with a GPU? If so, what do I need to change? It seems like it's hardcoded to 0 in the run script: <a href="https://github.com/getumbrel/llama-gpt/blob/master/api/run.sh#L12">https://github.com/getumbrel/llama-gpt/blob/master/api/run.s...</a>
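For reference, with llama-cpp-python the relevant knob is usually n_gpu_layers. This is roughly the change I'd expect to need, as a sketch (assuming llama-cpp-python was compiled with cuBLAS or Metal support; the layer count and model path are made up):

```python
# Sketch: offloading layers to the GPU with llama-cpp-python.
# The server equivalent would be something like:
#   python3 -m llama_cpp.server --model <path> --n_gpu_layers 35
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-2-7b-chat.bin",  # placeholder path
    n_gpu_layers=35,  # 0 = CPU only; raise until you run out of VRAM
    n_ctx=4096,
)

out = llm("Q: Name three planets. A:", max_tokens=32)
print(out["choices"][0]["text"])
```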
I didn't see any info on how this is different from installing/running llama.cpp or koboldcpp. New offerings are awesome, of course, but what is it adding?
What is the advantage of this versus running something like <a href="https://github.com/simonw/llm">https://github.com/simonw/llm</a> , which also gives you options to e.g. use <a href="https://github.com/simonw/llm-mlc">https://github.com/simonw/llm-mlc</a> for accelerated inference?
So many projects still using GPT in their name.

Is the thinking here that OpenAI is not going to defend that trademark? Or just kicking the can down the road on rebranding until the C&D letter arrives?
(1) What are the best of the more creative / less lobotomized versions of Llama 2?
(2) What's the best way to get one of those running in a similarly easy way?
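For (2), ideally it would be about as simple as this rough sketch with llama-cpp-python (assuming you've already downloaded a quantized build of some Llama 2 fine-tune, e.g. the Nous-Hermes-Llama2-13b mentioned elsewhere in the thread; the file path is a placeholder):

```python
# Sketch: chatting with a locally downloaded Llama 2 fine-tune via llama-cpp-python.
from llama_cpp import Llama

llm = Llama(model_path="./models/nous-hermes-llama2-13b.q4_0.bin", n_ctx=4096)  # placeholder path

resp = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Write a limerick about local LLMs."},
    ],
    max_tokens=200,
)
print(resp["choices"][0]["message"]["content"])
```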