I’ve found ggerganov’s work on llama.cpp to be amazing, and I’ve loved playing around with it. Has anyone used it in production, though? I’m sure there are some really cool use cases, but I haven’t seen any yet.
Some people are using it in production; r/localllama has examples.<p><a href="https://old.reddit.com/r/localllama" rel="nofollow noreferrer">https://old.reddit.com/r/localllama</a>