科技回声 (Tech Echo)

Mistral: Our first AI endpoints are available in early access

491 points · by georgehill · over 1 year ago

22 comments

brandall10 · over 1 year ago
I'm surprised this isn't firmly attached to the top of HN right now for the entire day.

This is a tiny company (appears to be 30 or so people?) that just scored a $2B valuation, produced easily the most performant 7B model, and an 8x7B MoE model that performs at the level of a 70B while requiring the inference compute of a 14B.

I feel this could be a bigger potential threat to OpenAI than Google or Anthropic. I gather that with the huge recent investment they'll be able to a) scale out to a reasonable traffic load in the near future and b) attract the best and brightest researchers put off by the chest-puffing and drama that has been front and center in this industry.
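The "8x7B MoE that runs like a ~14B" claim comes down to top-k routing: a small gating network picks roughly 2 of the 8 expert FFNs per token, so per-token compute scales with the 2 active experts rather than all 8. A toy NumPy sketch of top-2 routing; shapes, the gating rule, and the tiny linear "experts" are illustrative stand-ins, not the model's actual code:

```python
import numpy as np

def top2_moe(x, gate_w, experts):
    """Route each token to its top-2 experts and mix their outputs."""
    logits = x @ gate_w                             # (tokens, n_experts)
    top2 = np.argsort(logits, axis=-1)[:, -2:]      # indices of the 2 best experts
    # softmax over just the two selected gate logits
    sel = np.take_along_axis(logits, top2, axis=-1)
    w = np.exp(sel - sel.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        for k in range(2):
            e = top2[t, k]
            out[t] += w[t, k] * experts[e](x[t])    # only 2 of 8 experts run
    return out, top2

rng = np.random.default_rng(0)
d, n_exp, toks = 16, 8, 4
gate_w = rng.normal(size=(d, n_exp))
# each "expert" is a tiny linear map standing in for a full 7B-scale FFN
mats = [rng.normal(size=(d, d)) for _ in range(n_exp)]
experts = [lambda v, M=M: v @ M for M in mats]
x = rng.normal(size=(toks, d))
out, top2 = top2_moe(x, gate_w, experts)
```

Every expert must still be resident in memory, which is why memory cost tracks the full 8x7B even though compute tracks the active pair.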
Palmik · over 1 year ago
This is extremely impressive if benchmarks translate to real-world performance [1]. Mistral-medium beats GPT-3.5 and also Gemini Pro (Google's best available model) by a huge margin on all available comparable benchmarks: https://screenbud.com/shot/c0d904e3-24a3-4c23-a1e4-2f18bc0215cf/image.png

[1] I would expect the real-world performance gap to be even larger, if Mistral 7B is anything to go by. The fact that safety filters are opt-in is a huge benefit (even for safe applications).
rrsp · over 1 year ago
Pricing has been released too: https://docs.mistral.ai/platform/pricing

Per 1 million output tokens:

- Mistral-medium: $8
- Mistral-small: $1.94
- gpt-3.5-turbo-1106: $2
- gpt-4-1106-preview: $30
- gpt-4: $60
- gpt-4-32k: $120

This suggests they're reasonably confident that mistral-medium is substantially better than GPT-3.5.
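Using the output-token prices quoted in the comment above (input-token prices, which the comment omits, would change absolute totals), a quick cost comparison for a given volume of generated text:

```python
# Output-token prices quoted in the comment, in USD per 1M tokens.
PRICE_PER_M = {
    "mistral-medium": 8.00,
    "mistral-small": 1.94,
    "gpt-3.5-turbo-1106": 2.00,
    "gpt-4-1106-preview": 30.00,
    "gpt-4": 60.00,
    "gpt-4-32k": 120.00,
}

def output_cost(model: str, tokens: int) -> float:
    """Cost in USD for `tokens` output tokens on `model`."""
    return PRICE_PER_M[model] * tokens / 1_000_000

# e.g. cost of generating 250k output tokens, cheapest first:
for model, _ in sorted(PRICE_PER_M.items(), key=lambda kv: kv[1]):
    print(f"{model:22s} ${output_cost(model, 250_000):7.2f}")
```

At these list prices, mistral-medium sits at 4x gpt-3.5-turbo but roughly a quarter of gpt-4-1106-preview, which is the pricing bet the comment is pointing at.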
yzydserd · over 1 year ago
"Endpoints are available in early access" is, in reality, "we have a waitlist (of unspecified length) for early access to endpoints."

When I try to access: "Access to our API is currently invitation-only, but we'll let you know when you can subscribe to get access to our best models."
tarruda · over 1 year ago
> Mistral-embed, our embedding endpoint, serves an embedding model with a 1024 embedding dimension. Our embedding model has been designed with retrieval capabilities in mind. It achieves a retrieval score of 55.26 on MTEB.

Is there any information on whether this embedding model is, or will be, open source?
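For context, a call to an embedding endpoint like this would look roughly as follows. The URL path, field names, and the `mistral-embed` model id here are assumptions based on the OpenAI-style request convention the announcement mentions; the snippet only builds the request rather than sending it:

```python
import json

API_URL = "https://api.mistral.ai/v1/embeddings"  # assumed endpoint path

def build_embedding_request(texts, model="mistral-embed", api_key="YOUR_KEY"):
    """Build (headers, body) for an embedding call; actually POSTing is left out."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = json.dumps({"model": model, "input": texts})
    return headers, body

headers, body = build_embedding_request(["hello world"])
# Per the announcement, each returned vector should have dimension 1024.
EXPECTED_DIM = 1024
```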
marviel · over 1 year ago
> Our API follows the specifications of the popular chat interface initially proposed by our dearest competitor.

I like it; it also made me laugh.
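The "popular chat interface" being joked about is the messages-array chat-completions shape, so the same request body should work against either service by swapping the base URL. A hedged sketch of such a payload; the endpoint path and `mistral-tiny` model name are assumptions, and no request is actually sent:

```python
import json

def chat_request(prompt, model="mistral-tiny", base="https://api.mistral.ai"):
    """Build (url, body) for an OpenAI-style chat-completions call."""
    url = f"{base}/v1/chat/completions"
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
        "temperature": 0.7,
    }
    return url, json.dumps(payload)

url, payload = chat_request("Say hello")
```

The practical upside is that existing client code written for the incumbent API needs only a base-URL (and key) change.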
georgehill · over 1 year ago
> Mistral-medium outperforms GPT-4 on the Winogrande benchmark: 88% vs 87.5%.

From: https://twitter.com/yupiop12/status/1734137238177698106
ingojoseph · over 1 year ago
It's interesting that many platforms, like Lemonfox.ai, offer Mistral fine-tunes at lower prices. They have also already announced a Mistral 8x7B API. This raises the question of whether Mistral will still publish future models (like the Medium version) as open source if they want to make money.
lioeters · over 1 year ago
By chance I noticed that Fabrice Bellard's TextSynth server has newly added support for the Mistral 7B model.

> 2023-10-21: CUDA support in the Windows version, mistral model support. Speculative sampling is supported. BNF grammar and JSON schema sampling.

> mistral_7B_instruct_q4 - 3.9GB - Mistral 7B chat model

https://bellard.org/ts_server/
georgehill · over 1 year ago
> Mistral-medium. Our highest-quality endpoint currently serves a prototype model, that is currently among the top serviced models available based on standard benchmarks.

This is interesting. This model outperforms ChatGPT 3.5. I'm not sure what type of model it is, and it is not open-sourced.
rgbrgb · over 1 year ago
> Mistral-tiny. Our most cost-effective endpoint currently serves Mistral 7B Instruct v0.2, a new minor release of Mistral 7B Instruct. Mistral-tiny only works in English. It obtains 7.6 on MT-Bench. The instructed model can be downloaded here.

The "download here" link points to v0.1 [0]. Oversight, or are they holding back the state-of-the-art tiny model?

[0]: https://huggingface.co/mistralai/Mistral-7B-v0.1
munro · over 1 year ago
Wow, beating ChatGPT-3.5 is really an accomplishment. Congrats! That's literally the default of OpenAI's product. I had to fall back to GPT-3.5 the other day because I ran out of usage on ChatGPT-4 (playing 20 questions, lol). So I really hope someone can catch up to GPT-4! For me, GPT-3.5 isn't good enough for daily things; it gets too much wrong.
ur-whale · over 1 year ago
This raises the question: does anyone know what kind of infrastructure something like gpt-4-32k actually runs on?

When I type something into the prompt, what actually happens behind the scenes? Is the answer computed on a single NVIDIA GPU, or on dedicated hardware not known to the general public? How big is that GPU, and how much RAM does it have? Is my conversation run by a single GPU instance dedicated to me, or is the GPU shared by multiple users? If the latter, how many queries per second can a single GPU handle? Where is that GPU; does it run in an Azure data center? Is the API usage cost actually reflective of the hardware cost, or is it heavily subsidized? Is a single GPU's RAM size the bottleneck for how large a model can be?

Is any of that info public?
ComputerGuru · over 1 year ago
I'm surprised no one has commented on the context-size limitations of these offerings compared to the other models. The sliding-window technique really does effectively cripple recall to approximately 8k tokens, which is plainly insufficient for a lot of tasks.

All these llama2 derivatives are only effective if you fine-tune them, not just because of the parameter count, as people keep harping, but perhaps even more so because of the tiny context available.

A lot of my GPT-3.5/4 usage involves "one-offs" where it would be faster to do the thing by hand than to train/fine-tune first, made possible by the generous context window and some amount of modest context stuffing (which drives up input-token costs but is still a big win).
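The recall limit described here follows directly from the attention mask: with window size w, token i attends only to the previous w positions, so anything older is reachable only indirectly, hopping layer by layer through intermediate tokens. A minimal NumPy sketch of such a mask (sizes are illustrative):

```python
import numpy as np

def sliding_window_mask(n_tokens: int, window: int) -> np.ndarray:
    """Causal attention mask where token i sees only positions [i-window+1, i]."""
    i = np.arange(n_tokens)[:, None]   # query positions (rows)
    j = np.arange(n_tokens)[None, :]   # key positions (columns)
    return (j <= i) & (j > i - window)

mask = sliding_window_mask(10, window=4)
# Row 9 can attend only to positions 6..9; position 0 is outside its
# direct window, so its information must propagate through other tokens.
```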
nojvek · over 1 year ago
Competition is how the world moves forward. I'm super glad small and big players have competitive models.

The thing that makes me a bit sad is how announcements show benchmarks, but the way they test is tweaked to make their metrics look favorable. They aren't apples-to-apples benchmarks across different paper publications.

Super grateful that they openly share the weights and code under the Apache license.

"25-shot" means the model is shown 25 worked examples in the prompt before answering, which is yet another setting that varies between reports.

Is anyone working on an open benchmark that takes the major models and compares them apples to apples?
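For reference, "k-shot" in benchmark reporting conventionally means k solved examples are prepended to the prompt before the real question, which is one of the settings that makes cross-paper numbers hard to compare. A minimal illustration with made-up examples (the Q/A template is just one common convention):

```python
def k_shot_prompt(examples, question, k=25):
    """Prepend up to k solved examples to the question ('k-shot' evaluation)."""
    parts = [f"Q: {q}\nA: {a}" for q, a in examples[:k]]
    parts.append(f"Q: {question}\nA:")   # model completes after the final 'A:'
    return "\n\n".join(parts)

# 30 toy arithmetic examples; only the first 25 are used.
demo = [(f"what is {i}+{i}?", str(2 * i)) for i in range(30)]
prompt = k_shot_prompt(demo, "what is 7+5?", k=25)
```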
davidkunz · over 1 year ago
Well done, Mistral! "Show, don't tell" par excellence.
ianpurton · over 1 year ago
So, what would be the hardware setup for this? Can it run on 1 GPU and swap between experts?
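A back-of-envelope using the 8x7B / ~14B-active figures from this thread suggests single-GPU fit is mostly a memory question, not a compute one. This is a deliberately naive upper bound: real MoE layers share the attention weights across experts, so the true parameter total is somewhat lower than 8 x 7B:

```python
def moe_memory_gb(n_experts=8, params_per_expert_b=7.0, bytes_per_param=0.5):
    """Naive memory bound with all experts resident (0.5 bytes/param ~ 4-bit).

    Billions of params x bytes/param gives GB directly.
    """
    total_params_b = n_experts * params_per_expert_b
    return total_params_b * bytes_per_param

# All 8 experts must sit in memory even though ~2 run per token, so memory
# scales like 8x7B while per-token compute scales like ~14B.
print(f"4-bit: ~{moe_memory_gb():.0f} GB, fp16: ~{moe_memory_gb(bytes_per_param=2):.0f} GB")
```

So a 4-bit quantization lands in the range of a single large (~40-80 GB) accelerator, while fp16 weights alone would need multiple GPUs; swapping experts from host RAM per token is possible in principle but routing changes every token, making it slow.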
mark_l_watson · over 1 year ago
I just signed up for the API waiting list. I have been enjoying running Mistral-7B on my home system, and it feels right to give them some of my paid API business.
infecto · over 1 year ago
Until we can verify, I think it's safe to place this in the smoke category. It's invite-only, so until it hits GA it's impossible to know whether the pricing is real and what the true capabilities of the offering are.
hospitalJail · over 1 year ago
Is there any Mistral model fine-tuned on ChatGPT-4 outputs?
jacquesm · over 1 year ago
Dupe, see: https://news.ycombinator.com/item?id=38598559
LanzVonL · over 1 year ago
Two of the top three stories on HN today! What an achievement. Is mistral.ai a YC company?