TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Zephyr 141B, a Mixtral 8x22B fine-tune, is now available in Hugging Chat

30 pointsby osansevieroabout 1 year ago

3 comments

osansevieroabout 1 year ago
Zephyr 141B is a Mixtral 8x22B fine-tune. Here are some interesting details<p>- Base model: Mixtral 8x22B, 8 experts, 141B total params, 35B activated params<p>- Fine-tuned with ORPO, a new alignment algorithm with no SFT step (hence much faster than DPO&#x2F;PPO)<p>- Trained with 7K open data instances -&gt; high-quality, synthetic, multi-turn<p>- Apache 2<p>Everything is open:<p>- Final Model: <a href="https:&#x2F;&#x2F;huggingface.co&#x2F;HuggingFaceH4&#x2F;zephyr-orpo-141b-A35b-v0.1" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;HuggingFaceH4&#x2F;zephyr-orpo-141b-A35b-v...</a><p>- Base Model: <a href="https:&#x2F;&#x2F;huggingface.co&#x2F;mistral-community&#x2F;Mixtral-8x22B-v0.1" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;mistral-community&#x2F;Mixtral-8x22B-v0.1</a><p>- Fine-tune data: <a href="https:&#x2F;&#x2F;huggingface.co&#x2F;datasets&#x2F;argilla&#x2F;distilabel-capybara-dpo-7k-binarized" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;datasets&#x2F;argilla&#x2F;distilabel-capybara-...</a><p>- Recipe&#x2F;code to train the model: <a href="https:&#x2F;&#x2F;huggingface.co&#x2F;datasets&#x2F;argilla&#x2F;distilabel-capybara-dpo-7k-binarized" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;datasets&#x2F;argilla&#x2F;distilabel-capybara-...</a><p>- Open-source inference engine: <a href="https:&#x2F;&#x2F;github.com&#x2F;huggingface&#x2F;text-generation-inference">https:&#x2F;&#x2F;github.com&#x2F;huggingface&#x2F;text-generation-inference</a><p>- Open-source UI code <a href="https:&#x2F;&#x2F;github.com&#x2F;huggingface&#x2F;chat-ui">https:&#x2F;&#x2F;github.com&#x2F;huggingface&#x2F;chat-ui</a><p>Have fun!
评论 #40015145 未加载
评论 #40015440 未加载
评论 #40015703 未加载
adtabout 1 year ago
Added, thanks.<p><a href="https:&#x2F;&#x2F;lifearchitect.ai&#x2F;models-table&#x2F;" rel="nofollow">https:&#x2F;&#x2F;lifearchitect.ai&#x2F;models-table&#x2F;</a>
mjewkesabout 1 year ago
My current favorite “LLM breaker” below. GPT4, Claude, and this all fail.<p>—-<p>Apples are better than bananas. Cherries are worse than apples. Are cherries better than bananas?
评论 #40017919 未加载
评论 #40016779 未加载