Zephyr 141B is a Mixtral 8x22B fine-tune. Here are some interesting details:<p>- Base model: Mixtral 8x22B, a mixture-of-experts model with 8 experts, 141B total params, 35B active params per token<p>- Fine-tuned with ORPO (Odds Ratio Preference Optimization), a new alignment algorithm with no separate SFT step (hence much faster than DPO/PPO)<p>- Trained on 7K open data instances: high-quality, synthetic, multi-turn<p>- Apache 2.0 license<p>Everything is open:<p>- Final model: <a href="https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1" rel="nofollow">https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v...</a><p>- Base model: <a href="https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1" rel="nofollow">https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1</a><p>- Fine-tune data: <a href="https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-7k-binarized" rel="nofollow">https://huggingface.co/datasets/argilla/distilabel-capybara-...</a><p>- Recipe/code to train the model: <a href="https://github.com/huggingface/alignment-handbook" rel="nofollow">https://github.com/huggingface/alignment-handbook</a><p>- Open-source inference engine: <a href="https://github.com/huggingface/text-generation-inference">https://github.com/huggingface/text-generation-inference</a><p>- Open-source UI code: <a href="https://github.com/huggingface/chat-ui">https://github.com/huggingface/chat-ui</a><p>Have fun!
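For intuition on how ORPO skips the SFT stage: it adds an odds-ratio preference penalty directly to the supervised loss on the chosen answer. A minimal numeric sketch of that single-pair objective (my own illustration from the ORPO paper's formulation, not the actual training code; the `lam` weight is an assumed hyperparameter):

```python
# Sketch of the ORPO objective for one (chosen, rejected) pair:
#   loss = NLL(chosen) + lam * (-log sigmoid(log-odds(chosen) - log-odds(rejected)))
# where odds(p) = p / (1 - p) and p is the average per-token probability.
import math

def log_odds(avg_logp: float) -> float:
    """log(p / (1 - p)) for p = exp(avg_logp), the average per-token log-prob."""
    p = math.exp(avg_logp)
    return math.log(p) - math.log(1.0 - p)

def orpo_loss(chosen_avg_logp: float, rejected_avg_logp: float, lam: float = 0.1) -> float:
    """Supervised (SFT-style) term on the chosen answer plus the odds-ratio penalty."""
    nll = -chosen_avg_logp  # standard negative log-likelihood on the chosen answer
    ratio = log_odds(chosen_avg_logp) - log_odds(rejected_avg_logp)
    penalty = -math.log(1.0 / (1.0 + math.exp(-ratio)))  # -log sigmoid(ratio)
    return nll + lam * penalty

# Assigning more probability to the chosen answer than the rejected one
# yields a lower loss than the reverse:
print(orpo_loss(-0.5, -2.0) < orpo_loss(-1.5, -1.0))
```

Because the preference signal piggybacks on the same forward pass as the NLL term, there is no reference model and no separate SFT checkpoint, which is where the speedup over DPO/PPO comes from.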
My current favorite “LLM breaker” is below. GPT-4, Claude, and this model all fail it.<p>--<p>Apples are better than bananas. Cherries are worse than apples. Are cherries better than bananas?
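The premises genuinely underdetermine the answer, which a brute-force check makes obvious. A small sketch that enumerates every strict ordering of the three fruits consistent with "apples > bananas" and "cherries < apples":

```python
# Enumerate all strict orderings of the three fruits and keep the ones
# consistent with the premises: apples > bananas, cherries < apples.
from itertools import permutations

consistent = [
    order for order in permutations(["apples", "bananas", "cherries"])
    # order[0] is best; a smaller index means "better"
    if order.index("apples") < order.index("bananas")
    and order.index("apples") < order.index("cherries")
]

# Is "cherries better than bananas" true in each surviving ordering?
answers = {order.index("cherries") < order.index("bananas") for order in consistent}
print(answers)  # {True, False}: both "yes" and "no" fit, so it's undetermined
```

So the only correct response is "it can't be determined", which is exactly the hedge the models tend to skip.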