DeepSeek just released this week their new DeepSeek-V2.5 model, which is a "combination" of their DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724 models according to their release tweet. [0]<p>They claim to surpass GPT-4-Turbo, Claude 3 Opus, and the previous DeepSeek-Coder-V2 model in coding, scripting, and math tasks on their official website [1] and it's fully open sourced [2] with a 128k context window.<p>It still doesn't show on the LMSYS Chatbot Arena coding leaderboard, which is common with new models, but livebench [3] has it ranked 7th for their coding benchmark, which is the highest ranking for any open source model (not counting mistral-large-2407 as fully open sourced since weights not public), beating meta-llama-3.1-405b-instruct-turbo.<p>Since this model's strength is coding, I've also made it available for free to anyone who wants to try it as a coding copilot in VS Code [4] (Disclaimer: I'm a co-founder at Double and this is my extension).<p>Hope others find this as exciting as I do, it's great to see open source models continue to improve!<p>[0] - <a href="https://x.com/deepseek_ai/status/1832026579180163260" rel="nofollow">https://x.com/deepseek_ai/status/1832026579180163260</a><p>[1] - <a href="https://www.deepseek.com/" rel="nofollow">https://www.deepseek.com/</a><p>[2] - <a href="https://huggingface.co/deepseek-ai/DeepSeek-V2.5" rel="nofollow">https://huggingface.co/deepseek-ai/DeepSeek-V2.5</a><p>[3] - <a href="https://livebench.ai/" rel="nofollow">https://livebench.ai/</a><p>[4] - <a href="https://double.bot/">https://double.bot/</a>