TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Viking 7B: open LLM for the Nordic languages trained on AMD GPUs

113 点作者 reqo12 个月前

12 条评论

jug12 个月前
If you&#x27;re interested in this, don&#x27;t miss AI Sweden&#x27;s GPT-SW3 @ 126M to 40B trained on Nordic languages (not Finnish) and English. It&#x27;s funded by the Swedish government and partners, and freely available with a pretty lively Discord for ongoing AI research focusing on the Nordic languages. I think Viking is called &quot;first&quot; because it includes Finnish, because otherwise, GPT-SW3 was released earlier.<p><a href="https:&#x2F;&#x2F;huggingface.co&#x2F;AI-Sweden-Models" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;AI-Sweden-Models</a>
评论 #40372347 未加载
smokracek12 个月前
First thing I notice is that Finnish is part of a completely different language family from the other Nordic languages and English (Uralic vs. Indo-European). I wonder to what extent this affects the effectiveness of their low-resource training. Finnish is highly agglutinative, adding prefixes and suffixes to modify a root. My (amateur) take is that the tokenization and attention patterns may differ a lot? Would love to see more educated people than I discuss this.
评论 #40371230 未加载
评论 #40371392 未加载
评论 #40371554 未加载
larodi12 个月前
The fact it was trained on HPC which covers 20% heat consumption in a city is absolutely wild and on par with how wild it is to have English&#x2F;Nordic model.<p>“ Further emphasizing digital sovereignty, Viking is trained on the EuroHPC supercomputer LUMI, utilizing up to 4096 AMD MI-250X GPUs. LUMI is not only Europe’s most powerful supercomputer and the 5th most powerful in the world, but also the 3rd greenest supercomputer among the top 500 supercomputers. LUMI’s energy consumption is covered with power produced 100% with hydroelectricity, and the waste heat of LUMI will account for about 20 percent of the district heating in the surrounding city of Kajaani. ”
ganzuul12 个月前
Great talking points. These are highly relevant subjects and I&#x27;m delighted we in the Nordics are keeping up with current developments. This work is important for preserving our culture.<p>I hope to see this used to generate a customized curriculum for each neurodiverse child so that we can live in a more equitable society.
评论 #40370037 未加载
bangaladore12 个月前
I have had this question. How much better would common LLMs (Llama, GPTN) be if they were only trained in one language? I have to assume they would perform better, but I might be wrong.
评论 #40369447 未加载
评论 #40369438 未加载
评论 #40371610 未加载
评论 #40373494 未加载
评论 #40369924 未加载
matsemann12 个月前
Would an LLM trained on a smaller language have better cultural awareness etc than one trained in English? Because English is written all over the world by all kinds of people, an English LLM will average that (and for instance feel a bit off for an American). But a Norwegian LLM for instance, trained on a language mostly written by Norwegians, would that feel more natural to me in comparison?
jarbus12 个月前
Would love to know more about their experience training on AMD GPUs. Was it just as seamless as using Cuda?
评论 #40369803 未加载
评论 #40369784 未加载
评论 #40369884 未加载
Bedon29212 个月前
I cannot seem to find a link to the actual model from this page or anywhere on the website. This appears to be it: <a href="https:&#x2F;&#x2F;huggingface.co&#x2F;LumiOpen&#x2F;Viking-7B" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;LumiOpen&#x2F;Viking-7B</a>
halgir12 个月前
&gt; extends to include Danish, Finnish, Norwegian, Icelandic, Swedish<p>* cries in Faroese *
dmichulke12 个月前
Is there something similar for romance or Germanic languages?<p>And how did they decide that, e.g., German or Dutch would make the model worse?
评论 #40369406 未加载
评论 #40370282 未加载
melenaboija12 个月前
Although not nordic not including basque which I guess could also be considered an European low-resource language.
评论 #40371305 未加载
ChrisArchitect12 个月前
double slash in the shared link probably not ideal (though inconsequential)<p><a href="https:&#x2F;&#x2F;www.silo.ai&#x2F;blog&#x2F;viking-7b-the-first-open-llm-for-the-nordic-languages" rel="nofollow">https:&#x2F;&#x2F;www.silo.ai&#x2F;blog&#x2F;viking-7b-the-first-open-llm-for-th...</a>