TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Salesforce releases language model bigger than GPT-2 large

145 点作者 strin超过 5 年前

10 条评论

minimaxir超过 5 年前
I am working on a guide (should be released tomorrow) to easily get it up and running for personal use. Here&#x27;s my Twitter thread of current experiments with the model: <a href="https:&#x2F;&#x2F;twitter.com&#x2F;minimaxir&#x2F;status&#x2F;1173081315177975810" rel="nofollow">https:&#x2F;&#x2F;twitter.com&#x2F;minimaxir&#x2F;status&#x2F;1173081315177975810</a><p>I recommend reading the linked paper in the repo as it gives decent examples&#x2F;instructions on how to use the model. Although the size and architecture is comparable to GPT-2, the emphasis on conditional generation differentiates it.
评论 #20977823 未加载
评论 #20984445 未加载
评论 #20978736 未加载
评论 #20978020 未加载
purple_ducks超过 5 年前
Wow, that&#x27;s some license addendum:<p>&gt; This software should not be used to promote or profit from:<p>&gt; violence, hate, and division,<p>&gt; environmental destruction,<p>&gt; abuse of human rights, or<p>&gt; the destruction of people&#x27;s physical and mental health.
评论 #20978909 未加载
评论 #20978225 未加载
评论 #20978168 未加载
评论 #20978617 未加载
评论 #20978092 未加载
评论 #20978049 未加载
rdiddly超过 5 年前
Anyone have a real-world use case for something like this? I must admit I&#x27;m having trouble thinking of any that aren&#x27;t essentially deceptive. Because in my little biased world, I have no need of &quot;text&quot; per se, and what value any text has to me is closely linked to the fact that it came from a human.
评论 #20978659 未加载
评论 #20978698 未加载
评论 #20978961 未加载
skybrian超过 5 年前
From the blog post: &quot;Beyond the technical work to develop this model, we’ve also taken several steps to anticipate and mitigate malicious use cases where possible.&quot;<p>From the preprint, this seems to be doing some review before release and having a code of conduct in the GitHub repo.
novalis78超过 5 年前
The unicorn prompt is the new text generator lorem ipsum
visarga超过 5 年前
It was trained on 140GB of text on 256 TPUs for 2 weeks, the model being made of 48 transformer layers. I&#x27;m wondering when we will see a model trained on 1TB or 10TB of text.
评论 #20979455 未加载
foundart超过 5 年前
Could someone provide a high level summary of what this is for a technical person not conversant with the field?
评论 #20978544 未加载
buboard超过 5 年前
&gt; Advertisement<p>Yeap, This one is indistinguishable from reality
dan_mctree超过 5 年前
Are there any hardware reqs to work with this?
评论 #20979397 未加载
kevinwang超过 5 年前
Open AI did the right thing by not releasing their model; it&#x27;s disappointing that researchers are so callous about the potential effects of their research in the name of progress.
评论 #20978492 未加载
评论 #20978456 未加载
评论 #20978650 未加载