Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer)

43 points by rasbt, 11 months ago

4 comments

htrp, 11 months ago
Not Sebastian (who I assume is the OP), but his blog/Substack is also a great resource: https://magazine.sebastianraschka.com/
mdp2021, 11 months ago
Seems very good, thank you.

The channel https://www.youtube.com/@SebastianRaschka/videos contains hundreds of video lessons, seemingly originating from Sebastian Raschka teaching at Wisconsin-Madison Uni (before he went full-time entrepreneur).
yoouareperfect, 11 months ago
Is anyone training LLMs outside of Meta, OpenAI, etc.?

I don't much get the point. For huge models, it's impossible to outcompete them. For smaller models, isn't Mistral or LLaMA good enough?

What are other startups finetuning LLMs for?
oneshtein, 11 months ago
Can someone train an AI to perform all that?