Let's build GPT: from scratch, in code, spelled out by Andrej Karpathy [video]

1110 points by georgehill, over 2 years ago

17 comments

ultrasounder, over 2 years ago
Just started watching and Andrej is an excellent "thingexplainer". His in-depth knowledge of the underlying atoms/bits comes through. An extra benefit of watching his lectures via https://karpathy.ai/zero-to-hero.html is the link to his "discord chat". This is a very active community and Andrej is very active there. So feel free to watch the lectures, cry through the assignments and come to the discord with questions. And they will be answered.
vorticalbox, over 2 years ago
He has a website [0] for these videos.

[0] https://karpathy.ai/zero-to-hero.html
xt00, over 2 years ago
I might be too new to this area -- but is this actually explaining how to create a small version of the actual trained model, not "using the trained model for X"? I can imagine that in the future people won't start from pure scratch and there will be building blocks that everybody starts from, but mostly I'm just wondering how hard it is to actually replicate what OpenAI has done if you had the money to pay for the training?
lvl102, over 2 years ago
A bit off topic, but the power of GPT (and DL in general) is in the data. Yet we’ve allowed private enterprises to control what should be distinctly public goods. I don’t know where we took the wrong turn within the past decade, but we desperately need to correct this mistake.
diego898, over 2 years ago
Andrej’s entire series is by far one of the most useful resources I’ve found. Even as an instructor of these topics myself, I learn something new about the material and about how to teach it!
cdzm, over 2 years ago
Karpathy also has a great Recipe for Training Neural Networks:

http://karpathy.github.io/2019/04/25/recipe/
yonz, over 2 years ago
Just finished this, need a part 2 for the PPO reward functionality
pottspotts, over 2 years ago
This is really great, thank you. I would love to see a real "from scratch" that doesn't use torch.py et al., though.
tailfra, over 2 years ago
Fantastic material! By the way, it has one of the simplest explanations of the difference between BatchNorm and LayerNorm.
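
For readers who want that difference in concrete form, here is a minimal sketch in plain Python. It is an illustration written for this thread, not code from the video: the function names are hypothetical, and it omits the learned scale and shift parameters and the running statistics that library implementations carry. The only point it shows is which axis the mean and variance are computed over.

```python
import math

def normalize(values, eps=1e-5):
    """Shift to zero mean and scale to (roughly) unit variance."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]

def batch_norm(x):
    """Normalize each feature (column) across the examples in the batch."""
    columns = [normalize(list(col)) for col in zip(*x)]
    # Transpose back so the result has the same (batch, features) layout.
    return [list(row) for row in zip(*columns)]

def layer_norm(x):
    """Normalize each example (row) across its own features."""
    return [normalize(row) for row in x]

# Two examples (rows) with three features (columns) each.
x = [
    [1.0, 2.0, 3.0],
    [4.0, 6.0, 8.0],
]

bn = batch_norm(x)  # statistics depend on the other examples in the batch
ln = layer_norm(x)  # statistics depend only on the example itself

print([[round(v, 3) for v in row] for row in bn])
print([[round(v, 3) for v in row] for row in ln])
```

With batch normalization every column ends up with zero mean, so an example's output changes if its batch neighbours change. With layer normalization every row ends up with zero mean, so each example is normalized independently of the rest of the batch.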
abraxas, over 2 years ago
For me he is the best educator in this space bar none. When Karpathy explains stuff it just clicks in my head.
thecleaner, over 2 years ago
His cs231n course is how I learnt about neural networks. And now on to learning GPT and Transformers.
davewasthere, over 2 years ago
Now be meta and build GPT with GPT.
hambes, over 2 years ago
I really hate the phrase "from scratch" in the title.
keepquestioning, over 2 years ago
The char-rnn guy does it again.
albert_e, over 2 years ago
I am a simple man. I see a video post by karpathy, I upvote and watch.

<end reddit-speak>

I discovered Andrej very recently and I am a huge fan. Kudos to this whole effort!

Two ideas --

1. While these explainers are outstanding -- I can think of supplementary material/presentation that could nicely complement these explanations if presented visually. Especially the concepts of multidimensional tensors.

Something like what 3B1B (or his followers that create 'Summer of Math Exposition' videos) does -- which is not a skill I have.

I am thinking of creating some visual slides (my forte), but would there be interest in making this a larger collaboration that creates explainers for "visual learners"?

2. There should really be a discussion forum for people who follow along with these "make more" tutorials -- to have discussions about each specific video, in fact each specific timestamped chapter of these videos in that context.

Is there a framework or tool that lets us integrate YouTube videos and timestamped chapters into a "discussion forum" -- whether a simple website or a discord/slack? Once again this is slightly outside my skillset, but if it appeals to people, maybe some ideas and effort can come together to make this happen?

EDIT: #facepalm -- I see there is already a discord [0] on the webpage [1] (but not likely tied to very specific chapters as I imagined -- but should be an excellent start anyway)

[0] - https://discord.gg/3zy8kqD9Cp
[1] - https://karpathy.ai/zero-to-hero.html
notjonheyman, over 2 years ago
I had the pleasure of working under Andrej a few years ago. The problems we were solving were very interesting, until this other guy who is a billionaire would throw a wrench (and fits) into the whole thing.
nlowell, over 2 years ago
Karpathy's videos (and blogs) are excellent. I wonder how history will reflect on his time at Tesla, however.