TechEcho (科技回声)
Implementing a ChatGPT-like LLM from scratch, step by step

739 points, by rasbt, over 1 year ago

21 comments

canyon289, over 1 year ago
As an additional resource, I'm writing a guidebook, though it's in various stages of completion.

The fine-tuning guide is the best resource so far: https://ravinkumar.com/GenAiGuidebook/language_models/finetuning.html
turnsout, over 1 year ago
This looks amazing @rasbt! Out of curiosity, is your primary goal to cultivate understanding and demystify, or to encourage people to build their own small models tailored to their needs?
AndrewKemendo, over 1 year ago
Writing a technical book in public is a level of anxiety I can’t imagine, so kudos to the author!
npalli, over 1 year ago

    import torch

From the first code sample, not quite from scratch :-)
wslh, over 1 year ago
I jumped to GitHub thinking this would be a free resource (with all due respect to the author's work).

What free resources are available and recommended in the "from scratch" vein?
whartung, over 1 year ago
Can I use any of the information in this book to learn about reinforcement learning?

My goal is to have something learn to land, like a lunar lander. Simple: start at 100 feet, thrust in one direction, keep trying until you stop making craters.

Then start adding variables: for example, now it's moving horizontally, so add a horizontal thruster.

Next, remove the horizontal thruster and let the lander pivot.

Etc.

I just have no idea how to start with this, but it seems like "mainstream" ML, so I'm curious whether this book would help with that.
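
A minimal sketch of the first stage described above (start at 100 feet, one vertical thruster, keep trying until the touchdown is gentle). It assumes made-up physics constants and uses plain random search over a single policy parameter, which is a stand-in for trial-and-error learning and not a full reinforcement learning algorithm. The names `run_episode` and `search` and all constants are hypothetical.

```python
import random

GRAVITY = 5.0     # ft/s^2 downward (hypothetical value)
THRUST = 12.0     # ft/s^2 upward while the engine fires (hypothetical value)
DT = 0.1          # simulation time step in seconds
SAFE_SPEED = 5.0  # touchdown speed (ft/s) below which we call it a landing


def run_episode(gain, start_altitude=100.0):
    """Simulate one descent and return the touchdown speed in ft/s."""
    altitude, velocity = start_altitude, 0.0  # velocity < 0 means falling
    while altitude > 0.0:
        # Policy: fire when falling faster than an altitude-dependent target.
        target_speed = 1.0 + gain * altitude
        firing = -velocity > target_speed
        accel = (THRUST if firing else 0.0) - GRAVITY
        velocity += accel * DT
        altitude += velocity * DT
    return abs(velocity)


def search(trials=200, seed=0):
    """Try random gains and keep the one with the gentlest touchdown."""
    rng = random.Random(seed)
    best_gain, best_speed = None, float("inf")
    for _ in range(trials):
        gain = rng.uniform(0.0, 2.0)
        speed = run_episode(gain)
        if speed < best_speed:
            best_gain, best_speed = gain, speed
    return best_gain, best_speed


if __name__ == "__main__":
    gain, speed = search()
    verdict = "landed" if speed < SAFE_SPEED else "crater"
    print(f"best gain {gain:.3f}, touchdown speed {speed:.2f} ft/s ({verdict})")
```

Adding the later stages (horizontal motion, pivoting) would mean a larger state and more policy parameters, at which point a proper reinforcement learning method becomes more practical than random search.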
malermeister, over 1 year ago
How does this compare to the Karpathy video [0]? I'm trying to get into LLMs and am trying to figure out what the best resource to get that level of understanding would be.

[0] https://www.youtube.com/watch?v=kCc8FmEb1nY
Buttons840, over 1 year ago
Question for the author:

I'm not interested in language models specifically, but there are techniques involved in language models that I would like to understand better and use elsewhere. For example, I know "attention" is used in a variety of models, and I know transformers are used in more than just language models. Will this book help me understand attention and transformers well enough that I can use them outside of language models?
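
For context on why attention carries over to other domains, here is a minimal sketch of scaled dot-product attention in plain Python. It is not taken from the book; it omits the learned query/key/value projections and multiple heads that a transformer layer would add, and the function names are hypothetical. Nothing in it refers to text: the inputs are just lists of vectors, which could equally be image patches or time-series steps.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of vectors.

    Each query is compared with every key; the resulting weights
    mix the value vectors into one output vector per query.
    """
    d = len(keys[0])  # key dimension, used for the 1/sqrt(d) scaling
    outputs = []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
            for k in keys
        ]
        weights = softmax(scores)
        mixed = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(mixed)
    return outputs
```

When all keys are identical, every value gets the same weight, so the output is simply the average of the values.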
intalentive, over 1 year ago
The model architecture itself is really not too complex, especially with torch. The whole process is pretty straightforward. Nice feasible project.
SushiHippie, over 1 year ago
FYI, this probably qualifies as a "Show HN:"
towelpluswater, over 1 year ago
Bought a copy! Your posts and newsletter content have been such a huge inspiration for me throughout 2023. Good luck, this is a huge effort!
two_in_one, over 1 year ago
As it's still a work in progress, may I make a suggestion? It would be nice if you went beyond what others have already published and added more details, such as different position encodings, MoE, decoding methods, and tokenization. As it's educational, ease of use should be a priority, of course.
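
As one illustration of what "decoding methods" covers, here is a minimal sketch of two common ones, greedy decoding and top-k sampling with temperature, applied to a single step's logits. It is written in plain Python under the assumption that the model's output is a list of per-token scores; the function names are hypothetical and this is not the book's code.

```python
import math
import random


def greedy(logits):
    """Return the index of the highest-scoring token."""
    return max(range(len(logits)), key=lambda i: logits[i])


def sample_top_k(logits, k, temperature=1.0, rng=random):
    """Sample a token index from the k highest-scoring tokens.

    Lower temperatures sharpen the distribution toward the top token;
    k=1 reduces to greedy decoding.
    """
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    scaled = [logits[i] / temperature for i in top]
    m = max(scaled)
    # Unnormalized softmax weights; random.choices normalizes them.
    weights = [math.exp(s - m) for s in scaled]
    return rng.choices(top, weights=weights, k=1)[0]
```

Passing a seeded `random.Random` instance as `rng` makes the sampling reproducible, which helps when comparing settings.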
photon_collider, over 1 year ago
Bought a copy! Looking forward to reading it. :)

Is there a way for readers to give feedback on the book as you write it?
ijustwanttovote, over 1 year ago
Wow, great info. Thanks for sharing.
kif, over 1 year ago
Looks like just the kind of book I'd want to read. I bought a copy :)
theogravity, over 1 year ago
Purchased the book. Really excited to read it!
bosky101, over 1 year ago
How was the process of pitching to Manning?
corethree, over 1 year ago
Nowadays anyone can probably put together a good book about this topic by using an LLM.
iamcreasy, over 1 year ago
Thank you for this endeavour.

Do you have an ETA for the completion of the book?
Karupan, over 1 year ago
Bought a copy. Good luck rasbt!
clueless, over 1 year ago
Is the code for chapters 4 through 8 missing?