Implementing a ChatGPT-like LLM from scratch, step by step

739 points by rasbt over 1 year ago

21 comments

canyon289 over 1 year ago
For an additional resource, I'm writing a guidebook, though it's in various stages of completion.
The fine-tuning guide is the best resource so far: https://ravinkumar.com/GenAiGuidebook/language_models/finetuning.html
turnsout over 1 year ago
This looks amazing @rasbt! Out of curiosity, is your primary goal to cultivate understanding and demystify, or to encourage people to build their own small models tailored to their needs?
AndrewKemendo over 1 year ago
Writing a technical book in public is a level of anxiety I can’t imagine, so kudos to the author!
npalli over 1 year ago
    import torch
From the first code sample, not quite from scratch :-)
wslh over 1 year ago
I jumped to GitHub thinking this would be a free resource (with all due respect to the author's work).
What free resources are available and recommended in the "from scratch vein"?
whartung over 1 year ago
Can I use any of the information in this book to learn about reinforcement learning?
My goal is to have something learn to land, like a lunar lander. Simple: start at 100 feet, thrust in one direction, keep trying until you stop making craters.
Then start adding variables, such as now it's moving horizontally, adding a horizontal thruster.
Next, remove the horizontal thruster and let the lander pivot.
Etc.
I just have no idea how to start with this, but this seems like "mainstream" ML, so I'm curious if this book would help with that.
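A minimal sketch of the first step of the lander idea above (drop from 100 feet, one vertical thruster, keep trying), assuming simplified 1D physics. It uses plain random search over a single policy parameter, which is a simpler stand-in for reinforcement learning and not a method taken from the book. The function names, the thrust acceleration, and the time step are hypothetical choices for illustration; 5.3 ft/s² approximates lunar gravity.

```python
import random


def simulate_landing(burn_below_ft, start_height_ft=100.0, dt=0.1,
                     gravity=5.3, thrust_accel=12.0, max_steps=2000):
    """Simulate one drop and return the touchdown speed in ft/s.

    The policy is a single number: fire the thruster whenever the
    altitude is below burn_below_ft. Lower touchdown speed is better.
    thrust_accel and dt are assumed values, not from the thread.
    """
    height = start_height_ft
    velocity = 0.0  # ft/s, negative means falling
    for _ in range(max_steps):
        thrust = thrust_accel if height < burn_below_ft else 0.0
        velocity += (thrust - gravity) * dt
        height += velocity * dt
        if height <= 0.0:
            return abs(velocity)
    # The lander never reached the ground within max_steps (it hovered).
    return float("inf")


def random_search(trials=200, seed=0):
    """Try random burn altitudes and keep the gentlest landing found."""
    rng = random.Random(seed)
    best_threshold, best_speed = None, float("inf")
    for _ in range(trials):
        candidate = rng.uniform(0.0, 100.0)
        speed = simulate_landing(candidate)
        if speed < best_speed:
            best_threshold, best_speed = candidate, speed
    return best_threshold, best_speed


best_threshold, best_speed = random_search()
print(f"burn below {best_threshold:.1f} ft, touchdown at {best_speed:.2f} ft/s")
```

Adding the horizontal motion or the pivot mentioned above would mean more state variables and more policy parameters, at which point a proper RL method or an existing environment becomes more practical than random search.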
malermeister over 1 year ago
How does this compare to the Karpathy video [0]? I'm trying to get into LLMs and am trying to figure out what the best resource to get that level of understanding would be.
[0] https://www.youtube.com/watch?v=kCc8FmEb1nY
Buttons840 over 1 year ago
Question for the author:
I'm not interested in language models specifically, but there are techniques involved with language models I would like to understand better and use elsewhere. For example, I know "attention" is used in a variety of models, and I know transformers are used in more than just language models. Will this book help me understand attention and transformers well enough that I can use them outside of language models?
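To illustrate the point that attention is not tied to text, here is a minimal sketch of scaled dot-product attention over plain lists of floats, using only the standard library. It is not the book's implementation; the function names and the sensor-style numbers are made up for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention.

    Each query is compared with every key; the softmaxed, scaled dot
    products become weights for averaging the value vectors.
    Returns (outputs, weights), one row per query.
    """
    scale = math.sqrt(len(keys[0]))
    outputs, all_weights = [], []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        mixed = [sum(w * v[j] for w, v in zip(weights, values))
                 for j in range(len(values[0]))]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights


# Example: one query attending over three made-up sensor readings.
queries = [[1.0, 0.0]]
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[10.0], [20.0], [30.0]]
outputs, weights = attention(queries, keys, values)
print(weights[0])  # non-negative weights that sum to 1
print(outputs[0])  # weighted average of the value vectors
```

Nothing here depends on tokens or vocabularies: the inputs only need to be vectors, which is why the same mechanism appears in models for images, audio, and other data.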
intalentive over 1 year ago
The model architecture itself is really not too complex, especially with torch. The whole process is pretty straightforward. Nice feasible project.
SushiHippie over 1 year ago
FYI, this probably qualifies as a "Show HN:"
towelpluswater over 1 year ago
Bought a copy! Your posts and newsletter content have been such a huge inspiration for me throughout 2023 - good luck, this is a huge effort!
two_in_one over 1 year ago
As it's still a work in progress, may I make a suggestion? It would be nice if you went beyond what others have already published and added more details, like different position encodings, MoE, decoding methods, and tokenization. As it's educational, ease of use should be a priority, of course.
photon_collider over 1 year ago
Bought a copy! Looking forward to reading it. :)
Is there a way for readers to give feedback on the book as you write it?
ijustwanttovote over 1 year ago
Wow, great info. Thanks for sharing.
kif over 1 year ago
Looks like just the kind of book I'd want to read. I bought a copy :)
theogravity over 1 year ago
Purchased the book. Really excited to read it!
bosky101 over 1 year ago
How was the process of pitching to Manning?
corethree over 1 year ago
Nowadays anyone can probably put together a good book about this topic by using an LLM.
iamcreasy over 1 year ago
Thank you for this endeavour.
Do you have an ETA for the completion of the book?
Karupan over 1 year ago
Bought a copy. Good luck rasbt!
clueless over 1 year ago
Is the code for chapters 4 through 8 missing?