科技回声

21 条评论

canyon289超过 1 年前

For an additional resource I'm writing a guide book, though its in various stages of completionThe fine tuning guide is the best resource so far <a href="https://ravinkumar.com/GenAiGuidebook/language_models/finetuning.html" rel="nofollow">https://ravinkumar.com/GenAiGuidebook/language_models/finetu...</a>

评论 #39289077 未加载

turnsout超过 1 年前

This looks amazing @rasbt! Out of curiosity, is your primary goal to cultivate understanding and demystify, or to encourage people to build their own small models tailored to their needs?

评论 #39157862 未加载

AndrewKemendo超过 1 年前

Writing a technical book in public is a level of anxiety I can’t imagine, so kudos to the author!

评论 #39157471 未加载

评论 #39158985 未加载

npalli超过 1 年前

<pre><code> import torch </code></pre> From the first code sample, not quite from scratch :-)

评论 #39158215 未加载

评论 #39158761 未加载

评论 #39158703 未加载

评论 #39164753 未加载

评论 #39160680 未加载

评论 #39161464 未加载

评论 #39160572 未加载

评论 #39158287 未加载

wslh超过 1 年前

I jumped to Github thinking this is would be a free resource (with all due respect to the author work).What free resources are available and recommended in the "from scratch vein"?

评论 #39158453 未加载

评论 #39158425 未加载

评论 #39160709 未加载

评论 #39158870 未加载

评论 #39158432 未加载

whartung超过 1 年前

Can I use any of the information in this book to learn about reinforcement learning?My goal is to have something learn to land, like a lunar lander. Simple, start at 100 feet, thrust in one direction, keep trying until you stop making craters.Then start adding variables, such as now it's moving horizontally, adding a horizontal thruster.next, remove the horizontal thruster and let the lander pivot.Etc.I just have no idea how to start with this, but this seems "mainstream" ML, curious if this book would help with that.

评论 #39158586 未加载

评论 #39158700 未加载

评论 #39158738 未加载

评论 #39158592 未加载

评论 #39162392 未加载

评论 #39158609 未加载

评论 #39159056 未加载

malermeister超过 1 年前

How does this compare to the karpathy video [0]? I'm trying to get into LLMs and am trying to figure out what the best resource to get that level of understanding would be.[0] <a href="https://www.youtube.com/watch?v=kCc8FmEb1nY" rel="nofollow">https://www.youtube.com/watch?v=kCc8FmEb1nY</a>

评论 #39158188 未加载

评论 #39158130 未加载

Buttons840超过 1 年前

Question for the author:I'm not interested in language models specifically, but there are techniques involved with language models I would like to understand better and use elsewhere. For example, I know "attention" is used in a variety of models, and I know transformers are used in more than just language models. Will this book help me understand attention and transformers well enough that I can use them outside of language models?

评论 #39166382 未加载

intalentive超过 1 年前

The model architecture itself is really not too complex, especially with torch. The whole process is pretty straightforward. Nice feasible project.

SushiHippie超过 1 年前

fyi probably qualifies as an "Show HN:"

towelpluswater超过 1 年前

Bought a copy! Your posts and newsletter content has been such a huge inspiration for me throughout 2023 - good luck, this is a huge effort!

评论 #39189238 未加载

two_in_one超过 1 年前

As it's still work in progress may I suggest? It would be nice if you go beyond what others have already published and add more details. Like different position encodings, MoE, decoding methods, tokenization. As it's educational easy to use should be a priority, of course.

评论 #39161469 未加载

photon_collider超过 1 年前

Bought a copy! Looking forward to reading it. :)Is there a way for readers to give feedback on the book as you write it?

评论 #39161529 未加载

评论 #39160596 未加载

ijustwanttovote超过 1 年前

Wow, great info. Thanks for sharing.

kif超过 1 年前

Looks like just the kind of book I'd want to read. I bought a copy :)

评论 #39157711 未加载

theogravity超过 1 年前

Purchased the book. Really excited to read it!

评论 #39161556 未加载

bosky101超过 1 年前

How was the process of pitching to Manning?

评论 #39158252 未加载

corethree超过 1 年前

Nowadays anyone can probably put together a good book about this topic by using an LLM.

iamcreasy超过 1 年前

Thank you for this endeavour.Do you have an ETA for the completion of the book?

评论 #39161518 未加载

Karupan超过 1 年前

Bought a copy. Good luck rasbt!

评论 #39161532 未加载

clueless超过 1 年前

are the code for chapter 4 through 8 missing?

评论 #39157972 未加载

评论 #39158095 未加载

21 条评论

canyon289超过 1 年前

评论 #39289077 未加载

turnsout超过 1 年前

This looks amazing @rasbt! Out of curiosity, is your primary goal to cultivate understanding and demystify, or to encourage people to build their own small models tailored to their needs?

评论 #39157862 未加载

AndrewKemendo超过 1 年前

Writing a technical book in public is a level of anxiety I can’t imagine, so kudos to the author!

评论 #39157471 未加载

评论 #39158985 未加载

npalli超过 1 年前

<pre><code> import torch </code></pre> From the first code sample, not quite from scratch :-)

评论 #39158215 未加载

评论 #39158761 未加载

评论 #39158703 未加载

评论 #39164753 未加载

评论 #39160680 未加载

评论 #39161464 未加载

评论 #39160572 未加载

评论 #39158287 未加载

wslh超过 1 年前

I jumped to Github thinking this is would be a free resource (with all due respect to the author work).What free resources are available and recommended in the "from scratch vein"?

评论 #39158453 未加载

评论 #39158425 未加载

评论 #39160709 未加载

评论 #39158870 未加载

评论 #39158432 未加载

whartung超过 1 年前

评论 #39158586 未加载

评论 #39158700 未加载

评论 #39158738 未加载

评论 #39158592 未加载

评论 #39162392 未加载

评论 #39158609 未加载

评论 #39159056 未加载

malermeister超过 1 年前

评论 #39158188 未加载

评论 #39158130 未加载

Buttons840超过 1 年前

评论 #39166382 未加载

intalentive超过 1 年前

The model architecture itself is really not too complex, especially with torch. The whole process is pretty straightforward. Nice feasible project.

SushiHippie超过 1 年前

fyi probably qualifies as an "Show HN:"

towelpluswater超过 1 年前

Bought a copy! Your posts and newsletter content has been such a huge inspiration for me throughout 2023 - good luck, this is a huge effort!

评论 #39189238 未加载

two_in_one超过 1 年前

评论 #39161469 未加载

photon_collider超过 1 年前

Bought a copy! Looking forward to reading it. :)Is there a way for readers to give feedback on the book as you write it?

评论 #39161529 未加载

评论 #39160596 未加载

ijustwanttovote超过 1 年前

Wow, great info. Thanks for sharing.

kif超过 1 年前

Looks like just the kind of book I'd want to read. I bought a copy :)

评论 #39157711 未加载

theogravity超过 1 年前

Purchased the book. Really excited to read it!

评论 #39161556 未加载

bosky101超过 1 年前

How was the process of pitching to Manning?

评论 #39158252 未加载

corethree超过 1 年前

Nowadays anyone can probably put together a good book about this topic by using an LLM.

iamcreasy超过 1 年前

Thank you for this endeavour.Do you have an ETA for the completion of the book?

Implementing a ChatGPT-like LLM from scratch, step by step

21 条评论

Implementing a ChatGPT-like LLM from scratch, step by step

21 条评论