For an additional resource, I'm writing a guidebook, though it's in various stages of completion.<p>The fine-tuning guide is the best resource so far:
<a href="https://ravinkumar.com/GenAiGuidebook/language_models/finetuning.html" rel="nofollow">https://ravinkumar.com/GenAiGuidebook/language_models/finetu...</a>
This looks amazing @rasbt! Out of curiosity, is your primary goal to cultivate understanding and demystify, or to encourage people to build their own small models tailored to their needs?
I jumped to GitHub thinking this would be a free resource (with all due respect to the author's work).<p>What free resources are available and recommended in the "from scratch" vein?
Can I use any of the information in this book to learn about reinforcement learning?<p>My goal is to have something learn to land, like a lunar lander. Simple: start at 100 feet, thrust in one direction, keep trying until you stop making craters.<p>Then start adding variables, such as horizontal movement and a horizontal thruster.<p>Next, remove the horizontal thruster and let the lander pivot.<p>Etc.<p>I just have no idea how to start with this, but it seems like "mainstream" ML, so I'm curious if this book would help with that.
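For what it's worth, here's roughly what the first step of that progression might look like as tabular Q-learning on a toy 1D lander. This is my own sketch, not from the book; all names, constants, and reward values are made up for illustration:

```python
# Toy 1D lander: state is (height, vertical velocity); the only action is
# thrust on/off. Tabular Q-learning over discretized states.
import random

random.seed(0)

GRAVITY, THRUST, DT = -1.0, 2.0, 0.1

def step(h, v, thrust_on):
    """Advance physics one tick. Returns (next_state, reward);
    next_state is None once the lander touches down."""
    a = GRAVITY + (THRUST if thrust_on else 0.0)
    v += a * DT
    h += v * DT
    if h <= 0.0:
        # Touchdown: soft landing if slow, crater if fast.
        return None, (10.0 if abs(v) < 1.0 else -10.0)
    return (h, v), -0.01  # small step cost discourages hovering forever

def bucket(h, v):
    """Discretize continuous state into a table key."""
    return (min(int(h), 20), max(-10, min(10, int(v))))

Q = {}  # maps bucketed state -> [value of no-thrust, value of thrust]
alpha, gamma, eps = 0.5, 0.99, 0.1

for episode in range(5000):
    state = (10.0, 0.0)  # start 10 units up, at rest
    for _ in range(1000):  # cap episode length
        if state is None:
            break
        s = bucket(*state)
        qs = Q.setdefault(s, [0.0, 0.0])
        # Epsilon-greedy action selection.
        a = random.randrange(2) if random.random() < eps else qs.index(max(qs))
        nxt, r = step(state[0], state[1], a == 1)
        target = r if nxt is None else r + gamma * max(Q.setdefault(bucket(*nxt), [0.0, 0.0]))
        qs[a] += alpha * (target - qs[a])  # standard Q-learning update
        state = nxt
```

From there, adding horizontal velocity and a second thruster is just a bigger state/action table (or the point where you swap the table for a small network). Gymnasium's LunarLander environment is the ready-made version of this exact exercise if you'd rather not write the physics yourself.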
How does this compare to the Karpathy video [0]? I'm trying to get into LLMs and am trying to figure out what the best resource would be to get that level of understanding.<p>[0] <a href="https://www.youtube.com/watch?v=kCc8FmEb1nY" rel="nofollow">https://www.youtube.com/watch?v=kCc8FmEb1nY</a>
Question for the author:<p>I'm not interested in language models specifically, but there are techniques involved with language models I would like to understand better and use elsewhere. For example, I know "attention" is used in a variety of models, and I know transformers are used in more than just language models. Will this book help me understand attention and transformers well enough that I can use them outside of language models?
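Not the author, but the mechanism itself is domain-agnostic, which a few lines of NumPy make clear. This is my own sketch (the inputs here are arbitrary feature vectors standing in for "not words"; the book's code may differ):

```python
# Scaled dot-product attention, with nothing language-specific in it.
import numpy as np

def attention(Q, K, V):
    # Q: (n_queries, d), K: (n_keys, d), V: (n_keys, d_v)
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # query/key similarity
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over keys
    return weights @ V                               # weighted average of values

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))  # e.g. 5 sensor readings or patches, not tokens
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
out = attention(x @ Wq, x @ Wk, x @ Wv)
print(out.shape)  # (5, 8)
```

Everything LLM-specific (tokenization, causal masking, positional encodings) sits around this core, so a book that builds it from scratch should transfer to vision or time-series uses of attention as well.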
As it's still a work in progress, may I make a suggestion? It would be nice if you went beyond what others have already published and added more detail: different positional encodings, MoE, decoding methods, tokenization. As it's educational, ease of use should be a priority, of course.