TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: MyGPT a toy LLM which can be trained on Project Gutenberg and dad jokes

25 pointsby disconnectionover 1 year ago
My puny version of ChatGPT.<p>This was based on the excellent LLM lecture series by Andrej Karpathy: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=kCc8FmEb1nY">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=kCc8FmEb1nY</a><p>The main points of differentiation are that my version is token-based (tiktoken) with code to load up multiple text files as a trining set. Plus, it has a minimal server which is a drop-in replacement for the OpenAI REST API.<p>So you can train the default tiny 15M parameter model, and use that in your projects instead of ChatGPT.<p>I trained it on 20Mb of Project Gutenberg encyclopaedias, then fine-tuned it on 120 dad jokes, to get a Q: A: prompt format.<p>This model + training set is so small that the results are basically a joke; it&#x27;s for entertainment purposes only. The code is also very rough, and the server only has the minimum functionality filled in.<p>I embodied this model in my talking LLM-driven hexapod robot, and it could give very silly answers to spoken questions.

2 comments

ferfumarmaover 1 year ago
Can we see some examples of the jokes it produces?
评论 #37673055 未加载
getwiththeprogover 1 year ago
This is a great idea. I want to make a &#x27;pet&#x27; for my kid. I can&#x27;t get them a real dog, so why not a tinyLLM?<p>Training on guttenberg data is a great idea. What I would do is train it on all the e-books I have that are suitable for kids (I managed to find quite a lot online).<p>The dad jokes idea is great, please keep doing things along this line.
评论 #37673098 未加载