Can Gemini 1.5 read all the Harry Potter books at once?

57 points by petulla, about 1 year ago

13 comments

rryan, about 1 year ago
ML 101: Do not evaluate on the training data.

Yes, of course it can, because they fit in the context window. But this is an awful test of the model's capabilities because it was certainly trained on these books and on websites talking about the books and the HP universe.
throwup238, about 1 year ago
How much of that character map is already in its training data and how much of it is actually read from the input prompt?

I'm always suspicious of these kinds of tests. It needs to be run with an unpublished book, not one of the most popular series of the 21st century.
JonSolomon, about 1 year ago
Not sure about all of the Harry Potter books, but I gave it my entire data export from ChatGPT and it handled it very well. I was able to search through it and continue past conversations. It was good.
julianpye, about 1 year ago
So, according to Gemini pricing, the call would cost approx. $11. Now, hopefully all goes to plan, the input is correct, and the result is what you wished for. If not, how many $11 calls do you need? Sure, pricing will go down, but my observation is that people just ignore the cost of context. When it's all about tech it's fine, but not if it's about efficiency.
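The cost question in the comment above can be made concrete with a little arithmetic. This is a minimal sketch, not a pricing calculator: the $11 per call is the approximate figure quoted in the comment, the retry counts are hypothetical, and it assumes every attempt resends the full context at the same price.

```python
# Approximate per-call cost quoted in the comment above (not an official rate).
COST_PER_CALL_USD = 11.0


def total_cost(attempts: int, cost_per_call: float = COST_PER_CALL_USD) -> float:
    """Total spend if every attempt resends the full context at the same price."""
    if attempts < 0:
        raise ValueError("attempts must be non-negative")
    return attempts * cost_per_call


if __name__ == "__main__":
    # Hypothetical retry counts, chosen only for illustration.
    for attempts in (1, 3, 10):
        print(f"{attempts} call(s): ${total_cost(attempts):.2f}")
```

The point is only that cost scales linearly with retries when the whole context is resent each time, which is the efficiency concern the comment raises.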
westurner, about 1 year ago
> All the books have ~1M words (1.6M tokens). Gemini fits about 5.7 books out of 7. I used it to generate a graph of the characters and it CRUSHED it.

An LLM could read all of the books with Infini-attention (2024-04): https://news.ycombinator.com/item?id=40001626#40020560
gs17, about 1 year ago
There might be enough Harry Potter related content in its training set that it's not really "reading" the books in its context.
OscarTheGrinch, about 1 year ago
OK, so my next question is: what can you do with a model loaded with Harry Potter context? Answer Harry Potter trivia at a superhuman level? Write the next Harry Potter adventure?

Having used GPTs to do creative writing, I can report that they are good for solving the tyranny of the blank page, but then you have to read and edit hundreds of pages of dank AI prose, which never quite aligns with your creative vision, to harvest a few nuggets of creativity. Does it end up saving any time?
ec109685, about 1 year ago
I can't see how this map would be useful to anyone. While it gets some of the relationships right, it has a bunch of unneeded detail and focuses on areas not crucial to the stories.

At a surface level, LLMs wow, but when you dig into the details there are often still huge gaps in output quality for many tasks.
sinuhe69, about 1 year ago
It would be more impressive (and cleaner, btw) if it were fed fan-fiction books and not the original books. Then we could see what it can make out of the context and what it "borrows" from the training data.

Why fan fiction? Well, fan fiction is not famous enough to be included in any training corpus, I believe. But Harry Potter fan fiction is plentiful enough to test the context limit. There are also similarities to and differences from the originals, which require correct recall to tell them apart. That would be a good test, wouldn't it?
fennecbutt, about 1 year ago
Shouldn't the title be rephrased to not be clickbait?

I refuse to even read it because clickbait makes me sad, but something like "Gemini 1.5 can read all the HP books at once" would be a more appropriate title for this forum, imo.
iJohnDoe, about 1 year ago
FWIW, I actually think this is pretty cool.

People created a map of all the Star Wars characters manually years ago. Being able to see all the characters mapped out from a story you're interested in is pretty fun and helpful.
he0001, about 1 year ago
How can I trust a result like this without reading it myself to verify?
bhaney, about 1 year ago
Answer: No (but almost)