TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Your website's content -> Q&A bot / chatbot

84 pointsby mpaepperabout 2 years ago

11 comments

grogenautabout 2 years ago
Reading the readme makes me think it&#x27;s only searching the top 4 most likely docs via the embeddings, not the wiki at any time? or am I misunderstanding how this works? With embeddings being close to just term vector matching via dot(?) product?<p>So basically get all the sub-prhases&#x2F;sounds -&gt; vector -&gt; check vector db for closest matching documents -&gt; send to gpt for summarization and answering the quetsion.<p>If that&#x27;s ture wouldn&#x27;t that have severe limitations with scattered information? I guess it would help you get answers and walk the data better than the &quot;I don&#x27;t even know the term&quot; problem with google?
评论 #35257994 未加载
评论 #35255915 未加载
layoricabout 2 years ago
Nice to have tools like this to wrap up features, definitely makes these types of solutions more accessible, thanks!<p>It would be nice to know from your experience if there is a kind of rule of thumb for calculating cost of fine tuning and running a solution like this against a docs site?
评论 #35258031 未加载
petesergeantabout 2 years ago
I tried to do something tangentially similar recently, telling ChatGPT that I&#x27;d ask it a question, but rather than a response, I wanted search terms for Wikipedia and Wikidata that I could give it that would have the answer in. The thinking is I&#x27;d then be able to provide those to it, and get it to synthesize that data, providing answers that had decent citations in them.<p>Perhaps it was the example I chose &quot;flight time from New York to London&quot; but I couldn&#x27;t really get it to provide sensible search terms for the information it wanted or needed
评论 #35257145 未加载
limcheekinabout 2 years ago
Thanks for sharing the code. What happen when the existing content get updated and new contents created, would it need to create embeddings for all contents again? The current approach is not good as create embeddings cost money? Please see <a href="https:&#x2F;&#x2F;github.com&#x2F;mpaepper&#x2F;content-chatbot&#x2F;blob&#x2F;main&#x2F;create_embeddings.py#L49">https:&#x2F;&#x2F;github.com&#x2F;mpaepper&#x2F;content-chatbot&#x2F;blob&#x2F;main&#x2F;create...</a>. Would it be possible progressively update the vector store?<p>Please advise. Thank you.
mdotkabout 2 years ago
Make this a Wordpress plugin and I&#x27;d pay for it
评论 #35255312 未加载
评论 #35255088 未加载
wedn3sdayabout 2 years ago
I would absolutely love to take our internal Wiki and use this against it.
评论 #35258037 未加载
jonaraphaelabout 2 years ago
Awesome work! Thanks for sharing.<p>For anyone interested in an audio version that talks to you, that you can get on your site today, my brother put this together a few weeks ago! <a href="https:&#x2F;&#x2F;siteguide.ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;siteguide.ai&#x2F;</a>
nicoabout 2 years ago
Awesome!<p>Are you planning on adding agent&#x2F;tools support?<p>It would be cool to use this with internal data, then allow clients to chat with a bot fine-tunes on their data, but that can also run queries, or get reports for specific dates, or charts, all via tools.
评论 #35258054 未加载
rcarmoabout 2 years ago
Curious to see if it can take my entire site content: <a href="https:&#x2F;&#x2F;taoofmac.com&#x2F;static&#x2F;graph" rel="nofollow">https:&#x2F;&#x2F;taoofmac.com&#x2F;static&#x2F;graph</a><p>Might be a fun weekend experiment.
评论 #35259535 未加载
friendlypegabout 2 years ago
How does this handle websites with complicated structure instead of your typical blogposts where ideas are divided neatly into separate paragraph?
评论 #35258045 未加载
mdmglrabout 2 years ago
See also: <a href="https:&#x2F;&#x2F;github.com&#x2F;whitead&#x2F;paper-qa">https:&#x2F;&#x2F;github.com&#x2F;whitead&#x2F;paper-qa</a>