TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Searching a Codebase in English

56 点作者 dakshgupta9 个月前

5 条评论

tonyoconnell9 个月前
Summary &quot;Semantic search on codebases works better if you first translate the code to natural language, before generating embedding vectors. It also works better if you chunk more “tightly” - on a per-function level rather than a per-file level. This is because noise negatively impacts retrieval quality in a huge way.&quot;<p>This makes a lot of sense. You should also embed information about how the code is related to other functions&#x2F;code and where it is located in the codebase. One approach is to add really wonderful comments to the code so that when humans and machines read it they are brought on a step by step journey of how the code fulfills a goal. I tell the LLM to explain step by step to junior developers and and to inspire seniour engineers with glimpse of the profound beauty of the code and its architecture.
评论 #41296300 未加载
byearthithatius9 个月前
I think I found a mistake. In the article you write: &quot;We then compare that against our database of vectors and find the one(s) that match the closest, i.e., have the lowest dot product and highest similarity.&quot;<p>We want to maximize the normalized dot product (or cosine similarity) to find semantically similar text chunks.
评论 #41302555 未加载
oshams9 个月前
Interesting direction. We also have a codebase chat (example here <a href="https:&#x2F;&#x2F;wiki.mutable.ai&#x2F;ollama&#x2F;ollama">https:&#x2F;&#x2F;wiki.mutable.ai&#x2F;ollama&#x2F;ollama</a>) that HN might find appealing. It uses a wiki as a living artifact owned by your team to power the chat, gives us increased context length and reasoning capabilities. We didn&#x27;t really like the results we got with embeddings. Have been pretty thrilled with the results on Q&amp;A, search, and even codegen (more on that soon).
deisteve9 个月前
is there a free version of greptile
评论 #41296294 未加载
Zambyte9 个月前
The page is unreadable on Firefox Focus
评论 #41295261 未加载
评论 #41296401 未加载
评论 #41295586 未加载