TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

DeepSearcher: A local open-source Deep Research

229 pointsby stephen373 months ago

12 comments

gslepak3 months ago
This doesn&#x27;t seem to use local LLMs... so it&#x27;s not really local. :-\<p>Is there a deep searcher that can also use local LLMs like those hosted by Ollama and LM Studio?
评论 #43176269 未加载
评论 #43330152 未加载
vineyardmike3 months ago
I’m curious how this compares to the open-source version made by HuggingFace [1]. As I can tell, the HF version uses reasoning LLMs to search&#x2F;traverse and parse the web and gather results, then evaluates the results before eventually synthesizing a result.<p>This version appears to show off a vector store for documents generated from a web crawl (the writer is a vector-store-aaS company)<p>[1] <a href="https:&#x2F;&#x2F;github.com&#x2F;huggingface&#x2F;smolagents&#x2F;tree&#x2F;main&#x2F;examples&#x2F;open_deep_research">https:&#x2F;&#x2F;github.com&#x2F;huggingface&#x2F;smolagents&#x2F;tree&#x2F;main&#x2F;examples...</a>
评论 #43177672 未加载
parhamn3 months ago
I think the magic of Grok&#x27;s implementation of this is that they already have most of the websites cached (guessing via their twitter crawler) so it all feels very snappy. Bing&#x2F;Brave search don&#x27;t seem to offer that in their search apis. Does such a thing exist as a service?
评论 #43175260 未加载
评论 #43176143 未加载
评论 #43179807 未加载
评论 #43175265 未加载
bilater3 months ago
Nice - I like people&#x27;s different twist on Deep Research. Here is mine...with Flow I&#x27;m trying a new workflow.<p><a href="https:&#x2F;&#x2F;github.com&#x2F;btahir&#x2F;open-deep-research">https:&#x2F;&#x2F;github.com&#x2F;btahir&#x2F;open-deep-research</a>
fuddle3 months ago
Considering all the major AI companies have basically created the same deep research product, it would make sense that they focus on a shared open source platform instead.
stefanwebb3 months ago
There&#x27;s two blog posts that go with this, check it out:<p><a href="https:&#x2F;&#x2F;milvus.io&#x2F;blog&#x2F;i-built-a-deep-research-with-open-source-so-can-you.md" rel="nofollow">https:&#x2F;&#x2F;milvus.io&#x2F;blog&#x2F;i-built-a-deep-research-with-open-sou...</a><p><a href="https:&#x2F;&#x2F;milvus.io&#x2F;blog&#x2F;introduce-deepsearcher-a-local-open-source-deep-research.md" rel="nofollow">https:&#x2F;&#x2F;milvus.io&#x2F;blog&#x2F;introduce-deepsearcher-a-local-open-s...</a>
Daniel_Van_Zant3 months ago
Have been searching for a deep research tool that I can hook up to both my personal notes (in Obsidian) and the web and this looks like this has those capabilities. Now the only piece left is to figure out a way to export the deep research outputs back into my Obsidian somehow.
评论 #43174868 未加载
zitterbewegung3 months ago
I actually tried using this and I came into some issues and I had to replace the openAI text embeddings with the MilvusEmbedding.<p><a href="https:&#x2F;&#x2F;gist.github.com&#x2F;zitterbewegung&#x2F;086dd344d16d4fd4b8931b7830d512b7" rel="nofollow">https:&#x2F;&#x2F;gist.github.com&#x2F;zitterbewegung&#x2F;086dd344d16d4fd4b8931...</a><p>The QuickStart had a good response. [1] <a href="https:&#x2F;&#x2F;gist.github.com&#x2F;zitterbewegung&#x2F;086dd344d16d4fd4b8931b7830d512b7?permalink_comment_id=5456928#gistcomment-5456928" rel="nofollow">https:&#x2F;&#x2F;gist.github.com&#x2F;zitterbewegung&#x2F;086dd344d16d4fd4b8931...</a>
mtrovo3 months ago
I&#x27;m wondering about the practical implications of integrating web crawling. Could this, in theory, be used solely for reading papers from Sci-Hub and producing valid graduate-level research?<p>It could be useful for comparing reports built using DeepSeek R1 vs. GPT-4o and other large models. The code being open source might highlight the limitations of different LLMs much faster and help develop better reasoning loops in future prompts for specific needs. Really interesting stuff.
namlem3 months ago
The real magic bullet would be searching lib-gen and sci-hub as well
redskyluan3 months ago
Amazing!<p>Search is not a problem . What to search is!<p>Using reasoning model, it is much easier to split task and focus on what to search
评论 #43178397 未加载
cma3 months ago
Cloudflare is going to ruin self hosted things like this and force centralization to a few players. I guess we&#x27;ll need decentralized efforts to scrape the web and be able to run it on that.