TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

The Anatomy Of Search Technology: Blekko’s NoSQL Database

74 点作者 McKittrick大约 13 年前

7 条评论

krishna2大约 13 年前
This is the same technology that is used by the webgrepper tool [<a href="http://blekko.com/webgrep" rel="nofollow">http://blekko.com/webgrep</a>] (a grep for the web pages' sources).<p>Disclaimer: I work at blekko and I developed the webgrepper.<p>As a side note, we have used this for various other purposes - some fun ones being, store a big music collection (to extract meta data via mapjob), citizenship test q&#38;a (to pick random questions), the 'joke of the day' (of course, this is our "hello world" example internally to new employees) ..etc.
评论 #3890338 未加载
评论 #3892517 未加载
JohnGolt大约 13 年前
Hey, great stuff! How do you implement high availability?
评论 #3890108 未加载
评论 #3890364 未加载
kveykva大约 13 年前
Isn't DynamoDB on SSDs? AWS alone might not be cost effective but, how does it look with the other services also offered?
评论 #3890700 未加载
Juha大约 13 年前
Very good article in deed. I have always been amazed of how the search engines can query so huge data sets so quickly. This brings some light to it.<p>It would had been nice to see some examples of the query language they use, it if is comparable to other NoSql databases.
评论 #3891267 未加载
gruseom大约 13 年前
Can you say more about the choice of swarm algorithms instead of Paxos?
评论 #3891268 未加载
lsuejung大约 13 年前
Awesome insights into what goes into building a search engine. Very impressive indeed.
tinyjoe大约 13 年前
does blekko DB partition data base on primary key or it just query all nodes everytime like elasticsearch?
评论 #3891891 未加载