For an information system built on standards --- HTML as a document markup language, HTTP as a transfer protocol, TLS/SSL for security, TCP/IP as the underlying networking protocols, among others --- one that is conspicuously missing is an <i>indexing standard</i>.<p>That is, <i>even if a site wanted to</i>, there's no way for it to declare "I have content related to X". Better still would be if these indices could then be distributed in a cache-and-forward model, similar to how DNS (another distributed discovery index) works. There were some exceedingly rudimentary attempts at this through elements such as keyword meta tags, but even at their best these referenced a vanishingly small fraction of the actual content of a site or article. Sitemaps also address a component of the problem, but again only in part.<p>Some might see a few immediate issues. One is that not all sites are sufficiently dynamic to know what content they actually contain. To an extent this might be addressed through extensions to the webserver protocol such that a server would be aware, or <i>become</i> aware, of what content it contained.<p>Another is that a site might in some instances be inclined to misrepresent <i>what</i> it contained. This may be hard for some to believe, but I'm given to understand it occasionally does occur. To help guard against this, there might be <i>vetted</i> indices, in which one or more third parties <i>vouch</i> for the validity of an index. These reputation sources could of course themselves be assessed for accuracy.<p>But <i>if sites were responsible for reporting on what content they actually contained, and could be constrained to doing so accurately</i>, a huge part of the overhead of creating independent search engines, and of breaking the search-engine monopoly, would be eliminated.<p>One might imagine why certain existing gatekeepers over Web standards might oppose such an initiative.
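<p>To make the idea concrete, here is a minimal sketch (in Python) of what a self-published index <i>might</i> look like. Everything in it is invented for illustration --- the /.well-known/content-index.json location, the JSON schema, and the field names are not part of any existing standard --- but it shows the general shape: a site declares its documents, the terms it claims they cover, and a content hash that a third-party voucher could later attest to.<p><pre><code>
# Hypothetical sketch: the path and schema below are illustrative only,
# not an existing standard.
import json
import hashlib
from datetime import datetime, timezone

def build_index_entry(url, title, terms, body):
    """Describe one document: where it lives, what it claims to cover,
    and a content hash a third party could later vouch for."""
    return {
        "url": url,
        "title": title,
        "terms": sorted(set(terms)),  # topics the site claims for this document
        "sha256": hashlib.sha256(body.encode("utf-8")).hexdigest(),
        "updated": datetime.now(timezone.utc).isoformat(timespec="seconds"),
    }

def write_site_index(entries, path="content-index.json"):
    """Write the index a server might expose at, say,
    https://example.com/.well-known/content-index.json (an invented location)."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump({"version": "0.1-draft", "entries": entries}, fh, indent=2)

if __name__ == "__main__":
    write_site_index([
        build_index_entry(
            "https://example.com/articles/indexing-standard",
            "Why the Web needs an indexing standard",
            ["web search", "indexing", "standards"],
            "Full article text would go here...",
        )
    ])
</code></pre><p>An aggregator could then fetch such files directly, rather than re-deriving the same information by crawling every page of every site.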
<p>There would still remain <i>other</i> problems to solve within the search space. It's possible to divide General Web Search into a set of specific problems:<p>- Site crawling: This includes determining crawl targets, any exclusions from such lists, and performing the actual crawling. Self-indexing addresses part of this problem.<p>- Indexing: Mapping actual contents to the keyword and query terms which might address that content.<p>- Ranking: Assigning a preference or deprecation to specific sites. This is essentially a trust / reputation assessment, combined with a canonicity / authenticity assessment (e.g., where did a specific item or document first appear?). A toy sketch of how indexing and ranking might compose follows this list.<p>- SEO: This is the Red Queen's race of addressing insincere / malicious actors. Strong and durable penalties for abuse, and long-term reputational accrual, should be useful here.<p>- Query interpretation: There's a considerable art to figuring out what a question actually means. In some cases queries should be taken strictly verbatim; quite often, however, interpretation is necessary. How alternative interpretations are posed might vary, with one option not often employed at present being to suggest a range of potential interpretations or related queries which might produce better results for a given query.<p>- Presentation: This is the generation of the search engine results page itself, incorporating several of the other considerations listed, but also addressing usability, accessibility, clarity, and other concerns.<p>- Revalidation: As the editors of the Hitchhiker's Guide observed, the Universe is not static, and circumstances change. Results and reputational assessments need to be revalidated, revisited, and revised over time.<p>- Monetisation/Funding: I'm partial to a public-goods model, or perhaps a farebox model collected via ISPs, pro-rated to general income/wealth within a region. Advertising, as a famous Stanford research paper prophetically observed, forces a misalignment with searchers' interests and objectives.
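<p>As a toy illustration of how the indexing and ranking steps might compose over self-published indices: the sketch below merges a few invented site indices into an inverted index, then scores results by a per-site reputation weight. The site names, reputation scores, and data structures are all made up, and real canonicity and trust assessment would be far more involved; the point is only how much of the machinery becomes simple once sites report their own content.<p><pre><code>
# Toy sketch: assumes sites publish indices shaped like the example above,
# and that an aggregator has assigned each site a reputation score in [0, 1].
from collections import defaultdict

def build_inverted_index(site_indices):
    """Map each claimed term to the (site, url) pairs claiming it (the indexing step)."""
    postings = defaultdict(list)
    for site, index in site_indices.items():
        for entry in index["entries"]:
            for term in entry["terms"]:
                postings[term.lower()].append((site, entry["url"]))
    return postings

def rank(postings, query_terms, reputation):
    """Score URLs by how many query terms they claim, weighted by how much
    we trust the site making the claim (the ranking step)."""
    scores = defaultdict(float)
    for term in query_terms:
        for site, url in postings.get(term.lower(), []):
            scores[url] += reputation.get(site, 0.1)  # unvouched sites get little weight
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

if __name__ == "__main__":
    site_indices = {
        "example.com": {"entries": [
            {"url": "https://example.com/a", "terms": ["web search", "indexing"]},
        ]},
        "spam.example": {"entries": [
            {"url": "https://spam.example/buy", "terms": ["web search", "indexing"]},
        ]},
    }
    reputation = {"example.com": 0.9, "spam.example": 0.05}
    postings = build_inverted_index(site_indices)
    print(rank(postings, ["indexing", "web search"], reputation))
</code></pre><p>In a scheme like this, the strong and durable penalties for abuse mentioned under SEO would amount to driving a site's reputation weight toward zero, tying that item directly to the ranking step.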