Show HN: The Population Project

7 points, by antoine_b, almost 2 years ago

Two years ago, I turned 50. After a successful career as an entrepreneur, a business angel and a novelist, I set out to start a philanthropic venture under the following constraints:

- it had to be global.
- it had to be beautiful (in my eyes, at least).
- it had to be technology and stats driven.

I decided I would try to list the full name and date of birth of all humans alive. While some may find the concept pointless, I immediately knew I had struck gold:

- it was global and incredibly hard.
- it had an almost artistic quality to it, like an ever-changing installation.
- as a libertarian, I resent that states conduct censuses and then sit on the data.
- One billion people in the world aren't officially registered. At least someone would acknowledge their existence.

I created a non-profit called The Population Project. I would never make a dime off it, but at least my costs would be tax-deductible.

I then started researching lists of names online. I quickly adopted two principles. First, I would collect a minimal set of information: full name, birth date, and birth place. Second, I would only scrape public information, i.e. nothing behind a password.

After a few months, I realized I needed help from more experienced developers. I chose to work on 4D, a platform I had used in the past to develop my company's information system. It was a tough choice: 4D is not a leading player in the back-end world, but I figured the growth of API tooling would make language choice less critical.

The first iteration of our database was frustrating: way too slow to publish a website. I learned the power of incremental change, with each marginal improvement saving you a few percent of speed or space. I also got to implement concepts I had heard about but never used, such as mirroring, partitioning, or hash-indexing.

Then I hired a team of six data processors in Madagascar who clean up and process the lists found online. Lots of Python and Excel macros in their day-to-day. I have instilled in them an obsession with quality. A bad record will sit in our base forever. After trying dozens of software tools, we've settled on Adobe Acrobat and Octoparse.

The final piece was the website. I lucked out in finding a strong team in Romania. They build with Next.js and deploy on Vercel. I gave them Wikipedia as the model to aim for. We/they haven't been able to match Wikipedia's simplicity. Our pages are too heavy. But I find the site user-friendly, pleasing to the eye and reasonably fast. We can and we will do better.

A word about privacy. Some people complain that because it publishes names and DOBs, the Population Project infringes on their privacy. We obviously don't see it that way.

- All our info is public. That DOB you find on the site is probably in the voter list of your state, a list that anyone can request or simply download.
- The info we publish is minimal. Basically, we say that you exist. No one will find anything about your race, religion, sexual preferences, job or income.
- We have adopted Wikipedia's privacy policy. We do not record your IP, unless you create or edit a record.
- We're using Matomo for our analytics. Great stuff. It's not free, but they do not use your data the way GA does.

Why am I telling you all this? From the beginning, I've envisioned a three-step process:

1) Build the database and populate it with millions of Western profiles.
2) Launch the site, where anybody can create or edit records and share them with their family.
3) When we've reached critical mass (1B records?), start making deals with NGOs and governments, and venture into other alphabets.

We have just completed step 1. Step 2 is daunting as hell. I have grown a business but I have never grown a website. While I am ready to spend a bit of money on PR or SEO, I am not delusional: to reach the level of success we have in mind, we need this thing to go (somewhat) viral.

How do you do that?

5 comments

saulpw, almost 2 years ago

This is 100% an ego project--count the number of "I" statements in your declaration above. Even in your stated reasons why you "struck gold" with this idea, the first three are solely about your effort and taste and resentment. The fourth is about your site "acknowledging the existence" of one billion unregistered people in the world. I'm sure their actual existence is already being acknowledged by others in their own community; they don't need some tech chucklehead adding a row into a database for that. It's laughably egotistical to think it matters one whit to someone whether your site has counted them for bragging rights.

I see absolutely no value in the result for humanity and especially not for the downtrodden billion. You would have done more philanthropy had you instead donated the funds to literally any other non-profit organization. I'm surprised you were able to claim non-profit status for this venture.

It's obvious you put a lot of work into this, and you're clearly capable of creating and executing a project. It even looks nice. So hopefully this can be a point of reflection as you decide what to do next with your remaining time on this planet.
orangea, almost 2 years ago

If your site becomes a spectacle/popular/viral, which I don't think it will, people will submit false data. I don't see how you can possibly deal with that.
bloak, almost 2 years ago

At some point you might want to consider "pivoting" to a gigantic database of all *dead* humans: there would be fewer data protection and privacy issues!

I have sometimes wanted to trace people's ancestry, or, more often, the descendants (of the parents) of a person who died half a century ago. It's depressing how difficult it is to look these things up. Various companies try to sell access to public records, but I don't do this often enough to be interested in paying for a subscription.

With something like this you should really also publish exactly where the information came from. There's a big difference between "an anonymous contributor supplied this" and "this comes from a database that we downloaded from whatever.gov.uk on this date and here's a copy of that database in case you want to check".

Some things that almost everyone is already aware of but I'll mention them anyway:

* The concepts of "first name" and "last name" only apply to some cultures.

* Most people have more than one name: women who change their name when they get married, middle names that may or may not get mentioned, names that are frequently abbreviated ("Kate" might be "Kate" or "Catherine" or ...), punctuation and diacritics that may be modified or omitted, Macdonald/McDonald/Mac Donald/..., various ways of transcribing the same name from a different alphabet, ...
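As an illustration of the last point, here is a minimal Python sketch of a loose matching key for surnames. It is not anything the Population Project is known to use; the function name `surname_key` and the specific rules (strip diacritics, ignore case and punctuation, fold a leading "Mc" into "Mac") are assumptions chosen for the example. It only covers Latin-alphabet spellings and does nothing for nicknames such as Kate/Catherine or for transliteration variants.

```python
import re
import unicodedata


def surname_key(name: str) -> str:
    """Return a loose matching key for a surname (hypothetical helper)."""
    # Decompose accented characters, then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", name)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Compare case-insensitively and keep only the letters a-z,
    # which removes spaces, apostrophes, hyphens and other punctuation.
    letters = re.sub(r"[^a-z]", "", stripped.casefold())
    # Assumed rule: treat a leading "mc" as "mac" so the variants collide.
    if letters.startswith("mc"):
        letters = "mac" + letters[2:]
    return letters


if __name__ == "__main__":
    for variant in ["Macdonald", "McDonald", "Mac Donald"]:
        print(variant, "->", surname_key(variant))
```

Records whose keys match would be candidates for a manual or stricter comparison, not automatic merges, since a key this loose will also group genuinely different names.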
jareklupinski, almost 2 years ago

> We have just completed step 1. Step 2 is daunting as hell. I have grown a business but I have never grown a website.

I've seen incredibly successful 'website' backends that were nothing more than a Google Sheet behind a few white-label HTML forms that feed into the spreadsheet; the backend almost never makes the product 'viral'

the 'magic' comes from successfully answering the question that's on everyone's mind: "why should I give you the one precious resource I have (time), and how/why will you make the rest of the limited time I have (before I die) any better?"

or, "what is the ROI on my invested time?" for example, Sam Altman has been giving away 50 bucks as the answer, and some people are lining up

as an exercise, pretend you are a stranger hearing about your own product for the first time (or better, pay someone $100 to pitch your own idea to you), and ask yourself the big question: "why should _I_ (the consumer, not the creator) care about spending time here?"

also, check with a few PR companies about the name "Population Project"...
ildon, almost 2 years ago

Have you considered restrictions on personal data collection due to regulations such as the GDPR? I'm curious how you are dealing with those, especially having to track all countries of the world.

Sorry if this is mentioned on the website; I've had a brief look and I might have missed it.