TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

A better approach to determining gender from a first name

27 pointsby Stromgrenover 11 years ago

14 comments

AndrewDuckerover 11 years ago
Please just don&#x27;t.<p>There is no point antagonising people by guessing information about them wrongly - particularly if it&#x27;s something they&#x27;ve become sensitised to by it occurring frequently.<p>If you need to know someone&#x27;s gender (and largely, you don&#x27;t), then ask them.
评论 #6475293 未加载
评论 #6475333 未加载
评论 #6475497 未加载
评论 #6475274 未加载
评论 #6475271 未加载
marijnover 11 years ago
&gt; {&quot;name&quot;:&quot;marijn&quot;,&quot;gender&quot;:&quot;female&quot;,&quot;probability&quot;:&quot;1.00&quot;,&quot;count&quot;:1}<p>Except, of course, that I am male. My name is used for both genders. The thing completely failed on a few other ambiguous names I tried. I&#x27;ll second AndrewDucker&#x27;s opinion—just don&#x27;t.
评论 #6475443 未加载
评论 #6475406 未加载
breyover 11 years ago
Interesting from a machine-learning perspective - but this strikes me as a solution looking for a problem.<p>If any service needs to know gender (and I&#x27;m having a hard time thinking of times you NEED to know gender - dating sites?) - why not just ask? surely in a situation where you&#x27;re reliant on having accurate gender information, guessing from $firstname and getting it wrong is worse than asking.
评论 #6475337 未加载
评论 #6475474 未加载
评论 #6475288 未加载
batemanesqueover 11 years ago
I&#x27;m sure this is interesting from a statistical point of view, but does the tech scene really need yet more reinforcement of a binary view of gender?
评论 #6475357 未加载
Filligreeover 11 years ago
The &quot;probability&quot; return value appears to be a straight average; it returns 1 for &quot;Peter&quot;, which is almost guaranteed to be incorrect - all it takes is a single female Peter, anywhere on the planet.<p>A better approach, in the absence of more complex models, would be to use Laplace&#x27;s sunrise formula.
评论 #6475502 未加载
评论 #6475365 未加载
kmike84over 11 years ago
In morphologically rich languages (like Russian) the most discriminative feature for detecting gender could be the word shape of last name or middle name, not the first name. So in many languages there is no way to have meaningful gender prediction by analyzing just the first name. Relative gender frequency for the first name is an useful information, but it is just not enough for reliable gender prediction.
bromagosaover 11 years ago
I guess it needs a better training DB, it returns {&quot;gender&quot;: null} for not-so-common names in languages other than English...<p><a href="http://api.genderize.io/?name=eloi&amp;language_id=ca" rel="nofollow">http:&#x2F;&#x2F;api.genderize.io&#x2F;?name=eloi&amp;language_id=ca</a><p><a href="http://api.genderize.io/?name=tomeu&amp;language_id=ca" rel="nofollow">http:&#x2F;&#x2F;api.genderize.io&#x2F;?name=tomeu&amp;language_id=ca</a><p><a href="http://api.genderize.io/?name=rigoberta&amp;language_id=es" rel="nofollow">http:&#x2F;&#x2F;api.genderize.io&#x2F;?name=rigoberta&amp;language_id=es</a><p><a href="http://api.genderize.io/?name=presentaci%C3%B3n&amp;language_id=es" rel="nofollow">http:&#x2F;&#x2F;api.genderize.io&#x2F;?name=presentaci%C3%B3n&amp;language_id=...</a><p>Credit for distinguishing between names in languages, though! Joan returns female in English, but male in Catalan.
eksithover 11 years ago
This project is a fine example of the &quot;Falsehoods Programmers Believe About Names&quot; <a href="http://www.kalzumeus.com/2010/06/17/falsehoods-programmers-believe-about-names/" rel="nofollow">http:&#x2F;&#x2F;www.kalzumeus.com&#x2F;2010&#x2F;06&#x2F;17&#x2F;falsehoods-programmers-b...</a>
gambitingover 11 years ago
Bear in mind that in some languages this problem doesn&#x27;t exist. In Polish for example, all female names end with an &quot;a&quot;. There is not a single exception from that rule, so if you see a name ending with an &quot;a&quot; it is always a female name.
评论 #6475487 未加载
nefastiover 11 years ago
I thought Hackers News had more people speaking more&#x2F;other languages than english.<p>A lot of complaints, excluding the binary gender complaints, totaly forget about how languages like portuguese &#x2F; french have male &#x2F; female differences for nouns and other language constructs.<p>Let´s say I have to build a phrase where I have the user profession like engineer and I don&#x27;t know upfront, for portuguese male would be &quot;engenheiro&quot; or &quot; engenheira&quot; for female. It does have a lot of practical uses. And with a big enough training, the decision to use for that user is on your hands.
cascaover 11 years ago
For Icelandic names, it&#x27;s easy to identify the gender by looking at the last name. For example Bjarni Benediktsson is definitely male while Katrín Jakobsdóttir is definitely female.<p>Another strategy is to use gender-neutral terms until you find out the gender, as asking directly might be considered rude in some cultures.
评论 #6475523 未加载
Grue3over 11 years ago
Doesn&#x27;t know my name. <a href="http://api.genderize.io/?name=timofei" rel="nofollow">http:&#x2F;&#x2F;api.genderize.io&#x2F;?name=timofei</a>
anonemouscowardover 11 years ago
{ &quot;name&quot;: &quot;петя&quot;, &quot;gender&quot;: &quot;female&quot;, &quot;probability&quot;: &quot;1.00&quot;, &quot;count&quot;: 1 }<p>Yeah, how about no.
ludicastover 11 years ago
I like this from a usability standpoint. Just as some forms auto-fill the city&#x2F;state based on the zip (and might get it wrong), this enables something similar. And it might get it wrong, but if your mom gave you a girl&#x27;s name* blame her.<p>It also seems accurate:<p>Pat = about 50&#x2F;50 David = All man Jessica = All woman<p>Also, wrt to &quot;binary gender identity&quot; complaints, are we all college freshmen here?<p>* my own name (Nord) sucks and gave a gender of null. Spent my whole life being called Nerd, Nora, etc. I&#x27;m not flipping out.
评论 #6475382 未加载