TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: List of movie and book characters I use during application development

72 点作者 NameNickHN超过 9 年前

12 条评论

spdustin超过 9 年前
Would you humor a fellow HNer and tell me if you&#x27;re in your early forties?<p>I happen to be working on a toy machine learning project that, based on the fictional characters known by someone, predicts their approximate age. Your list is the first organic validation set that happened onto my machine!
评论 #10952837 未加载
评论 #10952810 未加载
评论 #10953178 未加载
评论 #10960691 未加载
评论 #10954423 未加载
benten10超过 9 年前
Nice! May I ask if you collected those manually? Perhaps they should be sorted&#x2F;divided according to the genre they&#x27;re from? Books&#x2F;Movies&#x2F;Tv-shows, etc?<p>This list made be realize a sort-of annoying (sometimes) tendency I seem to have developed. It appears that my first reaction to cool things is now not wonder but &#x27;I need to engineer the shit out of fit&#x27;.My first thought after looking at the list was not &#x27;wow, cool&#x27;, but more of &#x27;so if I use Named Entity Recognition, and a large corpus, I could have tens of thousands of such names in hours. Maybe I can catch up on computational linguistics literature on the issue, and even identify the relative importance of characters on the text. Should be a day-long project&#x27;. Need to learn to enjoy things for what they are, sigh.
评论 #10954190 未加载
评论 #10952632 未加载
tickthokk超过 9 年前
Nice list, for just names it&#x27;s a great resource. I see some umlauts, spaces in names, punctuation in names, and it&#x27;s clearly splittable for first&#x2F;last name fields. I&#x27;m not sure what other ground could be covered that someone would need to account for.<p>I&#x27;m not &quot;book&quot; cultured, so a lot of names I don&#x27;t recognize, but nice shout outs to 30 Rock and Anchorman :p<p>Github complains that it&#x27;s not a properly formatted CSV file. Maybe consider a TSV? It&#x27;d probably still complain.<p>I&#x27;ve yet to use it, but it&#x27;s been in my back pocket for when I need it. This PHP package looks nice if you need more than just names: <a href="https:&#x2F;&#x2F;github.com&#x2F;fzaninotto&#x2F;Faker" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;fzaninotto&#x2F;Faker</a>
评论 #10952653 未加载
RobertoG超过 9 年前
I didn&#x27;t recognize most of them, I had to search some to get an idea.<p>If the goal is testing, an improvement would be to add some internationalization. There are not other than English characters there. You want to be sure that your first foreigner don&#x27;t break your program.<p>Actually, maybe it would be a nice project to accept pull request from around the world and create an standard international data set.
评论 #10953763 未加载
vbsteven超过 9 年前
Nice list. If I need to generate names for sample data I usually just use the Faker library.<p>When I&#x27;m writing database fixtures for use in tests, I like to manually choose names from movies&#x2F;tv-shows for related entities.<p>For example for an Account with multiple Users I will pick Phil Dunphy for the owner role, Claire Dunphy for the admin role and Luke&#x2F;Haley&#x2F;Alex dunphy for regular user roles.
评论 #10952808 未加载
inanutshellus超过 9 年前
At work we needed a &quot;clean&quot; dataset for a five-character code. We wanted it to be something you could say out loud, e.g. &quot;Hey, are you working on FORKS?&quot; &quot;No, I&#x27;m working on CHUCK&quot;, so random wasn&#x27;t an option, and we were afraid an algorithm, like &quot;consonant-vowel-consonant...&quot; would randomly generate naughty words.<p>We ended up using our customers&#x27; first names and it was a disaster. We had all kinds of joke entries put in, like &quot;JERK&quot;... My favorite customer name was &quot;POOP LENGTH&quot;. lol. &#x2F;facepalm.<p>Anyway, so in this multimillion dollar enterprise application we&#x27;re showing &quot;POOP&quot; to the whole company.<p>At least it was an intra-enterprise-only app.
nudpiedo超过 9 年前
Does that worth to be shared in this community? has that ever been a problem worth mention to someone? I am only aware to problems related to those names when a living person feels they are using their name&#x2F;image in a defamatory or unauthorized way but I think anyone can find by herself a fiction or historical name for that task (or generics such as John Smith&#x2F;Max Mustermann)
评论 #10952607 未加载
anotherevan超过 9 年前
My go to name is Keyser Söze, which I use whenever I&#x27;m writing examples in documentation and such. It also has the advantage of containing a unicode character.
onion2k超过 9 年前
This isn&#x27;t a great list because it assumes far too much about what a name is. Patio11 wrote a great blog post about what developers frequently get wrong when it comes to people and names; <a href="http:&#x2F;&#x2F;www.kalzumeus.com&#x2F;2010&#x2F;06&#x2F;17&#x2F;falsehoods-programmers-believe-about-names&#x2F;" rel="nofollow">http:&#x2F;&#x2F;www.kalzumeus.com&#x2F;2010&#x2F;06&#x2F;17&#x2F;falsehoods-programmers-b...</a>
评论 #10953269 未加载
chris_wot超过 9 年前
You really need to add Jack Reacher.
_mc超过 9 年前
Please add Phoebe Buffay &amp; Regina Phalange .. I will send a pull request may be :D
评论 #10953799 未加载
nazarewk超过 9 年前
Love the beginning :P
评论 #10952618 未加载