Evolution of words frequencies in porn

106 points by mazr, over 11 years ago

18 comments

route66, over 11 years ago
To speak of evolution when your time frame is 2008-2012 is somewhat far-fetched. But I believe I see a reassuring trend here: http://porngram.sexualitics.org/?q=erlang%2C+clojure%2C+Ada%2C+scala%2C+rust
sushirain, over 11 years ago
1. It doesn't count word frequencies, but substring frequencies. Moreover, if a substring appears more than once in a title, it is counted more than once. I drew this conclusion by submitting "a,b,c", and from their paper [1]:

    our algorithm strips out dashes and catches any occurrence of the query in the title, for example, 'blow' catches 'blowing', 'blowjobs'

This explains the results of these queries: "ada,erlang", "tea,beer". As an alternative they could have used a stemmer [2].

2. The "slow,fast" and "love,hardcore" queries illustrate an interesting trend, perhaps towards women or mainstream viewers.

[1] http://sexualitics.org/wp-content/uploads/2014/01/PORNSTUDIES_preprint.pdf

[2] http://nlp.stanford.edu/IR-book/html/htmledition/stemming-and-lemmatization-1.html
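A rough illustration of the difference described above, as a sketch only. `substring_count` follows one reading of the quoted sentence from the paper (dashes stripped, any occurrence counted), while `word_count` is a simple whole-word alternative rather than a real stemmer. The function names and sample titles are made up for illustration and are not taken from the authors' code.

```python
import re

def substring_count(query: str, title: str) -> int:
    """Count non-overlapping occurrences of `query` anywhere in `title`.

    Assumption: case is ignored and dashes are stripped, as the quoted
    sentence from the paper suggests.
    """
    cleaned = title.lower().replace("-", "")
    return cleaned.count(query.lower())

def word_count(query: str, title: str) -> int:
    """Count only whole-word matches of `query` in `title`, ignoring case."""
    words = re.findall(r"[a-z0-9]+", title.lower())
    return words.count(query.lower())

# Hypothetical titles, chosen so that "tea" hides inside other words.
titles = ["Tea for two", "Steam room", "Team tea party"]

print(sum(substring_count("tea", t) for t in titles))  # 4
print(sum(word_count("tea", t) for t in titles))       # 2
```

With these made-up titles, the substring count for "tea" also picks up "Steam" and "Team", which is the effect suspected behind the "tea,beer" and "ada,erlang" results.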
easy_rider, over 11 years ago
In my first 2 weeks of working at an adult company (as a dev, yes, it's sad) one of my tasks was to watch/scan 200+ videos and describe them. It's true, you run out of inspiration fast. Also, a hint: the "love" in the titles is probably explained by "love(s) to <insert profanity>". I don't think I ever used "hardcore" in a title.
probably_wrong, over 11 years ago
Traditional professions are still on top:

http://porngram.sexualitics.org/?q=pizza%2Cdelivery%2Cplumber%2C+programmer

I sense a business opportunity there.
ozh, over 11 years ago
Quite fun!

Next: provide the porn industry with a simple Markov chain script to generate probabilistic movie titles, and save them all those incredibly tiresome brainstorm sessions they must have to create new titles :)
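For the curious, a word-level Markov chain generator along these lines can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: the `<s>` and `</s>` boundary markers, the function names, and the neutral sample titles are all invented for illustration, and a real script would be trained on an actual title list.

```python
import random
from collections import defaultdict

START, END = "<s>", "</s>"  # hypothetical boundary markers

def build_chain(titles):
    """Map each word to the list of words observed to follow it."""
    chain = defaultdict(list)
    for title in titles:
        words = [START] + title.split() + [END]
        for current, nxt in zip(words, words[1:]):
            chain[current].append(nxt)
    return chain

def generate_title(chain, rng, max_words=12):
    """Walk the chain from START until END is drawn or max_words is reached."""
    if not chain.get(START):
        return ""  # nothing to generate from an empty corpus
    word, out = START, []
    while len(out) < max_words:
        # Followers are sampled in proportion to how often they were seen.
        word = rng.choice(chain[word])
        if word == END:
            break
        out.append(word)
    return " ".join(out)

# Neutral placeholder corpus; swap in any list of titles.
sample_titles = [
    "the quick brown fox",
    "the lazy dog sleeps",
    "a quick dog runs",
]
chain = build_chain(sample_titles)
print(generate_title(chain, random.Random(0)))
```

Because repeated followers are stored as duplicates in each list, `rng.choice` samples transitions by observed frequency without any explicit probability table.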
endriju, over 11 years ago
http://porngram.sexualitics.org/?q=btc%2Cusd
uloweb, over 11 years ago
http://porngram.sexualitics.org/?q=iphone%2Candroid
mapleoin, over 11 years ago
http://porngram.sexualitics.org/?q=hipster
edoloughlin, over 11 years ago
Strange: http://porngram.sexualitics.org/?q=tea,wine,beer,coffee
thinker, over 11 years ago
I wonder if we'll soon see an Upworthy for Porn: "You will never believe what this girl did to pay her rent"
guybrushT, over 11 years ago
Very interesting to see the dataset being made available. Whenever I want to do this kind of analysis, I always stumble at "how to get the data?". Their paper mentions that "We created a dedicated computer program to carry out the navigation and data collection tasks required to gather the metadata for all available videos...". I would love to see this program. More broadly, can anyone point me to the best resources (preferably Python) for learning to crawl/scrape this type of information?
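As a starting point for the scraping question, here is a minimal sketch using only Python's standard library. It is not the program from the paper, which is not shown in this thread. The `class="title"` markup, the `TitleExtractor` name, and the sample page are hypothetical; a real crawler would fetch pages (for example with `urllib.request.urlopen`), adapt the selector to the site's actual HTML, and respect the site's terms and `robots.txt`.

```python
from html.parser import HTMLParser

class TitleExtractor(HTMLParser):
    """Collect the text of every <a class="title"> element (hypothetical markup)."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs.
        if tag == "a" and ("class", "title") in attrs:
            self._in_title = True
            self.titles.append("")

    def handle_data(self, data):
        if self._in_title:
            self.titles[-1] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_title = False

# Inline stand-in for a downloaded listing page.
SAMPLE_PAGE = """
<ul>
  <li><a class="title" href="/v/1">First example title</a> <span>2010</span></li>
  <li><a class="title" href="/v/2">Second &amp; third</a> <span>2011</span></li>
  <li><a href="/about">About</a></li>
</ul>
"""

parser = TitleExtractor()
parser.feed(SAMPLE_PAGE)
print(parser.titles)  # ['First example title', 'Second & third']
```

Parsing an inline string keeps the example runnable offline; the same parser can be fed the decoded body of any fetched page.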
stcredzero, over 11 years ago
"words frequency"

Is the title a British thing? Like maths vs. math?
graylights, over 11 years ago
So I did gay vs lesbian and I was confused why there was a big spike in 2010 for gay that has since dropped off. Is this an anomaly in their sampling?

Also Obama's numbers have really dropped compared to Bush: http://porngram.sexualitics.org/?q=bush%2Cobama
6d0debc071, over 11 years ago
I suppose we may as well do the obvious ones

http://porngram.sexualitics.org/?q=BDSM%2Ctorture%2Cpain%2Crape

:/
misnome, over 11 years ago
Kind of interesting, but really needs statistical error bounds
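One common way to attach error bounds to a per-year frequency is a Wilson score interval on the proportion of titles matching a query. The sketch below is an assumption about how that could be done, not something the site or the paper is known to provide; the function name and the counts are hypothetical.

```python
import math

def wilson_interval(hits: int, total: int, z: float = 1.96):
    """Wilson score interval for the proportion hits/total.

    z = 1.96 corresponds to roughly 95% confidence.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    p = hits / total
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return centre - half, centre + half

# Hypothetical counts: 50 matching titles out of 1000 in one year.
low, high = wilson_interval(50, 1000)
print(f"{low:.4f} .. {high:.4f}")
```

The interval narrows as the number of titles per year grows, so sparse early years would show visibly wider bands than later ones.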
himal, over 11 years ago
http://porngram.sexualitics.org/?q=HIV%2CAIDS
jpswade, over 11 years ago
http://porngram.sexualitics.org/?q=cup
bobowzki, over 11 years ago
"sister" :-)