Evolution of words frequencies in porn

106 points by mazr over 11 years ago

18 comments

route66 over 11 years ago
To speak of evolution when your time frame is 2008-2012 is somewhat far-fetched. But I believe I see a reassuring trend here: http://porngram.sexualitics.org/?q=erlang%2C+clojure%2C+Ada%2C+scala%2C+rust
sushirain over 11 years ago
1. It doesn't count word frequencies, but sub-string frequencies. Moreover, if a sub-string appears more than once per title, then it is counted more than once. I drew this conclusion by submitting "a,b,c". And from their paper [1]:
    our algorithm strips out dashes and catches any occurrence of the query in the title, for example, 'blow' catches 'blowing', 'blowjobs'
This explains the results of these queries: "ada,erlang", "tea,beer". As an alternative they could have used a stemmer [2].
2. The "slow,fast" and "love,hardcore" queries illustrate an interesting trend. Perhaps towards women or mainstream viewers.
[1] http://sexualitics.org/wp-content/uploads/2014/01/PORNSTUDIES_preprint.pdf
[2] http://nlp.stanford.edu/IR-book/html/htmledition/stemming-and-lemmatization-1.html
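To make the sub-string point concrete, here is a minimal Python sketch contrasting sub-string counting (dashes stripped, every occurrence counted, as the paper's quote describes) with whole-word counting. The function names and sample titles are made up for illustration; this is not the site's actual code.

```python
import re

def substring_count(query, titles):
    """Count every occurrence of `query` as a sub-string, after stripping dashes."""
    q = query.lower()
    return sum(t.lower().replace("-", "").count(q) for t in titles)

def word_count(query, titles):
    """Count only whole-word matches of `query` (word boundaries on both sides)."""
    pattern = re.compile(r"\b" + re.escape(query.lower()) + r"\b")
    return sum(len(pattern.findall(t.lower())) for t in titles)

# Hypothetical titles, chosen so that "tea" hides in "steam" and "ada" in "canada".
titles = ["Steam room tea party", "Tea for two", "Ada visits Canada"]

for query in ("tea", "ada"):
    print(query, substring_count(query, titles), word_count(query, titles))
```

With sub-string matching, "tea" also hits "Steam" and "ada" also hits "Canada", which would inflate short queries in the way described above. A stemmer would sit in between: it would merge inflected forms of the same word without matching unrelated words that merely contain the query.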
easy_rider over 11 years ago
In my first 2 weeks of working at an adult company (as a dev, yes, it's sad) one of my tasks was to watch/scan 200+ videos and describe them. It's true, you run out of inspiration fast. Also, a hint: the "love" in the titles is probably explained by "love(s) to <insert profanity>". I don't think I ever used "hardcore" in a title.
probably_wrong over 11 years ago
Traditional professions are still on top:
http://porngram.sexualitics.org/?q=pizza%2Cdelivery%2Cplumber%2C+programmer
I sense a business opportunity there.
ozh over 11 years ago
Quite fun!
Next: provide the porn industry a simple Markov chain script to generate probabilistic porn movie titles, and save them all those incredibly tiresome brainstorm sessions they must have to create new titles :)
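For the curious, a first-order (word-level) Markov chain of the kind suggested here fits in a few lines of Python. This is only a sketch under stated assumptions: the training titles are invented placeholders, and `build_chain` and `generate_title` are hypothetical names, not part of any existing tool.

```python
import random
from collections import defaultdict

START, END = "<s>", "</s>"  # sentinel tokens marking title boundaries

def build_chain(titles):
    """Map each word to the list of words observed right after it."""
    chain = defaultdict(list)
    for title in titles:
        words = [START] + title.split() + [END]
        for current, following in zip(words, words[1:]):
            chain[current].append(following)
    return chain

def generate_title(chain, rng, max_words=12):
    """Walk the chain from START until END is drawn or max_words is reached."""
    word, out = START, []
    while len(out) < max_words:
        word = rng.choice(chain[word])  # repeated followers are more likely
        if word == END:
            break
        out.append(word)
    return " ".join(out)

# Placeholder training data; a real run would use the published title dataset.
sample_titles = [
    "plumber fixes the sink",
    "pizza delivery arrives late",
    "plumber arrives late again",
]
chain = build_chain(sample_titles)
print(generate_title(chain, random.Random(0)))
```

Storing followers in a list with repeats keeps the sampling proportional to observed frequency without any explicit probability table. The chain must be built from at least one non-empty title, otherwise there is nothing to sample from START.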
endriju over 11 years ago
http://porngram.sexualitics.org/?q=btc%2Cusd
uloweb over 11 years ago
http://porngram.sexualitics.org/?q=iphone%2Candroid
mapleoin over 11 years ago
http://porngram.sexualitics.org/?q=hipster
edoloughlin over 11 years ago
Strange: http://porngram.sexualitics.org/?q=tea,wine,beer,coffee
thinker over 11 years ago
I wonder if we'll soon see an Upworthy for Porn: "You will never believe what this girl did to pay her rent"
guybrushT over 11 years ago
Very interesting to see the dataset being made available. Whenever I want to do this kind of analysis, I always stumble at "how to get the data?". Their paper mentions that "We created a dedicated computer program to carry out the navigation and data collection tasks required to gather the metadata for all available videos...". I would love to see this program. More broadly, can anyone point me to the best resources (preferably Python) for learning to crawl/scrape this type of information?
stcredzero over 11 years ago
"words frequency"
Is the title a British thing? Like maths vs. math?
graylights over 11 years ago
So I did gay vs lesbian and I was confused why there was a big spike in 2010 for gay that has since dropped off. Is this an anomaly in their sampling?
Also Obama's numbers have really dropped compared to Bush: http://porngram.sexualitics.org/?q=bush%2Cobama
6d0debc071 over 11 years ago
I suppose we may as well do the obvious ones
http://porngram.sexualitics.org/?q=BDSM%2Ctorture%2Cpain%2Crape
:/
misnome over 11 years ago
Kind of interesting, but really needs statistical error bounds
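One way such bounds could be computed, assuming each plotted point is a proportion (titles matching the query divided by all titles in that period), is a Wilson score interval. The counts below are hypothetical and the function name is illustrative; the site may well define its frequencies differently.

```python
import math

def wilson_interval(matches, total, z=1.96):
    """Wilson score interval for a proportion; z=1.96 gives roughly 95% coverage."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = matches / total
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return center - half, center + half

# Hypothetical counts: titles containing the query / all titles in that period.
low, high = wilson_interval(matches=12, total=4000)
print(f"frequency = {12 / 4000:.4f}, interval = [{low:.4f}, {high:.4f}]")
```

For rare terms the interval is wide relative to the frequency itself, so a year-to-year wiggle in a low-count query may not mean much.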
himal over 11 years ago
http://porngram.sexualitics.org/?q=HIV%2CAIDS
jpswade over 11 years ago
http://porngram.sexualitics.org/?q=cup
bobowzki over 11 years ago
"sister" :-)