TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Looking for the author behind NYT op-ed using data science

20 pointsby ehudlaover 6 years ago

7 comments

was_boringover 6 years ago
Awesome! I was actually thinking of doing something similar, but with some key differences:<p>- try and find transcripts and speeches each &quot;suspect&quot; has given.<p>- Probably use a classification algo to try and determine who the author could be (most likely using KNN)<p>- I hadn&#x27;t determined feature building, but was thinking of a either a simple one-hot encoding, entity embedding, or tf-idf.<p>I then encountered a moral dilemma -- do I do this and it potentially becomes ammo for messing with someone else&#x27;s career (when that is not my intention)?
bchernyover 6 years ago
Aren&#x27;t there ethical considerations in outing the author? Clearly he&#x2F;she intended to remain anonymous, for obvious reasons.
评论 #17943957 未加载
评论 #17945706 未加载
rossdavidhover 6 years ago
Interesting. I put more weight on the authors caveats about this method, than on the method itself, which is fine since it was I think just presented as an example of what could be done.<p>Also, if the person most likely (by this analysis) is actually the one who wrote the op-ed, it was either: - the POTUS, which seems unlikely (although who knows) - the secretary for the POTUS, who types his tweets as well - the one person in the President&#x27;s circle who he cannot fire
hk__2over 6 years ago
I wonder if there could be a way to prevent this by e.g. paying a bunch of random people to rewrite parts of the text in their “own” style (with a review to ensure the meaning isn’t lost).
RickJWagnerover 6 years ago
I think if the source cares about his&#x2F;her job, they&#x27;d intentionally throw in some misdirection (i.e. lodestar&#x27;)
rootusrootusover 6 years ago
This is definitely interesting. However, I have assumed from the beginning that, protests to the contrary notwithstanding, this op-ed was thoroughly edited by <i>someone</i> to prevent such an analytical outing.
chxover 6 years ago
<a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=17943416" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=17943416</a> ...
评论 #17943808 未加载