Ask HN: How would you go about doing a cluster analysis on Tweets?

2 points by nate over 6 years ago
I'm trying to analyze which Tweet features go along with more engagement.

For example, I'd like to know if a Tweet gets more engagement if it has hashtags, gifs, movies, mentions, etc. There have been analyses like this before (see https://www.quicksprout.com/2014/03/05/what-type-of-content-gets-shared-the-most-on-twitter/), but their methodology isn't shared, so the analysis might not be accurate. I'm also not sure if its results are still valid 4-5 years later.

I'm curious how others would go about this. Would you take the top 10% of a bunch of tweets and do some basic counts on a single variable (has media vs. does not have media), then visualize which variable makes the impact?

Or would you explore how multiple variables interact? Would you do some kind of k-means cluster analysis? All pointers to education, methodologies, helpful tools/software and previous attempts at this are very welcome.
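As a starting point for the single-variable and multi-variable questions above, here is a minimal sketch in plain Python. Everything in it is an assumption for illustration: the toy `TWEETS` records, the field names (`has_media`, `has_hashtag`, `has_mention`, `engagement`) and the helper functions are hypothetical, and the numbers are made up rather than taken from real data. It compares median engagement with and without one feature, checks how over-represented a feature is among the top slice of tweets, and then groups by combinations of features to get a rough look at interactions.

```python
from collections import defaultdict
from statistics import median

# Hypothetical toy data: field names and values are invented for illustration.
TWEETS = [
    {"has_media": True,  "has_hashtag": True,  "has_mention": False, "engagement": 40},
    {"has_media": True,  "has_hashtag": False, "has_mention": False, "engagement": 30},
    {"has_media": True,  "has_hashtag": True,  "has_mention": True,  "engagement": 50},
    {"has_media": False, "has_hashtag": True,  "has_mention": False, "engagement": 10},
    {"has_media": False, "has_hashtag": False, "has_mention": False, "engagement": 5},
    {"has_media": False, "has_hashtag": False, "has_mention": True,  "engagement": 8},
    {"has_media": True,  "has_hashtag": False, "has_mention": True,  "engagement": 25},
    {"has_media": False, "has_hashtag": True,  "has_mention": True,  "engagement": 12},
]


def median_by_feature(tweets, feature):
    """Median engagement for tweets with and without a single feature."""
    groups = defaultdict(list)
    for tweet in tweets:
        groups[bool(tweet[feature])].append(tweet["engagement"])
    return {present: median(values) for present, values in groups.items()}


def top_fraction_share(tweets, feature, fraction=0.10):
    """Share of tweets carrying the feature in the top slice vs. overall."""
    ranked = sorted(tweets, key=lambda t: t["engagement"], reverse=True)
    top = ranked[:max(1, int(len(ranked) * fraction))]
    share_top = sum(bool(t[feature]) for t in top) / len(top)
    share_all = sum(bool(t[feature]) for t in tweets) / len(tweets)
    return share_top, share_all


def median_by_combination(tweets, features):
    """(median engagement, tweet count) for each observed feature combination."""
    groups = defaultdict(list)
    for tweet in tweets:
        key = tuple(bool(tweet[f]) for f in features)
        groups[key].append(tweet["engagement"])
    return {key: (median(values), len(values)) for key, values in groups.items()}


if __name__ == "__main__":
    print(median_by_feature(TWEETS, "has_media"))
    print(top_fraction_share(TWEETS, "has_media", fraction=0.25))
    for combo, stats in median_by_combination(TWEETS, ["has_media", "has_hashtag"]).items():
        print(combo, stats)
```

Medians are used here because engagement counts tend to be skewed by a few viral tweets. Keeping the per-group count next to each median matters once features are combined, since some combinations will have very few tweets. These are descriptive comparisons only; they show association, not that a feature causes the engagement.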

1 comment

neduma over 6 years ago
Can this help? https://medium.com/@mroth/how-i-built-emojitracker-179cfd8238ac