Merry Christmas YCombinatorions, I am looking to improve my data analytics skills, basically how to identify patterns, trends, useful insights from data. Can folks some suggest good web based tools for this please?<p>Not looking for books, but rather sample datasets I can use to visualize, analyze and test if I found the right insights.
From a product analytics perspective I’d suggest :<p>1. Think of a product that exists. Define a goal for the product and success metrics you will use. Dau, mau, user retention,incremental revenue etc.
2. Come up with gaurd rail metrics
3. Define performance and reliability metrics as well.<p>Now try to figure out how you would construct queries to answer these questions. And how would you visualize this info.<p>Then if you can find datasets or create synthetic data sets to actually write these queries or better yet create pipelines that ultimately feed a dashboard I think would be worthwhile.
I would start with a familiar dataset and then see if you can prove your intuitions with the data. For example, if you use a smartwatch you can download your sleep data and check if it supports the hypothesis that during weekends you go to sleep later. Then, you can also look for other insights and then check if they are compatible with your prior hypothesis.
On the off chance that you're into sports, my company StatsBomb have some free data for both soccer and American Football up on GitHub:<p><a href="https://github.com/statsbomb/open-data">https://github.com/statsbomb/open-data</a><p><a href="https://github.com/statsbomb/amf-open-data">https://github.com/statsbomb/amf-open-data</a>
Data Is Plural is a weekly newsletter (and seasonal podcast) of useful/curious datasets, published by Jeremy Singer-Vine. There have been 356 editions, dating from October 21, 2015 to December 20, 2023.<p><a href="https://www.data-is-plural.com/" rel="nofollow noreferrer">https://www.data-is-plural.com/</a>
<a href="https://superset.datatest.ch/superset/dashboard/7/" rel="nofollow noreferrer">https://superset.datatest.ch/superset/dashboard/7/</a><p>Superset has a great dataset with prebuilt visualization for historical video game sales if that's interesting to you
Shameless plug, we're building exactly this: www.datawars.io<p>For now, we're hyperfocused on Python/Pandas/Scikit-learn as we're just getting stated (we launched in June). But we'll expand more tracks for data analytics and data engineering.
Merry Christmas buddy.<p>You'll find a ton of public datasets on GitHub [1].<p>Maven Analytics offers a monthly data analytics challenge [2] that you can enter for free. See their past competitions for some interesting datasets.<p>As I'm based in Ireland I'll also recommend the Irish Data Portal [3].<p>[1] <a href="https://github.com/awesomedata/awesome-public-datasets">https://github.com/awesomedata/awesome-public-datasets</a>
[2] <a href="https://mavenanalytics.io/challenges" rel="nofollow noreferrer">https://mavenanalytics.io/challenges</a>
[3] <a href="https://data.gov.ie/" rel="nofollow noreferrer">https://data.gov.ie/</a>