TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Beer and Data Science – Looking at topic modeling in multi-aspect reviews

64 pointsby bcohen123about 10 years ago

6 comments

bcaineabout 10 years ago
Nice read. I did something sort of similar with the same dataset about a year ago. I compared LDA (Latent Dirichlet Allocation) to TF-IDF as tools to find similar beers based on their review text. Lots of intuitive and funny topics discovered.<p>I suggest you play with LDA, it seemed to work really well at generating topics. There is also a lot of fascinating, very readable research using it. Check out SNAPs work on the same dataset [1] and some of the Yelp Dataset challenge winners [2]. If you end up interested in doing so, Gensim [3] was pleasant enough to work with.<p>[1] <a href="http:&#x2F;&#x2F;snap.stanford.edu&#x2F;data&#x2F;web-BeerAdvocate.html" rel="nofollow">http:&#x2F;&#x2F;snap.stanford.edu&#x2F;data&#x2F;web-BeerAdvocate.html</a><p>[2] <a href="http:&#x2F;&#x2F;www.yelp.com&#x2F;dataset_challenge" rel="nofollow">http:&#x2F;&#x2F;www.yelp.com&#x2F;dataset_challenge</a><p>[3] <a href="https:&#x2F;&#x2F;radimrehurek.com&#x2F;gensim&#x2F;wiki.html#latent-dirichlet-allocation" rel="nofollow">https:&#x2F;&#x2F;radimrehurek.com&#x2F;gensim&#x2F;wiki.html#latent-dirichlet-a...</a>
gjredaabout 10 years ago
Great post! I&#x27;ve been thinking about writing something similar with that same BeerAdvocate data. Good job beating me to it :)<p>Instead, I ended up writing a satirical beer snob bot [1] which tweets nonsensical beer reviews using Markov Chains. Some are bad, but some are pure gold. You can read about it here [2]. The code&#x27;s also on GitHub [3].<p>[1] <a href="https:&#x2F;&#x2F;twitter.com&#x2F;BeerSnobSays" rel="nofollow">https:&#x2F;&#x2F;twitter.com&#x2F;BeerSnobSays</a><p>[2] <a href="http:&#x2F;&#x2F;www.gregreda.com&#x2F;2015&#x2F;03&#x2F;30&#x2F;beer-review-markov-chains&#x2F;" rel="nofollow">http:&#x2F;&#x2F;www.gregreda.com&#x2F;2015&#x2F;03&#x2F;30&#x2F;beer-review-markov-chains...</a><p>[3] <a href="https:&#x2F;&#x2F;github.com&#x2F;gjreda&#x2F;beer-snob-says" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;gjreda&#x2F;beer-snob-says</a>
评论 #9388622 未加载
JasonCECabout 10 years ago
For anyone interested in beer and data science, my startup[1] uses machine learning and artificial intelligence to build flavor profiling and quality control tools for craft beverage producers.<p>Our models flag and predict flaws, taints, contaminations, and batch-to-batch deviations in real time from human sensory data. We then leverage our clients quality control data for flavor profile optimization, demographic targeting, and cognitive marketing - helping them sell consistently better products to their most valuable consumers.<p>[1] www.Gastrograph.com
评论 #9388278 未加载
socceroosabout 10 years ago
I just came across a relevant site this morning. Hilarious hipster brew review satire: <a href="http:&#x2F;&#x2F;vicioustasting.com&#x2F;" rel="nofollow">http:&#x2F;&#x2F;vicioustasting.com&#x2F;</a>
评论 #9388040 未加载
archimedespiabout 10 years ago
Love the license =D
评论 #9386607 未加载
cobranetabout 10 years ago
How to get data ?
评论 #9387966 未加载