科技回声

23 comments

tibbon, over 13 years ago

Now I'm just waiting for someone to make a Twitter bot that randomly samples tweets and responds to them with this...

notb, over 13 years ago

I think your negative sample set is a little biased. Since all the phrases start with verbs like "was in the car" or "went to the park", these kinds of phrases are given lower probabilities. For example:<pre><code> > twss.prob("was on a stiff pole"); 0.016050826334564946 </code></pre> Only a 1.6% chance of "that's what she said"?!? EDIT: Counterexample:<pre><code> > twss.prob("that's one stiff pole"); 0.9767718880285885</code></pre>
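
The bias described above can be illustrated with a toy unigram naive Bayes classifier. This is only a sketch under stated assumptions: the six training phrases are made up for illustration, the helper names (`tokenize`, `train`, `logScore`, `prob`) are hypothetical, and none of it is the actual twss.js implementation or its training data.

```javascript
// Toy unigram naive Bayes with add-one smoothing.
// Hypothetical sketch, NOT the twss.js code or its data.

function tokenize(text) {
  return text.toLowerCase().match(/[a-z']+/g) || [];
}

function train(docs) {
  // docs: array of { text, label } with label "pos" or "neg"
  const model = {
    counts: { pos: {}, neg: {} }, // per-class token counts
    totals: { pos: 0, neg: 0 },   // per-class total tokens
    docs: { pos: 0, neg: 0 },     // per-class document counts
    vocab: new Set(),
  };
  for (const { text, label } of docs) {
    model.docs[label] += 1;
    for (const tok of tokenize(text)) {
      model.counts[label][tok] = (model.counts[label][tok] || 0) + 1;
      model.totals[label] += 1;
      model.vocab.add(tok);
    }
  }
  return model;
}

function logScore(model, label, tokens) {
  const nDocs = model.docs.pos + model.docs.neg;
  let score = Math.log(model.docs[label] / nDocs); // log prior
  for (const tok of tokens) {
    const c = model.counts[label][tok] || 0;
    // add-one (Laplace) smoothing over the shared vocabulary
    score += Math.log((c + 1) / (model.totals[label] + model.vocab.size));
  }
  return score;
}

function prob(model, text) {
  const tokens = tokenize(text);
  const lp = logScore(model, "pos", tokens);
  const ln = logScore(model, "neg", tokens);
  return 1 / (1 + Math.exp(ln - lp)); // P(pos | text), normalized in log space
}

// Made-up training set: every negative phrase starts with a verb.
const samples = [
  { text: "that's one stiff pole", label: "pos" },
  { text: "that's so hard", label: "pos" },
  { text: "that's a big one", label: "pos" },
  { text: "was in the car", label: "neg" },
  { text: "went to the park", label: "neg" },
  { text: "was at the store", label: "neg" },
];
const model = train(samples);

console.log(prob(model, "on a stiff pole"));
console.log(prob(model, "was on a stiff pole")); // lower: "was" only occurs in negatives
```

Because "was" appears only in the negative phrases, prepending it pulls the score down regardless of the rest of the phrase, which is the effect the comment points at.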

blahpro, over 13 years ago

Related: <a href="http://www.quora.com/How-would-you-programmatically-parse-a-sentence-and-decide-whether-to-answer-thats-what-she-said" rel="nofollow">http://www.quora.com/How-would-you-programmatically-parse-a-...</a>

driverdan, over 13 years ago

An interesting (and funny) exercise. For those interested in neural networks and Bayesian classifiers, check out the brain.js library: <a href="http://harthur.github.com/brain/" rel="nofollow">http://harthur.github.com/brain/</a> It works in both node and the browser.

yuvadam, over 13 years ago

A while back I was interested in implementing a much less naive algorithm for classifying TWSS expressions, based on this [1] paper. Never actually got around to finishing the work. Interesting problem though, and nice work. [1] - <a href="http://www.cs.washington.edu/homes/brun/pubs/pubs/Kiddon11.pdf" rel="nofollow">http://www.cs.washington.edu/homes/brun/pubs/pubs/Kiddon11.p...</a>

adam_albrecht, over 13 years ago

Well that open source project left me satisfied and smiling

TwistedWeasel, over 13 years ago

Soon to be implemented in all IRC bots the world over

bjornsteffanson, over 13 years ago

This is probably the first time I've understood node.js.

VolatileVoid, over 13 years ago

I was wondering if anyone knew of a place where I could learn about this stuff in general. I know nothing about unigrams, bigrams, trigrams, tf-idf, Bayesian filtering, etc. Maths - while not awful - is not my strongest point, but I think I could grok a well-written tutorial to this stuff (with code examples!). I was hoping/wondering if anyone knew of sites I could start learning about this from? I find this very interesting, and I'm sure it could be highly useful and applicable to many different types of problems...

zeratul, over 13 years ago

DanielRapp: in file twss.js/lib/classifier/knn.js, the number of NN should be odd to prevent ties [EDIT: also, NN should be large enough to prevent over-fitting; a small NN would mean that the difference (decision boundary) between twss and not-twss is highly non-linear; you need to implement cross-validation to find the best NN]. Note to self: machine learning using node.js; what's the speed of calculations, what's the memory management in node.js, can I find a pure JS implementation of SVM?
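
The suggestion above (odd neighbor counts, chosen by cross-validation) might look roughly like the following. It is a minimal sketch, not the contents of knn.js: the Jaccard token distance, the leave-one-out scheme, the function names, and the sample phrases are all assumptions made for illustration.

```javascript
// Hypothetical sketch: pick an odd k for a two-class kNN by
// leave-one-out cross-validation. Not the twss.js knn.js code.

function tokenSet(text) {
  return new Set(text.toLowerCase().match(/[a-z']+/g) || []);
}

// Jaccard distance between two token sets (0 = identical, 1 = disjoint).
function jaccardDistance(a, b) {
  let inter = 0;
  for (const t of a) if (b.has(t)) inter += 1;
  const union = a.size + b.size - inter;
  return union === 0 ? 0 : 1 - inter / union;
}

// Majority vote among the k nearest training examples (labels are 0 or 1).
// With two classes and an odd k no larger than the training set, no tie is possible.
function knnPredict(trainSet, query, k) {
  const q = tokenSet(query);
  const nearest = trainSet
    .map((ex) => ({ label: ex.label, d: jaccardDistance(tokenSet(ex.text), q) }))
    .sort((x, y) => x.d - y.d)
    .slice(0, k);
  const votes = nearest.filter((n) => n.label === 1).length;
  return votes * 2 > nearest.length ? 1 : 0;
}

// Leave-one-out accuracy: classify each example using all the others.
function looAccuracy(data, k) {
  let correct = 0;
  data.forEach((ex, i) => {
    const rest = data.filter((_, j) => j !== i);
    if (knnPredict(rest, ex.text, k) === ex.label) correct += 1;
  });
  return correct / data.length;
}

// Try only odd candidates and keep the one with the best accuracy.
function bestOddK(data, candidates) {
  let best = { k: null, acc: -1 };
  for (const k of candidates) {
    if (k % 2 === 0) continue; // skip even k to avoid tied votes
    const acc = looAccuracy(data, k);
    if (acc > best.acc) best = { k, acc };
  }
  return best;
}

// Made-up sample data, for illustration only.
const data = [
  { text: "that's one stiff pole", label: 1 },
  { text: "that's so hard", label: 1 },
  { text: "that's a big one", label: 1 },
  { text: "was in the car", label: 0 },
  { text: "went to the park", label: 0 },
  { text: "was at the store", label: 0 },
];
console.log(bestOddK(data, [1, 3, 5]));
```

On a real corpus, k-fold cross-validation over a wider range of odd values would be cheaper than leave-one-out, which rescans the whole set for every example.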

donohoe, over 13 years ago

I'm still looking for a classifier that will take a phrase and determine whether an "In Soviet Russia X Y you" response applies, and what it would be. Anyone?

phreeza, over 13 years ago

Has anyone thrown this on a web server with a simple interface?

mckoss, over 13 years ago

MRIs have shown that humans are able to do this because of a dedicated site in the brain called "Scott's region". Once activated, this linguistic region is constantly searching for linguistic cues, surfacing signals to our conscious thoughts when the cues are strong enough.

tnorthcutt, over 13 years ago

I've seen a Siri proxy TWSS implementation: <a href="http://www.youtube.com/watch?v=p4LamngB070" rel="nofollow">http://www.youtube.com/watch?v=p4LamngB070</a>

jfriedly, over 13 years ago

We made our IRC bot respond to TWSS jokes, but ours was just a dumb match from a set of a few thousand jokes that we scraped from offline. You can look at the code at: <a href="https://github.com/jfriedly/jenni" rel="nofollow">https://github.com/jfriedly/jenni</a> Now that I've taken Stanford's Machine Learning class, though, I think I might just duplicate what this guy did for our bot.

Zelphyr, over 13 years ago

While it seems on the surface like a waste of time (albeit an amusing one), I actually expect this is a great project to learn from because of its use of Bayesian classifiers. In other words, I'm TOTALLY going to be using this on my next project.

mertd, over 13 years ago

The training data is pretty funny. I suppose he collected it from an online TWSS thread.

radikalus, over 13 years ago

Great start -- interesting to watch it go vs the twitter stream. (If you restrict to < 8 word tweets)

loganlinn, over 13 years ago

Looks like this could easily be integrated into a script for Hubot

tibbon, over 13 years ago

I've never had a script do that for me

tlrobinson, over 13 years ago

What exactly is Node.js-specific about this?

aithleyadeno, over 13 years ago

goo.gl/39SLa

TWSS, over 13 years ago

I approve this post.

Twss.js
