科技回声

I work for the professor from the article (but not on TextRunner).<p>We're working on extracting meaning from reviews as well: <a href="http://revminer.com/" rel="nofollow">http://revminer.com/</a><p>At the moment, it only has reviews of Seattle places (restaurants, hotels, etc.) but we're moving it mobile. It's written using node.js and socket.io; I'd be interested in hearing any feedback.

From the article - "For example, to find the names of people who are CEOs within millions of documents, you'd first need to train the software with other examples, such as "Steve Jobs is CEO of Apple, Sheryl Sandberg is CEO of Facebook." "<p>Sheryl Sandberg? Deliberate or honest mistake? :-]

Looks like the directory index was left open. <a href="http://textrunner.cs.washington.edu/" rel="nofollow">http://textrunner.cs.washington.edu/</a>

Awesome: code released under the GPL, with several data sets. Good to see this project (which has been under development for a long time) releasing technology for other people to use.

Read The Web at CMU is also a similar system. <a href="http://rtw.ml.cmu.edu/rtw/" rel="nofollow">http://rtw.ml.cmu.edu/rtw/</a>

Hasn't this been out for like, a long time?

Looks like the directory index was left open. <a href="http://textrunner.cs.washington.edu/" rel="nofollow">http://textrunner.cs.washington.edu/</a>

Awesome: code released under the GPL, with several data sets. Good to see this project (which has been under development for a long time) releasing technology for other people to use.

Read The Web at CMU is also a similar system. <a href="http://rtw.ml.cmu.edu/rtw/" rel="nofollow">http://rtw.ml.cmu.edu/rtw/</a>

Hasn't this been out for like, a long time?

Extracting Meaning from Millions of Pages

6 条评论

Extracting Meaning from Millions of Pages

6 条评论