TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: An easy-to-use Text Analysis API – NLP and Machine Learning

153 点作者 parsabg超过 11 年前

30 条评论

gklitt超过 11 年前
Cool stuff! It&#x27;s nice to see platforms like this which abstract out good algorithms, so that developers can worry about thinking of interesting applications. .Open source libs are even better, but pragmatically speaking, I think these types of platforms probably move faster and get better results.<p>One major competitor (well known for anyone who&#x27;s looked into this stuff) is Alchemy [1]. I tried a New York Times link [2] on Aylien and Alchemy, and Alchemy performed much better -- in fact, Aylien didn&#x27;t even successfully find the article body. I&#x27;m sure you guys will be iterating on improving the algorithms, but just wanted to flag that as a potential turnoff for anyone comparing your website demo with Alchemy.<p>Best of luck!<p>[1] <a href="http://www.alchemyapi.com/products/demo/" rel="nofollow">http:&#x2F;&#x2F;www.alchemyapi.com&#x2F;products&#x2F;demo&#x2F;</a><p>[2] <a href="http://www.nytimes.com/2014/02/18/world/middleeast/bombings-in-syria-force-wave-of-civilians-to-flee.html?hp&amp;_r=0" rel="nofollow">http:&#x2F;&#x2F;www.nytimes.com&#x2F;2014&#x2F;02&#x2F;18&#x2F;world&#x2F;middleeast&#x2F;bombings-...</a>
评论 #7257800 未加载
评论 #7260339 未加载
fnl超过 11 年前
Seen quite a few times (NLP web APIs), and my opinion is that this kind of stuff tends to not be scalable: to be useful, such web APIs have to be able to do entire articles in just a split fraction of a second. Although I am not sure (because of the HN storm the API is down), it does not seem this tool will live up to those expectations, either. In the end, my choice always has been to include&#x2F;wrap an off-the-shelf tool in your own pipeline rather than relying on a external service that might be too slow for end-users and mass mining alike...
评论 #7262275 未加载
drakaal超过 11 年前
This is a much better Noun Phrase &#x2F; Entity extractor.<p><a href="https://www.mashape.com/stremor/noun-entity-extraction-noun-phrase-part-of-speech-tagger-alpha" rel="nofollow">https:&#x2F;&#x2F;www.mashape.com&#x2F;stremor&#x2F;noun-entity-extraction-noun-...</a><p>We don&#x27;t rely on CoreNLP, or NLTK, we have our own sentence disambiguation, and our own part of speech tools. So we are a lot faster.<p>Our other api&#x27;s let you piece together a lot of cool NLP projects with very little code.
mattmcknight超过 11 年前
These sorts of things are typically better offered as libraries, particularly as the training is usually specific to a corpus, or a particular context.<p>It would be a nice to offer a library with a bootstrapped training set.
评论 #7258168 未加载
kenshiro_o超过 11 年前
Unfortunately the web site is still analyzing the example Techcrunch link (it&#x27;s been 3 min already).<p>Is something broken? Maybe you could cache some recurring analyses.
评论 #7257970 未加载
评论 #7257969 未加载
zvanness超过 11 年前
Hey guys! Congrats, NLP is a huge problem that needs as many minds working on it as possible.<p>Just tried a few links:<p><a href="http://arstechnica.com/security/2014/02/dear-asus-router-user-youve-been-pwned-thanks-to-easily-exploited-flaw/" rel="nofollow">http:&#x2F;&#x2F;arstechnica.com&#x2F;security&#x2F;2014&#x2F;02&#x2F;dear-asus-router-use...</a><p><a href="http://blog.algore.com/2011/07/the_great_lakes_are_in_danger.html" rel="nofollow">http:&#x2F;&#x2F;blog.algore.com&#x2F;2011&#x2F;07&#x2F;the_great_lakes_are_in_danger...</a><p>Am I missing something here? It seems like it&#x27;s just parsing text, i&#x27;m not seeing any context(keywords, categories, summaries)<p>edit: It&#x27;s giving fantastic results when pasting the raw text! :)<p>Are you guys using DBpedia? It&#x27;s giving very similar results to a system I was working on in the past: <a href="http://www.zachvanness.com/nanobird_relevancy_engine.pdf" rel="nofollow">http:&#x2F;&#x2F;www.zachvanness.com&#x2F;nanobird_relevancy_engine.pdf</a>
评论 #7257789 未加载
blueblob超过 11 年前
What do you use for the extraction of entities (if you don&#x27;t mind saying)? I entered &quot;The Cat in the Hat&quot; is a good book. It didn&#x27;t recognize any entities. Are you using an ontology for named entity resolution, or just extracting NPs?
评论 #7258740 未加载
analytically超过 11 年前
Another player in this space, from Oxford, UK: <a href="http://apidemo.theysay.io/" rel="nofollow">http:&#x2F;&#x2F;apidemo.theysay.io&#x2F;</a>
imperio59超过 11 年前
It does really poorly analyzing a Wiktionary entry like <a href="http://en.wiktionary.org/wiki/run" rel="nofollow">http:&#x2F;&#x2F;en.wiktionary.org&#x2F;wiki&#x2F;run</a> or with a Wikipedia article like <a href="http://en.wikipedia.org/wiki/Big_O_notation" rel="nofollow">http:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Big_O_notation</a>
bane超过 11 年前
Playing around with it and seemed to have killed it by pasting the text from this WP article (<a href="http://pastebin.com/AtCU7E8H" rel="nofollow">http:&#x2F;&#x2F;pastebin.com&#x2F;AtCU7E8H</a>) in and hitting analyze. It&#x27;s been spinning for a while.<p><i>edit</i> I see from another response that the server room is on meltdown, I&#x27;ll wait for a bit.
crypto5超过 11 年前
Maybe somebody will find useful and relevant my pet project: <a href="https://github.com/crypto5/wikivector" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;crypto5&#x2F;wikivector</a> . It uses machine learning and wikipedia data as training set, supports 10 languages, and completely open source.
syllogism超过 11 年前
Do you publish accuracy figures? Any information about what domains your training data is from?
评论 #7258020 未加载
polskibus超过 11 年前
There&#x27;s more and more of text analysis APIs, would you mind comparing your feature set to something like Textrazor (<a href="http://www.textrazor.com" rel="nofollow">http:&#x2F;&#x2F;www.textrazor.com</a>) or Open Calais?<p>What is special about your project ?
评论 #7259278 未加载
skiplecariboo超过 11 年前
Super nice !<p>This is a very interesting area... Good to see something new apart from Alchemy and opencalais !
评论 #7257843 未加载
cliveowen超过 11 年前
&quot;There was a time when men could roam free on earth, free from concrete and tarmac. Now it&#x27;s all gone to shit.&quot;<p>Classification: arts, culture and entertainment - architecture .(WTF?)<p>Polarity: positive. (Nope)<p>Polarity confidence: 0.9994709276706056. (Well...)<p>Looks pretty rough to me.
评论 #7258381 未加载
评论 #7257773 未加载
评论 #7258416 未加载
ksk超过 11 年前
A bunch of TA libraries (Stemmers, Wordbreakers, etc) ship &quot;free&quot; with Windows that support a ton of different languages. I wish MS would open up the API a bit more.
elwell超过 11 年前
Clearly broken. Say&#x27;s news.ycombinator.com sentiment is &quot;Positive&quot;. All jokes aside, really cool; love the accessibility of the demo.
cglace超过 11 年前
I posted a couple of paragraphs from a financial blog and the tool interpreted SEC to mean Southeastern Conference.
评论 #7258160 未加载
moron4hire超过 11 年前
Should I have not tried it with a 3000 word essay I wrote? It has been beachballing for the last 5 minutes or so.
评论 #7257886 未加载
adventured超过 11 年前
How is this superior to Alchemy?<p><a href="http://www.alchemyapi.com/" rel="nofollow">http:&#x2F;&#x2F;www.alchemyapi.com&#x2F;</a>
评论 #7257828 未加载
mrg3_2013超过 11 年前
I tried bbc.com and nothing shows up. Is it supposed to work on top level links and summarize ?
评论 #7258343 未加载
Houshalter超过 11 年前
I can&#x27;t get it to work, can someone tell me what it&#x27;s supposed to do?
parsabg超过 11 年前
thanks for the feedback folks. FWIW, here&#x27;s the documentation (&#x2F; NLP crash course!): <a href="http://aylien.com/text-api-doc" rel="nofollow">http:&#x2F;&#x2F;aylien.com&#x2F;text-api-doc</a>
iamwithnail超过 11 年前
Annnnnnd that&#x27;s my thesis sorted. Part of it anyway.
afshinmeh超过 11 年前
One of stunning stuffs that I&#x27;ve seen. Good job.
lukasm超过 11 年前
HN - the ultimate DDOS machine
评论 #7257983 未加载
评论 #7258203 未加载
评论 #7257875 未加载
jhbellz超过 11 年前
pretty cool - what languages does your API support?
评论 #7257846 未加载
mm0超过 11 年前
sell it to a bank $$$
jackson1988超过 11 年前
This is incredible!
hamed_r超过 11 年前
Interesting!