Industrial-Strength Natural Language Processing in Python

169 points by federicoponzi about 8 years ago

7 comments

syllogism about 8 years ago
Hi all,

Ironic timing here! We're just preparing the 1.7 release, which has a lot of nice changes, including the option of a much smaller model for English (50 MB), to help people test faster.

This means that if you install the library right now, you'll have to redownload the data once the new version is released.

So, maybe wait until tomorrow to get started? Definitely our most ambivalent front-paging yet!
nickdavidhaynes about 8 years ago
Not sure exactly why this was posted today, since spaCy has been around for at least a couple of years, but spaCy is a great tool, and I have a ton of respect for Matthew Honnibal, the main developer.

Coincidentally, I wrote a blog post [1] that went up just this morning that, in part, compares spaCy with the other giant in the Python NLP ecosystem, NLTK. TL;DR: I think that, right now, the majority of users are better served by spaCy than NLTK.

[1] https://automatedinsights.com/blog/the-python-nlp-ccosystem-a-short-and-very-opinionated-guide
est about 8 years ago
It only supports English and German. However, you can try adding other languages here: https://spacy.io/docs/usage/adding-languages
zeratul about 8 years ago
Ask HN: Could you suggest a fast library for converting documents into a sparse matrix representation (e.g., COO or CSR) in any programming language? I'm guessing C beats most implementations? But there is also the issue of efficient n-gram hashing/indexing.
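
For readers unfamiliar with the terms in the question above, here is a minimal pure-Python sketch of n-gram hashing into CSR arrays. It is an illustration only, not a fast library and not anything spaCy provides: the function names `ngrams` and `docs_to_csr`, the whitespace tokenizer, the CRC32 hash, and the default of 2**18 columns are all assumptions chosen for this example.

```python
import zlib
from collections import Counter


def ngrams(tokens, n_max=2):
    """Yield every word n-gram of length 1..n_max as a space-joined string."""
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            yield " ".join(tokens[i:i + n])


def docs_to_csr(docs, n_features=2**18, n_max=2):
    """Return CSR arrays (data, indices, indptr) built with the hashing trick.

    Each n-gram is mapped to a column by hashing, so no vocabulary is stored.
    Distinct n-grams may collide on the same column; their counts are summed.
    """
    data, indices, indptr = [], [], [0]
    for doc in docs:
        tokens = doc.lower().split()  # naive whitespace tokenizer (assumption)
        counts = Counter(
            zlib.crc32(gram.encode("utf-8")) % n_features
            for gram in ngrams(tokens, n_max)
        )
        for col in sorted(counts):  # keep column indices sorted within a row
            indices.append(col)
            data.append(counts[col])
        indptr.append(len(indices))  # row boundary in data/indices
    return data, indices, indptr
```

If SciPy is available, the three lists can be passed to `scipy.sparse.csr_matrix((data, indices, indptr), shape=(len(docs), n_features))`. scikit-learn's `HashingVectorizer` implements the same general idea with a compiled backend.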
nreece about 8 years ago
Does spaCy have a C# .NET wrapper, or can it be used from other languages/frameworks through a REST API?

I'm using the CoreNLP C# wrapper, so I'm wondering if something similar (.NET Core compatible) is available/doable for spaCy?
deepnotderp about 8 years ago
NLTK is pretty good as well.
snackai about 8 years ago
Really great tool. I currently work on a project that makes use of spaCy. Can't wait to push it into production.