TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Could you recommend language agnostic NLP tools?

3 点作者 assane101将近 7 年前
I just build a spell-checker for Wolof, my native language, using some basic rules and a dictionary I managed to put together. I need your help finding open source tools for NLP that are language agnostic or do not require lot of heavy lifting to adapt to a new locale.<p>Thanks for your help.<p>If you would like to test my spell-checker : https:&#x2F;&#x2F;digibox.info&#x2F;apps&#x2F;experiments&#x2F;wolofix&#x2F;

4 条评论

web64将近 7 年前
Polyglot [0] is a python multilingual NLP toolkit. The quality is not great, but it supports a lot of languages.<p>[0] <a href="https:&#x2F;&#x2F;github.com&#x2F;aboSamoor&#x2F;polyglot" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;aboSamoor&#x2F;polyglot</a>
Pamar将近 7 年前
Far from an expert but I was just discussing this with a former colleague about a specific problem he is considering and I found this: <a href="https:&#x2F;&#x2F;www.r-bloggers.com&#x2F;natural-language-processing-for-non-english-languages-with-udpipe&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.r-bloggers.com&#x2F;natural-language-processing-for-n...</a>
评论 #17470064 未加载
itronitron将近 7 年前
The Lucene API has a lot of language specific tokenizers and analyzers that will help normalize what a term is in the index regardless of language. You can then apply various statistical NLP methods which tend to be more language agnostic.
thecodingmonk将近 7 年前
I work in NLP at a company that actually develops language agnostic solutions, but I&#x27;m not aware of any open-source tool that can do this.<p>Nonetheless, if you can be more specific about what kind of tools you are looking for maybe I can give you some pointers.
评论 #17469941 未加载