TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: What approach would you suggest for Text classification?

1 pointsby gerenukalmost 7 years ago
Hey everyone!<p>We are trying to solve a problem where we need to classify the articles into the right categories.<p>Currently, using a FastText to train a model with 100,000 articles categorized into 600 categories. The loss seems to be converging but the precision is not going up, another thing that requires clarification is that can we use pre-trained Wikipedia English embeddings to categorize text.<p>What would you recommend using FastText or some other algorithm&#x2F;approach towards this problem?<p>Any suggestion&#x2F;ideas would be appreciated.<p>Thanks.

1 comment

smithmayowaalmost 7 years ago
FastText is state of the art when it comes to word embedding due to its ability to generate embedding for even words it has not seen, so perhaps your problem lies in your model&#x27;s architecture, are you using convolution neural nets or just basic feed forward networks I have had great success using CNN for text classification, and in your words pre-processing are you filtering out stopwords(very common words in English that throw confusion to a models ability to correctly classify text&#x27;s).