Ask HN: Good small LLM for text correction?

1 point by jiehong, about 2 months ago
I’ve been playing with local models for text correction, i.e. fixing grammar and spelling in multilingual text.

Most models with fewer than 20B parameters struggle with this, even instruct models. For example, they tend to answer questions found in the text instead of correcting its mistakes. Strict prompting not to do so only partially helps. They also tend to add or remove sentences of their own.

Are there any good small models for text correction, or is this just not a task LLMs are good at?

1 comment

throwawayffffas, about 2 months ago
Have you tried running base models? I would try a base model instead of an instruct model. I would prompt it like this:

# Grammar correction .... :input:A couple of sentences from your text here:input: :output:

And see what it fills in after the output.

> They always tend to add or remove sentences of their own.

You may be running into context size issues here. Try going small, one sentence at a time, and use a new chat for each sentence.

btw: When I say a base model, I mean try using it in text generation mode, not chat mode.

edit: There are models specifically trained for grammar correction, though; for the multilingual case you may have to train one. See this explanation of how someone did it for a Google model from 2019: https://deeplearninganalytics.org/nlp-building-a-grammatical-error-correction-model/
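
A minimal sketch of the suggestion above, assuming a completion-style (not chat) model: split the text into sentences, wrap each one in the `:input:`/`:output:` prompt, and send every sentence as a fresh request. The `complete` callable is hypothetical and stands in for whatever text-generation backend you use; the line breaks in the template and the naive sentence splitter are also assumptions, not something the comment specifies.

```python
import re
from typing import Callable, List

# Assumed layout of the prompt from the comment; the exact line breaks are a guess.
PROMPT_TEMPLATE = "# Grammar correction\n:input:{sentence}:input:\n:output:"


def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def build_prompt(sentence: str) -> str:
    """Wrap a single sentence in the completion-style prompt."""
    return PROMPT_TEMPLATE.format(sentence=sentence)


def correct_text(text: str, complete: Callable[[str], str]) -> str:
    """Correct text one sentence at a time, one independent request per sentence.

    `complete` is a hypothetical function: it takes a prompt and returns the
    raw continuation produced by a base model in text-generation mode.
    """
    corrected = []
    for sentence in split_sentences(text):
        completion = complete(build_prompt(sentence)).strip()
        # A base model may keep generating past the answer, so keep only the
        # first line; fall back to the original sentence if nothing came back.
        first_line = completion.splitlines()[0].strip() if completion else sentence
        corrected.append(first_line)
    return " ".join(corrected)
```

Because each sentence goes out in its own request, the model never sees enough surrounding text to start answering questions or inventing extra sentences, which is the failure mode described in the question. A splitter this simple will mishandle abbreviations and languages without Latin punctuation, so multilingual text would likely need a proper sentence segmenter.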