Fine-Tuning Transformers for NLP

67 points by dylanbfox almost 4 years ago

3 comments

uniqueuid almost 4 years ago
For anyone looking to fine-tune transformers with less work, there is the FARM project (https://github.com/deepset-ai/FARM), which has some more or less ready-to-go configurations (classification, question answering, NER, and a couple of others). It's really almost "plug in a CSV and run".

By the way, a pet peeve is sentiment detection. It's a useful method, but please be aware that it does not measure "sentiment" in the way one would normally think, and that what it measures varies strongly across methods (https://www.tandfonline.com/doi/abs/10.1080/19312458.2020.1869198).
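To make the "plug in a CSV and run" idea concrete, here is a minimal sketch of that workflow. It is written against the Hugging Face transformers/datasets APIs rather than FARM's own interface (which differs); the file name, column names, checkpoint, and hyperparameters are illustrative assumptions.

```python
# Sketch of a "CSV in, classifier out" fine-tuning loop using Hugging Face
# transformers + datasets. train.csv is assumed to have "text" and "label"
# columns; the checkpoint and hyperparameters are illustrative.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

dataset = load_dataset("csv", data_files="train.csv")["train"]
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

def tokenize(batch):
    # Turn the raw text column into token IDs and attention masks.
    return tokenizer(batch["text"], truncation=True, padding="max_length")

dataset = dataset.map(tokenize, batched=True)

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=3),
    train_dataset=dataset,
)
trainer.train()
```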
whimsicalism almost 4 years ago
Hm. I read this expecting a more in-depth discussion of best practices for fine-tuning massive transformers while avoiding catastrophic forgetting, e.g.:

* How should you select the learning rate?

* What tasks are best for fine-tuning on small amounts of data? etc.

Instead, this seems mostly to just run through the implementation of ML/DL 101: the loss function for binary classification, helper functions to load data, etc.
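On the learning-rate question this comment raises, one widely used mitigation for catastrophic forgetting is discriminative learning rates: a small rate for the pretrained encoder so it drifts slowly, and a larger one for the freshly initialized head. A minimal PyTorch sketch, with illustrative values:

```python
# Two parameter groups: the pretrained BERT body fine-tunes at a small
# learning rate, while the randomly initialized classification head learns
# faster. The checkpoint and learning rates are illustrative assumptions.
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

optimizer = torch.optim.AdamW([
    {"params": model.bert.parameters(), "lr": 2e-5},        # pretrained body
    {"params": model.classifier.parameters(), "lr": 1e-3},  # new head
])
```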
visarga almost 4 years ago
The same transformer diagram from the original paper, replicated everywhere. Nobody's got time for redrawing.

BTW, take a look at the "sentence-transformers" library, a nice interface on top of Hugging Face for this kind of operation (reusing, fine-tuning): https://www.sbert.net/
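The sentence-transformers library recommended here reduces "reuse a pretrained model for embeddings" to a couple of lines. A minimal usage sketch; the checkpoint name is an assumption (any SBERT model listed at https://www.sbert.net/ works):

```python
# Encode two sentences with a pretrained SBERT model and compare them.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative checkpoint
embeddings = model.encode(["Fine-tuning is cheap.", "Pretraining is expensive."])
print(util.cos_sim(embeddings[0], embeddings[1]))  # cosine similarity
```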