The Pile: An 800GB Dataset of Diverse Text for Language Modeling [pdf]
1 point by nixtaken, over 4 years ago
1 comment
dang, over 4 years ago:
https://news.ycombinator.com/item?id=25607809