TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Home
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
2 points
by
andyxor
almost 4 years ago
1 comment
andyxor
almost 4 years ago
<a href="https://pile.eleuther.ai/" rel="nofollow">https://pile.eleuther.ai/</a>