> Reddit is a treasure trove of exactly the kind of training data that always-hungry, large language model AI companies need thanks to its long-history, huge user footprint, and active, crowd-sourced creation of written material.<p>And as a source of training data for LLMs, the way that Reddit users think and write will become an important source of mentality for generative AI. At least we may able to trace the sources for biases.