Discussed at length on HN 3 days ago:<p><a href="https://news.ycombinator.com/item?id=41046773">https://news.ycombinator.com/item?id=41046773</a>
Is the Llama 3.1 model open source?
Yes.<p>Really?
No.<p><i>Training Data = Source Code</i><p>Meta released Llama 3.1 under the "open source" label, but the training data has not been disclosed. In AI and deep learning, the training data is the "source code," while the model is like an app. By that standard, Llama is not truly open source; what has been released is closer to a free language model app.<p>When Meta's Zuckerberg released Llama, he mentioned Linux and posted "Open Source AI Is the Path Forward".<p>Zuckerberg looks cool and we appreciate him, but we should not be under the illusion that Llama is open source. To be open source, the training data must be disclosed. (Training data size: approximately 15 trillion tokens)