They claim several times to be “truly open”, but I don’t see anything about open-sourcing the training code. The inference code isn’t that interesting. What we need is total transparency on how the model weights are produced, since otherwise it’s hard to trust the biases of a model. As far as I know, the only truly open model is AI2’s OLMo, and even they aren’t totally transparent about how they produced their training data set, which includes curation and filtering by “safety and ethics” people:<p><a href="https://blog.allenai.org/hello-olmo-a-truly-open-llm-43f7e7359222" rel="nofollow">https://blog.allenai.org/hello-olmo-a-truly-open-llm-43f7e73...</a><p>Until the training data sets and source code are released under an OSI-approved license, Snowflake should stop with the open-washing.