Be sure to read the warning in their repo: <a href="https://github.com/openlm-research/open_llama#loading-the-weights-with-hugging-face-transformers">https://github.com/openlm-research/open_llama#loading-the-we...</a><p>> Please note that it is advised to avoid using the Hugging Face fast tokenizer for now, as we’ve observed that the auto-converted fast tokenizer sometimes gives incorrect tokenization
This is great. Based on the throughput of 2,200 tokens/sec and the 1,000,000,000,000 tokens used for training, this represents at least $183k worth of compute (based on the three-year committed-use rate). And now we can have it for free!
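The estimate above can be sketched as a quick back-of-envelope calculation. Note the assumptions here are not from the comment: it treats 2,200 tokens/sec as aggregate throughput, and the ~$1.45/hour rate is a hypothetical committed-use price chosen only to illustrate how the ~$183k figure could fall out.

```python
# Back-of-envelope training-cost estimate.
# Hypothetical hourly rate; actual committed-use pricing varies by accelerator.
tokens = 1_000_000_000_000    # 1T training tokens
tokens_per_sec = 2200         # reported aggregate throughput (assumed)
hourly_rate = 1.45            # hypothetical $/hour, committed-use pricing

hours = tokens / tokens_per_sec / 3600
cost = hours * hourly_rate
print(f"{hours:,.0f} accelerator-hours, ~${cost:,.0f}")
```

At roughly 126k accelerator-hours, even modest hourly pricing puts the total well into six figures.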