Looks like quite a solid fine-tune, going by the benchmark results. I wish they had added a benchmark comparison to GPT-4 Turbo as well.<p>From the Hugging Face page:<p>Model Details<p>We introduce Llama3-ChatQA-1.5, which excels at conversational question answering (QA) and retrieval-augmented generation (RAG). Llama3-ChatQA-1.5 is built using the training recipe from ChatQA (1.0), and it is built on top of Llama-3 base model. Additionally, we incorporate more conversational QA data to enhance its tabular and arithmetic calculation capability. Llama3-ChatQA-1.5 has two variants: Llama3-ChatQA-1.5-8B and Llama3-ChatQA-1.5-70B. Both models were originally trained using Megatron-LM; we converted the checkpoints to Hugging Face format.