TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Gorilla: LLMs Connected to APIs Explained

9 pointsby CShortenalmost 2 years ago
Hey everyone, I am SUPER excited to present a paper summary video of &quot;Gorilla: Large Language Models connected to Massive APIs&quot; by Patil et al. 2023!<p>LLMs have been supercharged by connecting them with external tools. An external tool could be a search engine, code executor, calculator, calendar, email, CRM, and many others! Although GPT-4 is fairly strong at formatting API requests zero-shot (without additional training), Gorilla shows that specialized training can outperform it significantly! In addition to the accuracy performance, this is also achievable with a much cheaper 7 billion parameter model, derived by fine-tuning the Meta AI LlaMA-2 7B checkpoint!! There are all sorts of interesting details about this paper covered in the video, from the APIBench dataset to Self-Instruct training data generation, Retrieval-Aware Training, and the miscellaneous details of Gorilla! I hope you enjoy the paper summary video! As always I am more than happy to answer any questions or discuss any ideas you have related to the content in the video! P.S. Please stay tuned for Weaviate Gorilla! https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=LkV5DTRNxAg

1 comment

riguer1almost 2 years ago
Great video, Thanks for this overview!
评论 #37237811 未加载