Gorilla: LLMs Connected to APIs Explained

9 points by CShorten, almost 2 years ago
Hey everyone, I am SUPER excited to present a paper summary video of "Gorilla: Large Language Model Connected with Massive APIs" by Patil et al. (2023)!

LLMs have been supercharged by connecting them with external tools. An external tool could be a search engine, code executor, calculator, calendar, email, CRM, and many others! Although GPT-4 is fairly strong at formatting API requests zero-shot (without additional training), Gorilla shows that specialized training can outperform it significantly! On top of the accuracy gains, this is achievable with a much cheaper 7 billion parameter model, derived by fine-tuning the Meta AI LLaMA-2 7B checkpoint!

The video covers all sorts of interesting details about this paper, from the APIBench dataset to Self-Instruct training data generation, Retrieval-Aware Training, and other miscellaneous details of Gorilla! I hope you enjoy the paper summary video! As always, I am more than happy to answer any questions or discuss any ideas you have related to the content in the video!

P.S. Please stay tuned for Weaviate Gorilla!

https://www.youtube.com/watch?v=LkV5DTRNxAg

1 comment

riguer1, almost 2 years ago
Great video, thanks for this overview!