There are a lot of these youtube-openai api projects, but I really don't want to open another page to interact with youtube videos. This is a Chrome extension to perform tasks on YouTube videos without leaving your current tab.<p>This extension simply pulls down a transcript for the video you are currently viewing and sends this to the OpenAI API with your task request.<p>"Generate a vegetarian version of this recipe."<p>"Extract the materials list for this project."<p>"What was the book they recommended?"<p>This extension does not use a speech to text model to transcribe the video or an image recognition model to actually pass along information about what took place in the video. Automatically generated YouTube transcripts are not great quality, but they tend to be totally fine for the lightweight use cases when you just have a simple question on a video with dialogue.<p>gpt-3.5-turbo-1106 with the 16k token context window is used by default for videos under about ten minutes. For longer videos, the gpt-4-1106-preview model with a 128k context window is automatically selected. You should be able to use this for videos of to about four hours in length, but this is highly dependent on the volume of dialogue.<p>Best thought of as a very simple starting point for configuring a diy extension of your own. I actually use this on a regular basis now! Its nice for just getting around ads and not suffering through long youtube videos sent by well-meaning friends. Would love to see how other people configure this to be even better. Fork away!