From Laptop to Cloud: Scaling YouTube Video Transcription with LLMs

My experiments around Summarizing YouTube Videos with LLMs have been working well on my Mac laptop, but I’d really like to run it as a service on a regular schedule. Running the OpenAI Whisper transcription process locally requires more than 1GB that my favorite Digital Ocean droplets provide. Instead I’ve added an option to run it directly against the OpenAI Transcriptions AI (the official OpenAI Python library makes it very easy). To use the online API you’ll need an API key set up either as pay-as-you-go or with a ChatGPT Plus subscription (which is what I have). After I’ve run…

Continue Reading →

Summarizing YouTube Videos with LLMs

Over the holiday break I was thinking about taking transcriptions of YouTube videos and running them through an LLM for summaries. My inspiration was the dust up around Honey coupon code browser extension. Most of the news pointed to MegaLag’s video Exposing the Honey Influencer Scam. It’s a 23 minute video which isn’t too bad, but I wondered whether a summary would help here? Python Solution I built a simple pipeline in Python that takes a list of YouTube channels and produces a summary email for each published video: YouTube channels have a standard RSS feed which makes this first…

Continue Reading →