From Laptop to Cloud: Scaling YouTube Video Transcription with LLMs

My experiments around Summarizing YouTube Videos with LLMs have been working well on my Mac laptop, but I’d really like to run the whole thing as a service on a regular schedule. Running the OpenAI Whisper transcription process locally requires more than the 1GB of memory that my favorite DigitalOcean droplets provide. Instead, I’ve added an option to run it directly against the OpenAI Transcriptions API (the official OpenAI Python library makes this very easy).
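For anyone curious what the API route looks like, here is a minimal sketch rather than my exact code. The function name `transcribe_with_api` and the optional `client` parameter are made up for illustration; the `audio.transcriptions.create` call and the `whisper-1` model name come from the official OpenAI Python library (version 1.x), which reads the key from the `OPENAI_API_KEY` environment variable.

```python
def transcribe_with_api(audio_path, client=None, model="whisper-1"):
    """Send an audio file to the hosted transcription endpoint and return its text.

    `client` is a hypothetical injection point so the function can be
    exercised without network access; by default a real OpenAI client is built.
    """
    if client is None:
        # Imported lazily so the local-Whisper code path does not need this package.
        from openai import OpenAI

        client = OpenAI()  # picks up OPENAI_API_KEY from the environment

    # The endpoint expects a binary file handle, not a path string.
    with open(audio_path, "rb") as audio_file:
        result = client.audio.transcriptions.create(model=model, file=audio_file)

    # With the default response format, the transcript is on the `.text` attribute.
    return result.text
```

With the key exported, `transcribe_with_api("episode.mp3")` (a placeholder file name) should return the transcript as a plain string, which can then go to the summarization step. Nothing heavy runs on the droplet itself, so the 1GB limit stops being a problem.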

To use the online API you’ll need an OpenAI API key with pay-as-you-go billing set up; API usage is billed separately from a ChatGPT Plus subscription (which is what I have). After I’ve run this for a while I’ll be able to tell what the usage costs and whether upgrading to a larger droplet size would make more sense.
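Until I have real usage numbers, a back-of-the-envelope comparison is easy to sketch. Everything here is an assumption for illustration: the function names are hypothetical, and the per-minute rate, audio volume, and droplet prices in the example call are placeholders to be replaced with current figures from the OpenAI and DigitalOcean pricing pages.

```python
def monthly_api_cost(audio_minutes_per_day, price_per_minute, days=30):
    """Estimate monthly transcription spend for per-minute API billing."""
    return audio_minutes_per_day * days * price_per_minute


def api_is_cheaper(audio_minutes_per_day, price_per_minute, droplet_upgrade_per_month):
    """True if paying for the API costs less than the extra monthly droplet fee."""
    return monthly_api_cost(audio_minutes_per_day, price_per_minute) < droplet_upgrade_per_month


# Placeholder inputs: 60 minutes of audio per day, an assumed rate of
# 0.006 USD per minute, and an assumed 12 USD/month extra for a bigger droplet.
estimate = monthly_api_cost(60, 0.006)
print(f"Estimated API spend: ${estimate:.2f}/month")
print("API is cheaper:", api_is_cheaper(60, 0.006, 12.0))
```

The point is just that the break-even depends on how many minutes of video get transcribed each day, which is exactly the number I don’t have yet.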

In any case this is working fine and I have it running hourly in the cloud!

Sample email from a NetworkChuck video:

[Image: sample email generated by summarizing the conversation in a YouTube video]