PodcastBrain: Audio to Blog Generator

Turns long-form podcast episodes into SEO-optimized blog posts automatically. Transcribes, summarizes, structures, and publishes, leaving the host to do nothing but record.

Year: 2023
Role: Builder & Designer
Type: Python Pipeline + WP
Timeline: 5 weeks

The Idea

Most podcasters publish an episode and never repurpose it. But a 45-minute episode is a goldmine of search-friendly material if you can transcribe it cleanly and restructure it into something search engines can index.

PodcastBrain does exactly that. You drop an MP3 in a folder. A few minutes later there's a fully structured, SEO-optimized blog post in WordPress, with timestamped chapter markers linking back to the audio.
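As a rough illustration of the timestamped chapter markers, here is a minimal sketch that turns a chapter start time into a link back to the audio. It assumes the link uses a Media Fragments style `#t=<seconds>` suffix on the audio URL; the function names, the markdown link shape, and the example URL are hypothetical and not taken from the project.

```python
def format_timestamp(seconds: int) -> str:
    """Render whole seconds as M:SS, or H:MM:SS once past an hour."""
    hours, remainder = divmod(seconds, 3600)
    minutes, secs = divmod(remainder, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes}:{secs:02d}"


def chapter_link(title: str, start_seconds: int, audio_url: str) -> str:
    """Build a markdown chapter marker that jumps to `start_seconds`.

    Assumption: the player honors a `#t=<seconds>` fragment on the audio URL.
    """
    label = format_timestamp(start_seconds)
    return f"[{label}]({audio_url}#t={start_seconds}) {title}"


if __name__ == "__main__":
    # Hypothetical episode URL, for illustration only.
    print(chapter_link("Pricing your first product", 754, "https://example.com/ep12.mp3"))
```

Keeping the raw second count in the link and formatting only the visible label means the marker stays exact even if the display format changes later.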

Pipeline

  1. Watch a folder for new audio files (synced from the host's RSS feed).
  2. Transcribe with OpenAI Whisper and add speaker diarization.
  3. Summarize each segment with GPT-4 and identify natural chapter breaks.
  4. Generate an SEO title, meta description, and FAQ block from the transcript.
  5. Push the final markdown to WordPress via the REST API.
  6. Notify the host on Slack with a draft preview link.
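
The steps above can be sketched as a chain of stages that each take and return one episode record. This is a minimal outline under stated assumptions, not the project's actual code: `Episode`, `find_new_audio`, and `run_pipeline` are hypothetical names, the folder watch is reduced to a single directory scan, and the real stages (Whisper, GPT-4, WordPress, Slack) are left as callables to be plugged in.

```python
from dataclasses import dataclass, field
from pathlib import Path
from typing import Callable

# Assumed set of audio extensions; the write-up only mentions MP3.
AUDIO_SUFFIXES = {".mp3", ".m4a", ".wav"}


@dataclass
class Episode:
    """Hypothetical record passed from stage to stage."""
    audio_path: Path
    transcript: list[dict] = field(default_factory=list)  # e.g. {"start", "end", "text"}
    chapters: list[dict] = field(default_factory=list)
    seo: dict = field(default_factory=dict)
    draft_url: str = ""


Stage = Callable[[Episode], Episode]


def find_new_audio(folder: Path, seen: set[Path]) -> list[Path]:
    """Step 1: return audio files in `folder` that have not been seen yet."""
    new_files = sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in AUDIO_SUFFIXES and p not in seen
    )
    seen.update(new_files)
    return new_files


def run_pipeline(audio_path: Path, stages: list[Stage]) -> Episode:
    """Steps 2-6: run each stage in order, passing the episode along."""
    episode = Episode(audio_path=audio_path)
    for stage in stages:
        episode = stage(episode)
    return episode
```

In a setup like the one described, each stage would typically run as a background task so a slow transcription does not block the next file; the plain loop here only shows the ordering and the shared record.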

The Stack

Python 3.11
OpenAI Whisper
GPT-4
FastAPI
Celery
Redis
WordPress REST API
Slack API

What Worked

The single best decision was treating the transcript as the source of truth and never throwing it away. Every downstream artifact (summary, title, FAQ) references specific timestamps in the transcript, so when a host pushes back on a generated post, we can show them exactly which 30-second clip the claim came from.
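
One way to picture this traceability is a data model where every generated artifact stores the indexes of the transcript segments it was derived from. The sketch below is an assumption about how that could look, not the project's schema: `Segment`, `Artifact`, and `evidence_for` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One transcript segment; times are seconds from the start of the audio."""
    start: float
    end: float
    speaker: str
    text: str


@dataclass(frozen=True)
class Artifact:
    """A generated piece of the post, with pointers back into the transcript."""
    kind: str                      # e.g. "summary", "title", "faq"
    text: str
    segment_ids: tuple[int, ...]   # indexes into the transcript list


def evidence_for(artifact: Artifact, transcript: list[Segment]) -> list[tuple[float, float, str]]:
    """Return (start, end, text) for every segment the artifact cites."""
    return [
        (transcript[i].start, transcript[i].end, transcript[i].text)
        for i in artifact.segment_ids
    ]


if __name__ == "__main__":
    # Made-up transcript content, for illustration only.
    transcript = [
        Segment(0.0, 28.0, "host", "Welcome back to the show."),
        Segment(754.0, 781.5, "guest", "We doubled the price and churn did not move."),
    ]
    claim = Artifact("faq", "Did raising prices increase churn?", (1,))
    for start, end, text in evidence_for(claim, transcript):
        print(f"{start:.0f}s-{end:.0f}s: {text}")
```

Because the artifact holds indexes rather than copied text, the transcript stays the single source of truth and a disputed claim can be resolved by replaying the cited time range.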

Results

What It Moved.

Average pipeline cost per 45-min episode: $1.20
Cost reduction vs. previous freelance workflow: −98%
Average pipeline time from drop to draft: ~7 min
Organic traffic lift to the show's blog in 90 days: +62%