Open-source podcast ad removal system that uses Whisper transcription and the Claude API (or local LLMs via Ollama) to automatically detect and strip host-read sponsorships, dynamically inserted ads, and pre/mid/post-roll breaks from podcast feeds. Serves modified RSS feeds compatible with any podcast app—features multi-pass detection, audio fingerprinting, and cross-episode pattern learning.
I have been using this tool daily for the last 3 months. In general, I feel that it works pretty well. Unfortunately, I don't have a real measurement of accuracy, but it feels like it works ~95% of the time.