Longform Video Editor - YouTube Pipeline
Full pipeline for editing a YouTube long-form video: transcribe with Whisper, cut silences, sync dual cameras, compose 1920x1080 with PiP and captions, render with Remotion.
Edit a YouTube long-form video using a fully local pipeline: Whisper transcription, silence cutting, dual-camera sync, and Remotion rendering. No paid editor or cloud service needed.
Pipeline overview
- Transcribe the raw footage with faster-whisper (local, CPU or GPU)
- Detect and cut silences and filler pauses
- Sync secondary camera (OBS/face-cam) to primary
- Compose 1920x1080 with PiP + auto-captions + zoom effects
- Render with Remotion
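The silence-cutting step can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the pipeline's actual script: it assumes word-level timestamps are already available as (start, end) pairs in seconds (faster-whisper can produce these when transcription is run with word_timestamps=True). The function name keep_segments and the max_gap and pad parameters are hypothetical, chosen only for this example.

```python
from typing import List, Optional, Tuple

Span = Tuple[float, float]  # (start_seconds, end_seconds)


def keep_segments(
    words: List[Span],
    max_gap: float = 0.6,   # hypothetical threshold: longer gaps count as silence
    pad: float = 0.1,       # hypothetical padding kept around each spoken range
    duration: Optional[float] = None,
) -> List[Span]:
    """Merge word timings into ranges to keep, dropping gaps longer than max_gap."""
    if 2 * pad >= max_gap:
        # Otherwise the padding of two neighbouring ranges could overlap.
        raise ValueError("pad must be less than half of max_gap")

    merged: List[List[float]] = []
    for start, end in sorted(words):
        if merged and start - merged[-1][1] <= max_gap:
            # Short pause: extend the current range instead of cutting.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            # Long pause (or first word): start a new range.
            merged.append([start, end])

    result: List[Span] = []
    for start, end in merged:
        padded_start = max(0.0, start - pad)
        padded_end = end + pad
        if duration is not None:
            padded_end = min(duration, padded_end)
        result.append((padded_start, padded_end))
    return result


if __name__ == "__main__":
    sample = [(0.5, 1.0), (1.25, 2.0), (4.0, 4.5)]
    for start, end in keep_segments(sample, max_gap=0.5, pad=0.125, duration=4.6):
        print(f"keep {start:.3f}s to {end:.3f}s")
```

The returned ranges could then drive an ffmpeg trim or a list of Remotion sequences. Working from word timestamps rather than audio levels is a design choice for this sketch; an amplitude-based detector would be an equally reasonable alternative.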
Requirements
- Python 3.9+ with faster-whisper, ffmpeg, numpy
- Node.js 18+ with Remotion
- Primary camera footage (main.mp4) and optional face-cam (facecam.mp4)
- At least 8GB RAM for 1080p rendering
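A quick preflight check for the requirements above might look like the sketch below. It rests on several assumptions: that "ffmpeg" means the ffmpeg binary on PATH, that Node.js is exposed as a node executable, and that the Python packages import as faster_whisper and numpy. The function name check_requirements is hypothetical. The sketch does not verify the Node.js version, the Remotion install, or available RAM.

```python
import importlib.util
import shutil
import sys
from typing import Dict


def check_requirements() -> Dict[str, bool]:
    """Report which of the listed requirements appear to be present."""
    return {
        "python>=3.9": sys.version_info >= (3, 9),
        # Assumption: ffmpeg and node are expected as executables on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        "node": shutil.which("node") is not None,
        # find_spec returns None when a top-level package is not installed.
        "faster_whisper": importlib.util.find_spec("faster_whisper") is not None,
        "numpy": importlib.util.find_spec("numpy") is not None,
    }


if __name__ == "__main__":
    for name, present in check_requirements().items():
        print(f"{'ok' if present else 'MISSING':8}{name}")
```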