Background:
I'm a video editor transitioning from Premiere Pro to Descript. I produce both podcast-style interviews and short-form scripted theatrical content, both requiring multicam editing with 3+ camera angles plus separate audio tracks.
Current Problem:
When I use create_sequence to combine multiple camera files recorded at the same time, the automatic sync often fails. When it does, there are no manual tools to correct the sync, so multicam workflows are impossible without pre-syncing in an external NLE.
Specific Issues:
No Manual Sync Adjustment
When automatic sync fails, there's no way to manually adjust individual track offsets
Cannot drag tracks in timeline to align them (unlike Premiere Pro)
No numeric offset input fields
No Sync Refinement Tools
Cannot see waveform comparison between tracks
No visual feedback on sync confidence/quality
No "re-analyze sync" button to retry with different parameters
Sequence Workflow Limitations
Individual tracks within a sequence aren't accessible for timing adjustments
Cannot temporarily view/solo individual angles for sync verification
Playback always plays all tracks at once, so there is no way to check or correct sync one track at a time
Missing Sync Point Detection
Should detect clapperboard audio spikes automatically
Should allow manual marking of sync points (like syncing on clip markers in Premiere Pro)
Requested Features:
Priority 1: Manual Sync Controls
Ability to adjust individual track start offsets (in frames or seconds)
Visual waveform overlay for comparing audio tracks
Numeric input fields for precise offset adjustment
Ability to drag tracks horizontally in timeline for visual alignment
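To illustrate what I mean by precise numeric offsets, here is a minimal Python sketch of the arithmetic involved. It is not based on Descript's internals: the function names, the track names, and the 29.97 fps rate are all assumptions made for the example. It uses exact fractions so that frame-based offsets do not drift at non-integer frame rates.

```python
from fractions import Fraction

def frames_to_seconds(frames: int, fps: Fraction) -> Fraction:
    """Convert a frame offset to an exact time offset in seconds."""
    return Fraction(frames) / fps

def seconds_to_frames(seconds: float, fps: Fraction) -> int:
    """Round a time offset in seconds to the nearest whole frame."""
    return round(Fraction(seconds) * fps)

def apply_offsets(track_starts, offsets):
    """Return new start times, shifting each track by its user-entered offset.

    Tracks without an entry in `offsets` are left where they are.
    """
    return {
        name: start + offsets.get(name, Fraction(0))
        for name, start in track_starts.items()
    }

# Hypothetical example: three tracks, camera B nudged 12 frames later.
NTSC = Fraction(30000, 1001)  # 29.97 fps, an assumed project frame rate
starts = {"cam_a": Fraction(0), "cam_b": Fraction(0), "recorder": Fraction(0)}
nudged = apply_offsets(starts, {"cam_b": frames_to_seconds(12, NTSC)})
print(float(nudged["cam_b"]))  # offset of camera B in seconds
```

Accepting the offset in either frames or seconds and converting between the two is all the numeric input field would need to do.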
Priority 2: Improved Automatic Sync
Sync confidence indicator (percentage or quality score)
Option to retry sync with different sensitivity settings
Automatic detection of clapperboard/slate audio spikes
Manual sync point marking (user clicks sync frame on each clip)
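As a rough illustration of the clapperboard detection requested above, the sketch below finds the first loud transient in each track and derives an offset from the two timestamps. This is a deliberately simple threshold approach that I am assuming for the example, not a description of how Descript or any other NLE does it. The function names and the `ratio` parameter are hypothetical.

```python
from typing import Optional, Sequence

def find_clap(samples: Sequence[float], sample_rate: int,
              ratio: float = 8.0) -> Optional[float]:
    """Return the time in seconds of the first sample whose magnitude is at
    least `ratio` times the track's mean magnitude, or None if there is none."""
    if not samples:
        return None
    mean_level = sum(abs(s) for s in samples) / len(samples)
    if mean_level == 0:
        return None
    threshold = ratio * mean_level
    for i, s in enumerate(samples):
        if abs(s) >= threshold:
            return i / sample_rate
    return None

def offset_from_claps(ref: Sequence[float], other: Sequence[float],
                      sample_rate: int) -> Optional[float]:
    """Seconds to add to `other`'s start time so its clap lines up with `ref`'s.

    Returns None when either track has no detectable spike, which is the
    case where manual sync point marking would have to take over.
    """
    t_ref = find_clap(ref, sample_rate)
    t_other = find_clap(other, sample_rate)
    if t_ref is None or t_other is None:
        return None
    return t_ref - t_other

# Synthetic example: quiet room tone with a clap at 0.30 s and 0.45 s.
rate = 1000
cam_a = [0.01] * 1000
cam_a[300] = 1.0
cam_b = [0.01] * 1000
cam_b[450] = 1.0
shift = offset_from_claps(cam_a, cam_b, rate)
print(shift)  # negative: camera B has to move earlier
```

A real implementation would need to cope with loud speech and handling noise before the slate, which is exactly why a manual fallback matters.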
Priority 3: Better Sequence Workflow
Ability to expand/collapse sequence to see individual tracks
Solo/mute individual tracks within sequence for verification
Scrub playback without auto-playing all tracks
Visual indicators showing which tracks are in/out of sync
Use Cases This Would Enable:
Scripted Content: Multi-camera theatrical scenes with separate audio recorder (common in indie film production)
Podcast Interviews: Multiple camera angles with dedicated audio interface
Live Events: Concert footage, panel discussions, conferences with multiple cameras
Educational Content: Instructor + screen capture + close-up camera
Why This Matters:
Descript's text-based editing is revolutionary, but multicam sync is a fundamental video editing capability. Every professional NLE (Premiere Pro, Final Cut Pro, and even the free version of DaVinci Resolve) has robust multicam sync tools.
Without these features, Descript cannot be used for professional multicam workflows, forcing users to:
Pre-sync in external NLEs (defeating Descript's "all-in-one" value proposition)
Abandon Descript for multicam projects entirely
Adding these features would make Descript viable for a much broader range of professional video production, not just podcast editing.
Technical Feasibility:
These features already exist in other NLEs and are technically straightforward:
Waveform-based audio alignment (for example, cross-correlation) is mature technology
Manual offset adjustment is basic timeline functionality
Visual waveform comparison is standard in audio editing tools
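To back up the feasibility claim, here is a minimal sketch of waveform-based sync using normalized cross-correlation, written in plain Python so it runs without extra libraries. I am assuming this technique for illustration and do not know what Descript uses internally. `estimate_offset` and `max_lag` are hypothetical names. The peak correlation score doubles as a simple sync confidence value between -1 and 1.

```python
import math
import random
from typing import Sequence, Tuple

def estimate_offset(ref: Sequence[float], other: Sequence[float],
                    max_lag: int) -> Tuple[int, float]:
    """Return (lag, score) for the best alignment of `other` against `ref`.

    `lag` is the number of samples to shift `other` (positive = later,
    negative = earlier). `score` is the normalized correlation at that lag.
    """
    best_lag, best_score = 0, -1.0
    for lag in range(-max_lag, max_lag + 1):
        lo = max(0, lag)                        # first overlapping ref index
        hi = min(len(ref), len(other) + lag)    # one past the last one
        if hi - lo < 2:
            continue
        dot = e_ref = e_other = 0.0
        for i in range(lo, hi):
            a, b = ref[i], other[i - lag]
            dot += a * b
            e_ref += a * a
            e_other += b * b
        denom = math.sqrt(e_ref * e_other)
        score = dot / denom if denom > 0 else 0.0
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score

# Synthetic example: `other` is `ref` delayed by 25 samples.
rng = random.Random(0)
ref = [rng.uniform(-1.0, 1.0) for _ in range(400)]
other = [0.0] * 25 + ref[:-25]
lag, score = estimate_offset(ref, other, max_lag=50)
sample_rate = 1000  # assumed, only used to convert samples to seconds
print(lag, lag / sample_rate, round(score, 3))
```

This brute-force loop is slow on long recordings, and production tools typically use FFT-based correlation on downsampled audio, but the idea is the same. A low peak score is the natural trigger for showing a "sync may be wrong" indicator and offering the manual controls.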
Business Impact:
This would unlock Descript for:
Film/video production professionals
Corporate video teams
Educational content creators
Live event videographers
Anyone shooting multicam content