As a content producer, one of my most important workflows starts after recording and doing a light cleanup edit in Descript.
Once the composition transcript is ready, I’d like to be able to retrieve that transcript through the API.
Ideally, I could call an endpoint like: GET /v1/projects/{project_id}/compositions/{composition_id}/transcript
This would let me use Descript as the source of truth for the edited composition, then pass the transcript into downstream workflows for clip planning, social posts, newsletters, YouTube descriptions, show notes, and other content.
Useful options would include: - plain text transcript - speaker labels - timestamps - word-level or sentence-level timing if available - transcript readiness/status - VTT/SRT export formats
This would be especially valuable for agentic workflows where another AI system analyzes the transcript for downstream content.