Feature Requests

Underlord doesn't see Drive Comments?
Feature Request: Agent Underlord Access to Comments & Notes Problem: Currently, Agent Underlord cannot see comments, notes, or annotations that users add within Descript (DRIVE) projects. This creates a workflow gap where users must manually transcribe their editing notes into the chat instead of having the agent read them directly. Requested Feature: Enable Agent Underlord to read and reference comments and notes added to Descript projects, including: Timeline comments/annotations Script notes and inline comments Project-level notes or task lists Time-stamped feedback markers Why This Matters: Natural Workflow Integration Users already document their editing decisions using Descript's native commenting tools. Requiring manual duplication into chat breaks the flow. Efficiency Eliminates the need to re-type or screenshot notes, especially for complex projects with multiple change requests. Context Preservation Comments often include timestamp references, visual markers, and rich context that's difficult to communicate through text alone. Collaboration Alignment Many users work in teams where comments are the primary feedback mechanism. The agent should participate in that same communication layer. Example Use Case: A user reviews their 15-minute video, adds 8 timestamped comments like "cut this section" or "add title card here," then asks Agent Underlord: "Can you make the changes from my notes?" The agent would read all comments and execute the requested edits without requiring manual re-entry. Expected Behavior: When a user references "my notes" or "the comments I added," Agent Underlord should be able to query and retrieve comment content, timestamps, and context, then execute the requested changes accordingly.
0
·
AI
AI WCAG 2.1 AA Audio Description Generationfor Video
The federal government recently postponed a rule requiring public entities and universities to comply with WCAG 2.1 Level AA under the ADA which includes Audio Description requirements for Video by (now) April 24, 2027. This process can many hours to do manually for even simple videos, and is practically or operationally impossible for many workflows or content types Audio Descriptions allow low vision users to hear a spoken description of the visual content and action happening on sc either during the natural pauses between dialogue (standard), or through pausing the video to allow the visual content to be spoken (extended). AI models have only very recently reached the technical capability of analyzing video frames and accurately summarizing the content over time. When paired with speech-to-text or speech generation models, the descriptions can be largely automated, compared to manually review, scripting, voicing, and editing. Ideal features would include: Auto-generated editable and time-aware text descriptions of on screen visual content Generation of speech audio based on said description, that adapts to updates to edits Ability to detect non-dialogue portions of a video for possible AD audio insertion Automatic insertion of a freeze frame and ripple edit of generated audio Customization of timbre timing and phonetic pronunciation for non-standard words or phrases (think company names, uncommon person names, industry jargon and acronyms) Custom voice selection and cloning. Ability for human review, editing, and customization throughout the process. As a video editing platform on the cutting of integrating AI media tools into real production environments, Descript already has many of the tools in place to help significantly speed up the AD generation for hundreds of thousands of hours of video content in public entities' backlog in addition to new content generated every year while. Very few effective commercial solutions currently exist otherwise. https://www.w3.org/TR/WCAG21/ https://www.federalregister.gov/documents/2026/04/20/2026-07663/extension-of-compliance-dates-for-nondiscrimination-on-the-basis-of-disability-accessibility-of-web
0
·
AI
Load More