Sometimes I have multiple video files but only a single audio file. Since Descript has the ability to detect speakers based on voice alone, it would be great to pair this with the auto multicam feature, meaning we don't need separate isolated audio tracks to use it.