Currently, Descript only allows sentence-by-sentence replacement of text with a dub. However, with instructional content there are many cases where an entire audio track needs to be replaced. For example, an SME records a demo with audio (and background noise), and a professional voiceover is desired instead. There is also a potential application in localization. For reference, Cubase ($$$) offers something similar, based on the waveform and on splitting the audio to avoid distortion. Descript's text-anchored tech seems like a perfect fit for this approach, and it would be a very attractive feature for enterprise organizations.