Create a feature that can automatically identify and separate two voices in a video recording, assigning each voice to its respective avatar. The app should be capable of detecting each voice and linking it to a corresponding avatar with natural expressions and synchronized mouth movements. The app should ensure that the avatars match the tone, emotion, and timing of the voices, making the conversation appear seamless and lifelike.