DiffVC

Diffusion-based voice conversion model for one-shot speaker style transfer. Introduces a novel stochastic differential equation (SDE) solver for fast maximum likelihood sampling, enabling high-quality voice conversion while preserving linguistic content. Accepted as an oral presentation at ICLR 2022; the paper reports higher quality than state-of-the-art one-shot voice conversion approaches.
