Zero-shot-Singing-Voice-Conversion

Unofficial PyTorch implementation of zero-shot singing voice conversion (SVC), presented at ISMIR 2020.

@misc{Nercessian2020Zero-shot,
    title={Zero-shot Singing Voice Conversion},
    author={Shahan Nercessian},
    booktitle={Submitted to International Society for Music Information Retrieval},
    year={2020},
    url={https://program.ismir2020.net/poster_1-08.html}
}

In this paper, we propose the application of speaker embedding networks for zero-shot SVC. We suggest two architectures for carrying out zero-shot SVC using the WORLD vocoder for modeling singing voice. Overall, we find that speaker embeddings can indeed be used directly for zero-shot SVC. Moreover, zero-shot networks that replace one-hot speaker labels with speaker embeddings perform as well as (or even better than) their supervised closed-set counterparts, with the invaluable added benefits that they can be trained on unlabeled data and can potentially adapt to new voices without requiring further training. Furthermore, we show that there is some benefit to training zero-shot SVC networks by adapting an initial model trained on large amounts of speech data.

In future work, we will investigate learning latent factors which can allow for further expressive manipulation of conversion results. While some initial progress to this end has been made using Gaussian Mixture VAEs (GMVAEs), they have largely been limited to sung vowels. We can likely generalize this to more practical singing voice by utilizing the conditioning signals used in this work. We are also interested in replacing the WORLD vocoder with learned vocoders based on differentiable digital signal processing, in order to enable lightweight end-to-end
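The difference between one-hot speaker labels and speaker embeddings can be illustrated with a small sketch. This is not the code in `model.py` and not the paper's architecture: it is a dependency-free Python toy, and the function names, feature sizes, and embedding values are all hypothetical. It only shows why a one-hot label ties a model to its training speakers, while a fixed-size embedding can describe a voice that was never seen in training.

```python
from typing import List, Sequence


def one_hot_label(speaker: str, training_speakers: Sequence[str]) -> List[float]:
    """Closed-set conditioning: one slot per speaker seen during training."""
    if speaker not in training_speakers:
        raise KeyError(f"{speaker!r} was not in the training set")
    return [1.0 if s == speaker else 0.0 for s in training_speakers]


def condition_frames(
    frames: Sequence[Sequence[float]], speaker_vector: Sequence[float]
) -> List[List[float]]:
    """Append the same speaker vector to every frame of content features."""
    return [list(frame) + list(speaker_vector) for frame in frames]


# Hypothetical toy data: 3 frames of 2-dimensional content features.
frames = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
training_speakers = ["singer_a", "singer_b"]

# Supervised closed-set input: the conditioning width equals the number of
# training speakers, so there is no way to describe a new voice.
closed_set_input = condition_frames(
    frames, one_hot_label("singer_a", training_speakers)
)

# Zero-shot input: the conditioning is a fixed-size embedding. The values
# below are placeholders standing in for the output of a pretrained speaker
# embedding network (an assumption for illustration only).
unseen_speaker_embedding = [0.12, -0.53, 0.08, 0.91]
zero_shot_input = condition_frames(frames, unseen_speaker_embedding)
```

In a real model the conditioned frames would be fed to a decoder that predicts vocoder features. Here they are left as plain lists so the sketch runs without PyTorch.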

Name: Mozhgan Dehghan Azad

Student number: 40014140111066

Course: Digital Signal Processing (Eslami)

Original repository: https://github.com/ak9250/Zero-shot-Singing-Voice-Conversion

