This is the official GitHub repository for the AAAI 2023 proceedings paper: "Zero-shot Face-based Voice Conversion: Bottleneck-Free Speech Disentanglement in the Real-world Scenario".
The demo website: https://sites.google.com/view/spfacevc-demo
- Download your data (containing both face images and speech).
- Change the wav input and output paths in `preprocess/config/preprocess.yaml`, then generate the data with command:
python3 preprocess/preprocess.py
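The entries you need to edit in `preprocess.yaml` look roughly like the following; the key names here are illustrative, so match them against the actual keys in the file:

```yaml
# Hypothetical key names -- edit the corresponding keys in preprocess.yaml
input_wav_dir: /path/to/raw/wavs        # your downloaded speech data
output_dir: /path/to/processed/data     # where preprocessed features are written
```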
- Change `rootDir` and `targetDir` in `make_faceemb.py`, then run it to extract the face embeddings. Next, take the arithmetic mean of the embeddings per speaker (change your input and output directory paths as well) with commands:
python3 make_faceemb.py
python3 make_spk_mean.py
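The averaging step in `make_spk_mean.py` amounts to an arithmetic mean over all utterance-level embeddings of a speaker. A minimal sketch, assuming one `.npy` embedding file per utterance grouped in a directory per speaker (the on-disk layout is an assumption, not the repo's exact format):

```python
# Sketch of the per-speaker mean embedding; file layout is hypothetical.
from pathlib import Path

import numpy as np


def speaker_mean(emb_dir: str) -> np.ndarray:
    """Average all utterance-level face embeddings found in one speaker's directory."""
    embs = [np.load(p) for p in sorted(Path(emb_dir).glob("*.npy"))]
    # Stack into (num_utterances, emb_dim) and average over the utterance axis.
    return np.mean(np.stack(embs), axis=0)
```

The resulting vector serves as the speaker's face embedding for training and conversion.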
- Change the directory paths in `data_loader.py` to your own.
- Train the model until the loss converges with command:
python3 main_gan.py --model_id $your_id$
- Change the parameters in `conversion_speechbrain.py` to point to your checkpoint and data, then generate the converted results with command:
python3 conversion_speechbrain.py
- Synthesize waveforms with the pretrained WaveGlow model: replace its `inference.py` with our file, then run:
python3 inference.py -f $your_result_path$ -w $waveglow_checkpoint_path$ -o $output_dir$ --is_fp16 -s 0.6
- If you need a checkpoint, a reference one is provided; please read the Readme.txt in the following link first. Thanks.