This is a repository for a voice conversion model based on VITS.
- Python >= 3.8
- Clone this repository
- Install the Python requirements listed in requirements.txt
- Download the FairSeq HuBERT pretrained models
- Download datasets
First, generate the file list from the dataset folder.
The dataset folder should contain one subfolder per speaker, each holding that speaker's audio files.
Run filelist.py to get filelist.txt.
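As a rough illustration of the file-list step, the sketch below walks a dataset folder with one subfolder per speaker and writes one line per audio file. It is not the repository's filelist.py: the function name `build_filelist`, the `path|speaker` line format, and the set of audio extensions are all assumptions made for this example, so check filelist.py for the format the training code actually expects.

```python
from pathlib import Path

# Assumption: which file types count as audio. Adjust to match your dataset.
AUDIO_EXTENSIONS = {".wav", ".flac"}


def build_filelist(dataset_dir, output_path="filelist.txt"):
    """Write one 'path|speaker' line per audio file (hypothetical format).

    Each direct subfolder of dataset_dir is treated as one speaker, and the
    subfolder name is used as the speaker label.
    """
    dataset_dir = Path(dataset_dir)
    lines = []
    for speaker_dir in sorted(p for p in dataset_dir.iterdir() if p.is_dir()):
        for audio in sorted(speaker_dir.rglob("*")):
            if audio.is_file() and audio.suffix.lower() in AUDIO_EXTENSIONS:
                lines.append(f"{audio.as_posix()}|{speaker_dir.name}")

    # End the file with a newline only when there is something to write.
    text = "\n".join(lines) + ("\n" if lines else "")
    Path(output_path).write_text(text, encoding="utf-8")
    return lines
```

Calling `build_filelist("dataset")` on a folder laid out as `dataset/<speaker>/<audio files>` would write filelist.txt to the current directory and return the same lines as a list.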
Then split the dataset into training and validation sets by running split.py.
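The split step can be pictured as a seeded shuffle followed by a cut. The sketch below is only an illustration, not the repository's split.py: the function name `split_filelist`, the 10% validation ratio, and the seed are assumptions, and the real script may split differently (for example, per speaker).

```python
import random


def split_filelist(lines, val_ratio=0.1, seed=1234):
    """Shuffle file-list lines and return (train_lines, val_lines).

    val_ratio and seed are illustrative defaults, not values taken from
    the repository.
    """
    lines = list(lines)
    # A seeded Random instance keeps the split reproducible across runs.
    random.Random(seed).shuffle(lines)

    # Hold out at least one item when there are two or more lines.
    n_val = max(1, int(len(lines) * val_ratio)) if len(lines) > 1 else 0
    return lines[n_val:], lines[:n_val]
```

Fixing the seed means the same filelist.txt always yields the same training and validation sets, which makes runs easier to compare.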
pass