Name	Name	Last commit message	Last commit date
Latest commit jackaduma Update README.zh-CN.md Mar 22, 2023 55f842e · Mar 22, 2023 History 42 Commits
cache	cache	update README	Jul 14, 2020
converted_sound	converted_sound	data & result wav file	Jul 14, 2020
data	data	data & result wav file	Jul 14, 2020
misc	misc	add donate img	Jan 18, 2021
LICENSE	LICENSE	Initial commit	May 13, 2020
README.md	README.md	update README	May 5, 2022
README.zh-CN.md	README.zh-CN.md	Update README.zh-CN.md	Mar 22, 2023
model_tf.py	model_tf.py	model's network archtech	Jul 14, 2020
preprocess.py	preprocess.py	preprocess	Jul 14, 2020
preprocess_training.py	preprocess_training.py	preprocess before training: generate wav spec	Jul 14, 2020
requirements.txt	requirements.txt	add requirements.txt	Jul 14, 2020
train.py	train.py	fixed: the second step adverserial loss: but the	Nov 17, 2020
trainingDataset.py	trainingDataset.py	dataset loader	Jul 14, 2020

Repository files navigation

CycleGAN-VC2-PyTorch

中文说明 | English

This code is a PyTorch implementation for paper: CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion, a nice work on Voice-Conversion/Voice Cloning.

Update

2020.11.17: fixed issues: re-implements the second step adverserial loss.

2020.08.27: add the second step adverserial loss by Jeffery-zhang-nfls

CycleGAN-VC2

Project Page

To advance the research on non-parallel VC, we propose CycleGAN-VC2, which is an improved version of CycleGAN-VC incorporating three new techniques: an improved objective (two-step adversarial losses), improved generator (2-1-2D CNN), and improved discriminator (Patch GAN).

This repository contains:

model code which implemented the paper.
audio preprocessing script you can use to create cache for training data.
training scripts to train the model.
Examples of Voice Conversion - converted result after training.

Requirement

pip install -r requirements.txt

Usage

preprocess

python preprocess_training.py

is short for

python preprocess_training.py --train_A_dir ./data/S0913/ --train_B_dir ./data/gaoxiaosong/ --cache_folder ./cache/

train

python train.py

is short for

python train.py --logf0s_normalization ./cache/logf0s_normalization.npz --mcep_normalization ./cache/mcep_normalization.npz --coded_sps_A_norm ./cache/coded_sps_A_norm.pickle --coded_sps_B_norm ./cache/coded_sps_B_norm.pickle --model_checkpoint ./model_checkpoint/ --resume_training_at ./model_checkpoint/_CycleGAN_CheckPoint --validation_A_dir ./data/S0913/ --output_A_dir ./converted_sound/S0913 --validation_B_dir ./data/gaoxiaosong/ --output_B_dir ./converted_sound/gaoxiaosong/

Pretrained

a pretrained model which converted between S0913 and GaoXiaoSong

download from Google Drive <735MB>

Demo

Samples:

reference speaker A: S0913(./data/S0913/BAC009S0913W0351.wav)

reference speaker B: GaoXiaoSong(./data/gaoxiaosong/gaoxiaosong_1.wav)

speaker A's speech changes to speaker B's voice: Converted from S0913 to GaoXiaoSong (./converted_sound/S0913/BAC009S0913W0351.wav)

Star-History

Reference

CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion. Paper, Project
Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks. Paper, Project
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. Paper, Project, Code
Image-to-Image Translation with Conditional Adversarial Nets. Paper, Project, Code

Donation

If this project help you reduce time to develop, you can give me a cup of coffee :)

AliPay(支付宝)

WechatPay(微信)

License

MIT © Kun

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CycleGAN-VC2-PyTorch

Update

CycleGAN-VC2

Project Page

Table of Contents

Requirement

Usage

preprocess

train

Pretrained

Demo

Star-History

Reference

Donation

License

About

Releases

Packages

Contributors 2

Languages

License

jackaduma/CycleGAN-VC2

Folders and files

Latest commit

History

Repository files navigation

CycleGAN-VC2-PyTorch

Update

CycleGAN-VC2

Project Page

Table of Contents

Requirement

Usage

preprocess

train

Pretrained

Demo

Star-History

Reference

Donation

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages