ISSAC - Interpretability of Speech Signal under Adverse Conditions - Language ID

GitHub Link: https://github.com/TonnyTran/ISSAC_LanguageID

Installation:

Setting up environment

Install Kaldi

git clone -b 5.4 https://github.com/kaldi-asr/kaldi.git kaldi
cd kaldi/tools/; 
# Run this next line to check for dependencies, and then install them
extras/check_dependencies.sh
make; cd ../src; ./configure; make depend; make

Install EspNet

git clone -b v.0.9.7 https://github.com/espnet/espnet.git
cd espnet/tools/        # change to tools folder
ln -s {kaldi_root}      # Create link to Kaldi. e.g. ln -s home/theanhtran/kaldi/

Set up Conda environment

./setup_anaconda.sh anaconda espnet 3.7.9   # Create a anaconda environmetn - espnet with Python 3.7.9
make TH_VERSION=1.8.0 CUDA_VERSION=10.2     # Install Pytorch and CUDA
. ./activate_python.sh; python3 check_install.py  # Check the installation
conda install torchvision==0.9.0 torchaudio==0.8.0 -c pytorch

Install Kaldi IO

conda install kaldi_io

Download the project

Clone the project from GitHub into your workspace

git clone https://github.com/TonnyTran/ISSAC_LanguageID

Point to your espnet

Open ISSAC_LanguageID/path.sh file, change $MAIN_ROOT$ to your espnet directory, e.g. MAIN_ROOT=/home/theanhtran/espnet

How to run Language ID systems

Data preparation step Open ISSAC_LanguageID/prepare_data.sh file, update raw LRE 2017 data location of train, dev and test set

bash prepare_data.sh --steps 1-6     # we can run step by step

Run the program: train Kaldi x-vector baseline

bash baseline_xvector.sh --steps 1-7

Test the pretrained model: Kaldi x-vector baseline

bash test.sh --steps 1-2

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
conf		conf
local		local
pretrained-model		pretrained-model
sid		sid
steps_ivec		steps_ivec
subtools		subtools
.gitignore		.gitignore
README.md		README.md
baseline_xvector.sh		baseline_xvector.sh
cmd.sh		cmd.sh
path.sh		path.sh
prepare_data.sh		prepare_data.sh
test.log		test.log
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ISSAC - Interpretability of Speech Signal under Adverse Conditions - Language ID

Installation:

Setting up environment

Download the project

How to run Language ID systems

About

Releases

Packages

Languages

TonnyTran/ISSAC_LanguageID

Folders and files

Latest commit

History

Repository files navigation

ISSAC - Interpretability of Speech Signal under Adverse Conditions - Language ID

Installation:

Setting up environment

Download the project

How to run Language ID systems

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages