Amplifier Health - AI Engineer/Researcher Take-Home Project

Overview

This repository contains my implementation of the take-home project for the AI Engineer/Researcher role at Amplifier Health.

I approached the task as an audio classification problem in both a binary and a multi-class setting. I implemented two models, ResNet-18 and LCNN, each of which can be trained on one of four audio feature sets: MelSpectrogram, LogSpectrogram, MFCC, and LFCC. The experiments compare these feature sets to identify the one that yields the best classification performance for this task.

Here are the main components of this repository:

main.py: The core script that handles model setup, dataset loading, training, and evaluation.
visualize_results.py: A script to visualize the evaluation results of the trained models.
test_code.py: A script that runs a quick check that the code works correctly and plots the ROC curve of the classification results.
src/: A folder containing the source code for the models, training, and feature extraction.
config/: A folder containing configuration files for the models and training.
requirements.txt: A text file listing all the dependencies needed to run the project.
run_experiments.sh: A shell script to run the experiments across different configurations.
notebooks/analyze_data.ipynb: A Jupyter notebook for dataset analysis and visualization.
report.pdf: The final report of the assignment.
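
As an illustration of what "running the experiments across different configurations" can look like, here is a minimal Python sketch that enumerates every combination of feature set, model architecture, and classification type, using the main.py flags documented under Running the Code. It is not the content of run_experiments.sh: the function name build_commands, the fixed window length of 5.0 seconds, and the choice to both train and evaluate in every run are assumptions made for the example.

```python
from itertools import product

# Values taken from the command-line options documented in this README.
FEATURE_SETS = ["MelSpec", "LogSpec", "MFCC", "LFCC"]
MODEL_ARCHS = ["ResNet", "LCNN"]
CLASSIFICATION_TYPES = ["binary", "multi"]


def build_commands(win_len=5.0):
    """Return one main.py command line per configuration (hypothetical helper)."""
    commands = []
    for feature_set, model_arch, cls_type in product(
        FEATURE_SETS, MODEL_ARCHS, CLASSIFICATION_TYPES
    ):
        commands.append(
            f"python main.py --feature_set {feature_set} --model_arch {model_arch} "
            f"--train_model True --eval_model True "
            f"--classification_type {cls_type} --win_len {win_len}"
        )
    return commands


# Print the full grid: 4 feature sets x 2 architectures x 2 task types.
for command in build_commands():
    print(command)
```

The commands are only printed here, so the sketch can be inspected without starting any training run.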

Running the Code

After creating the environment and installing the dependencies listed in requirements.txt, run main.py and select the experiment configuration with the following command-line arguments:

--feature_set: Choose from ['MelSpec', 'LogSpec', 'MFCC', 'LFCC'] for the feature set.
--model_arch: Choose from ['ResNet', 'LCNN'] for the model architecture.
--train_model: Set to True to train the model, or False to skip training.
--eval_model: Set to True to evaluate the model after training, or False to skip evaluation.
--classification_type: Set the classification type to either 'binary' or 'multi'.
--win_len: The length (in seconds) of the audio window for analysis.

Example:

python main.py --feature_set MelSpec --model_arch ResNet --train_model True --eval_model True --classification_type binary --win_len 5.0
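
For reference, the options above could be declared with Python's argparse roughly as follows. This is a sketch under assumptions, not the parser in main.py: the helper names str2bool and build_parser and all default values are hypothetical. It also shows one detail worth knowing when passing True/False on the command line: argparse's `type=bool` would treat the string "False" as true, so an explicit string-to-boolean converter is used instead.

```python
import argparse


def str2bool(value):
    """Convert 'True'/'False'-style strings to bool (hypothetical helper).

    Using type=bool directly would turn any non-empty string,
    including 'False', into True.
    """
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    """Build a parser for the documented flags; the defaults are assumptions."""
    parser = argparse.ArgumentParser(description="Audio classification experiments")
    parser.add_argument("--feature_set", default="MelSpec",
                        choices=["MelSpec", "LogSpec", "MFCC", "LFCC"])
    parser.add_argument("--model_arch", default="ResNet",
                        choices=["ResNet", "LCNN"])
    parser.add_argument("--train_model", type=str2bool, default=True)
    parser.add_argument("--eval_model", type=str2bool, default=True)
    parser.add_argument("--classification_type", default="binary",
                        choices=["binary", "multi"])
    parser.add_argument("--win_len", type=float, default=5.0,
                        help="length of the audio window in seconds")
    return parser


# Parse the same arguments as the example command above.
example = ("--feature_set MelSpec --model_arch ResNet --train_model True "
           "--eval_model True --classification_type binary --win_len 5.0")
args = build_parser().parse_args(example.split())
print(args.feature_set, args.model_arch, args.train_model, args.win_len)
```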

To run the test code, download the repository and run the following command:

python test_code.py
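
For readers unfamiliar with the ROC curve that the test script plots, the following minimal sketch shows how its points and the area under it can be computed from binary labels and classifier scores. It is not taken from test_code.py, which may rely on a library routine instead; the function names roc_curve_points and trapezoid_auc and the toy labels and scores are hypothetical.

```python
def roc_curve_points(labels, scores):
    """Return (fpr, tpr) lists obtained by sweeping a threshold over the scores.

    labels: iterable of 0/1 ground-truth values (1 = positive class).
    scores: iterable of classifier scores, higher meaning "more positive".
    """
    labels = list(labels)
    scores = list(scores)
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative sample")

    # Rank samples from the highest to the lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    fpr, tpr = [0.0], [0.0]
    tp = fp = 0
    for i, (score, label) in enumerate(ranked):
        if label == 1:
            tp += 1
        else:
            fp += 1
        # Emit a point only after the last sample of a group of tied scores.
        if i == len(ranked) - 1 or ranked[i + 1][0] != score:
            fpr.append(fp / n_neg)
            tpr.append(tp / n_pos)
    return fpr, tpr


def trapezoid_auc(fpr, tpr):
    """Area under the ROC curve using the trapezoidal rule."""
    return sum(
        (fpr[i] - fpr[i - 1]) * (tpr[i] + tpr[i - 1]) / 2.0
        for i in range(1, len(fpr))
    )


# Toy example with made-up labels and scores.
fpr, tpr = roc_curve_points([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
print(list(zip(fpr, tpr)))
print(trapezoid_auc(fpr, tpr))
```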

davidesalvi/ICBHI_2017
