This folder contains the tools to train the neural networks used by T-Top.
The descriptor_yolo_v7
network is training with https://github.com/mamaheux/descriptor-yolov7.
- Training scripts
- The train_audio_descriptor_extractor.py script trains a neural network that
classifies a sound to one class and extracts an embedding. The network name is
audio_descriptor_extractor
. - The train_backbone.py script trains a neural network that classifies images.
- The train_backbone_distillation.py script trains a student neural network that classifies images from a teacher one.
- The train_descriptor_yolo_v4.py script trains a neural network that detects
objects, classifies them and extracts embeddings. The network name is
descriptor_yolo_v4
. The training is not working properly. - The train_face_descriptor_extractor.py script trains a neural network that
extracts an embedding for a face. The network name is
face_descriptor_extractor
. - The train_keyword_spotter.py script trains a neural network that detects a wake-up
word. The network name is
keyword_spotter
. - The train_multiclass_audio_descriptor_extractor.py script trains
a neural network that classifies a sound to many classes and extracts an embedding. The network name
is
audio_descriptor_extractor
. - The train_pose_estimator.py script trains a neural network that estimates the pose of a
person. The network name is
pose_estimator
. - The train_semantic_segmentation_network.py script trains a neural
network that performs semantic segmentation. The network name is
semantic_segmentation_network
.
- The train_audio_descriptor_extractor.py script trains a neural network that
classifies a sound to one class and extracts an embedding. The network name is
- Export scripts
- The export_audio_descriptor_extractor.py script exports
the
audio_descriptor_extractor
network to a TorchScript file and a TensorRT file. - The export_descriptor_yolo.py script exports the
descriptor_yolo
network to a TorchScript file and a TensorRT file. - The export_face_descriptor_extractor.py script exports
the
face_descriptor_extractor
network to a TorchScript file and a TensorRT file. - The export_keyword_spotter.py script exports the
keyword_spotter
network to a TorchScript file and a TensorRT file. - The export_pose_estimator.py script exports the
pose_estimator
network to a TorchScript file and a TensorRT file. - The export_yolo_v4.py script exports the
yolo_v4
network to a TorchScript file and a TensorRT file. - The export_semantic_segmentation_network.py script exports the
semantic_segmentation_network
network to a TorchScript file and a TensorRT file.
- The export_audio_descriptor_extractor.py script exports
the
- Test scripts
- The test_exported_audio_descriptor_extractor.py script tests the
exported
audio_descriptor_extractor
network. - The test_exported_descriptor_yolo_v4.py script tests the
exported
descriptor_yolo_v4
network. - The test_exported_face_descriptor_extractor.py script tests the
exported
face_descriptor_extractor
network. - The test_exported_keyword_spotter.py script tests the
exported
keyword_spotter
network. - The test_exported_pose_estimator.py script tests the exported
pose_estimator
network. - The test_lfw_vox_celeb_models.py script tests the combination of
the
audio_descriptor_extractor
network and theface_descriptor_extractor
network. - The test_pose_estimator_with_yolo_v4.py script tests the
pose_estimator
network with theyolo_v4
network on the COCO dataset. - The test_exported_semantic_segmentation_network.py script tests
the exported
semantic_segmentation_network
network.
- The test_exported_audio_descriptor_extractor.py script tests the
exported
- Setup the Python virtual environment.
python3 -m venv venv
source venv/bin/activate
pip install numpy
pip install -r requirements.txt
- Show the help of the script :
python script.py -h
- Use the script according to the help :
python script.py ...