Update README.md
das-mithun authored Jun 17, 2024
1 parent 13ece34 commit 3dc3482
Showing 1 changed file with 13 additions and 1 deletion.
14 changes: 13 additions & 1 deletion Codes/README.md
@@ -22,11 +22,23 @@
# To Run the unimodal Vision Based models

6.Vision+lstm_foldWise.py
7. 3DCNN_withFolds.py
7.3DCNN_withFolds.py

# To Run the Multimodal Model
9. MultiModalFusionModelfoldWise.py

# To extract all the video frames.
frameExtract.py

# Extraction of transcript

The file 'all__video_vosk_audioMap.p' has to be generated using the Vosk speech recognition toolkit (https://alphacephei.com/vosk/). The file is in JSON format, mapping each video name to its transcript, as shown below:

{
"video_name1": "transcript1",
"video_name2": "transcript2",
...
"video_name3": "transcript3"
}
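
As a rough illustration of how such a file could be produced, here is a minimal sketch, not the script used in this repository. It assumes the audio of each video has already been exported as a mono 16-bit PCM WAV file, that Vosk is installed with a downloaded model, and that keying the map by the file stem is acceptable. The helper names `transcribe_wav`, `build_transcript_map` and `save_transcript_map` are hypothetical.

```python
import json
import wave
from pathlib import Path


def transcribe_wav(wav_path, model):
    """Transcribe one mono 16-bit PCM WAV file with a loaded vosk.Model."""
    # Imported lazily so the other helpers work without Vosk installed.
    from vosk import KaldiRecognizer

    pieces = []
    with wave.open(wav_path, "rb") as wf:
        rec = KaldiRecognizer(model, wf.getframerate())
        while True:
            data = wf.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True when an utterance is complete.
            if rec.AcceptWaveform(data):
                pieces.append(json.loads(rec.Result()).get("text", ""))
        # Flush whatever is still buffered in the recognizer.
        pieces.append(json.loads(rec.FinalResult()).get("text", ""))
    return " ".join(p for p in pieces if p)


def build_transcript_map(wav_paths, transcribe):
    """Map each file stem (assumed to be the video name) to its transcript."""
    return {Path(p).stem: transcribe(str(p)) for p in wav_paths}


def save_transcript_map(mapping, out_path):
    """Write the mapping as JSON, the format described above."""
    with open(out_path, "w", encoding="utf-8") as fh:
        json.dump(mapping, fh, ensure_ascii=False, indent=2)


# Example usage (assumes a Vosk model directory and a folder of WAV files,
# both hypothetical paths):
#   from vosk import Model
#   model = Model("path/to/vosk-model")
#   wavs = sorted(Path("path/to/wavs").glob("*.wav"))
#   mapping = build_transcript_map(wavs, lambda p: transcribe_wav(p, model))
#   save_transcript_map(mapping, "all__video_vosk_audioMap.p")
```

The sketch writes plain JSON because that is the format described above; if the training scripts load the file differently, only `save_transcript_map` would need to change.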
