Recursive-joint-co-attention

Implementation for Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention.

Dependency
torch 0.4.1
python 3.6

Download the data from the link in the original paper and put it in the data folder.

The train and test functions are in supervised_main.py.
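
Before running supervised_main.py, it may help to confirm that the local setup matches the versions and data folder listed above. The following is only a minimal sketch and is not part of this repository: the helper name `check_environment` is hypothetical, and the check assumes the data folder is named `data` and sits in the current working directory.

```python
import os
import sys
from importlib import import_module

# Versions taken from the Dependency list in this README.
EXPECTED_PYTHON = (3, 6)
EXPECTED_TORCH = "0.4.1"


def check_environment(data_dir="data"):
    """Hypothetical helper: return a list of warnings, empty if all checks pass."""
    warnings = []

    # Compare the running interpreter against the listed Python version.
    found_python = sys.version_info[:2]
    if found_python != EXPECTED_PYTHON:
        warnings.append(
            "python %d.%d found, README lists %d.%d"
            % (found_python + EXPECTED_PYTHON)
        )

    # torch may be absent entirely, so import it lazily and report that case.
    try:
        torch = import_module("torch")
    except ImportError:
        warnings.append("torch is not installed, README lists " + EXPECTED_TORCH)
    else:
        if not str(torch.__version__).startswith(EXPECTED_TORCH):
            warnings.append(
                "torch %s found, README lists %s"
                % (torch.__version__, EXPECTED_TORCH)
            )

    # Assumption: the downloaded data lives in a folder named "data".
    if not os.path.isdir(data_dir):
        warnings.append("data folder not found: " + data_dir)

    return warnings


if __name__ == "__main__":
    for message in check_environment():
        print("WARNING:", message)
```

A version mismatch is reported as a warning rather than an error, since the code may or may not run on newer releases; this README only states the versions it was written against.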