Highlights
- Support training of ActionCLIP
- Support VindLU multi-modality algorithm
- Support MobileOne TSN/TSM
New Features
- Support training of ActionCLIP (2620)
- Support video retrieval dataset MSVD (2622)
- Support VindLU multi-modality algorithm (2667)
- Support Dense Regression Network for Video Grounding (2668)
Improvements
- Support Video Demos (2602)
- Support Audio Demos (2603)
- Add README_zh-CN.md for Swin and VideoMAE (2621)
- Support MobileOne TSN/TSM (2656)
- Support SlowOnly K700 features for training localization models (2673)
Bug Fixes