[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang
clip vehicle-tracking vehicle-detection prior mae pre-training vehicle-perceptron large-modal large-langauge-model vehicle-segmentation vehicle-attribute-recognition vehicle-fine-grained-classification vision-text-contrastive
-
Updated
Jul 29, 2024 - Python