[ICASSP 2024 Oral] WAVER: Writing-Style Agnostic Text-Video Retrieval Via Distilling Vision-Language Models Through Open-Vocabulary Knowledge
-
Updated
Jan 10, 2024 - Python
[ICASSP 2024 Oral] WAVER: Writing-Style Agnostic Text-Video Retrieval Via Distilling Vision-Language Models Through Open-Vocabulary Knowledge
Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 2024
Official code for "Multi-Level Motion Attention with Contrastive Learning for Few-shot Action Recognition" (IICASSP2024)
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.
2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification
StofNet: Super-resolution Time of Flight Network (ICASSP 2024)
Repository for the ICASSP 2024 paper "An Experimental Comparison Of Multi-view Self-supervised Methods For Music Tagging".
The official implementation for IEEE-ICASSP 2024 paper "Flare-Free Vision: Empowering Uformer with Depth Insights"
Cross-lingual learning in scene text recognition (ICASSP2024)
Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
Add a description, image, and links to the icassp2024 topic page so that developers can more easily learn about it.
To associate your repository with the icassp2024 topic, visit your repo's landing page and select "manage topics."