update SLP-P12.md

DmitryRyumin · Oct 20, 2024 · 75e35a0 · 75e35a0
1 parent 2d82d9d
commit 75e35a0
Showing 1 changed file with 8 additions and 2 deletions.
diff --git a/sections/2024/main/SLP-P12.md b/sections/2024/main/SLP-P12.md
@@ -34,7 +34,7 @@
 
 ## Robust Speech Recognition and Adaptation
 
-![Section Papers](https://img.shields.io/badge/Section%20Papers-0-42BA16) ![Preprint Papers](https://img.shields.io/badge/Preprint%20Papers-0-b31b1b) ![Papers with Open Code](https://img.shields.io/badge/Papers%20with%20Open%20Code-0-1D7FBF) ![Papers with Video](https://img.shields.io/badge/Papers%20with%20Video-0-FF0000)
+![Section Papers](https://img.shields.io/badge/Section%20Papers-23-42BA16) ![Preprint Papers](https://img.shields.io/badge/Preprint%20Papers-13-b31b1b) ![Papers with Open Code](https://img.shields.io/badge/Papers%20with%20Open%20Code-3-1D7FBF) ![Papers with Video](https://img.shields.io/badge/Papers%20with%20Video-0-FF0000)
 
 | **Title** | **Repo** | **Paper** | **Video** |
 |-----------|:--------:|:---------:|:---------:|
@@ -54,4 +54,10 @@
 | Synthetic Conversations Improve Multi-Talker ASR | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446589-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446589) | :heavy_minus_sign: |
 | Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition | [![GitHub](https://img.shields.io/github/stars/cs20s030/stable_distillation?style=flat)](https://github.com/cs20s030/stable_distillation) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446335-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446335) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2312.12783-b31b1b.svg)](https://arxiv.org/abs/2312.12783) | :heavy_minus_sign: |
 | Towards High-Performance and Low-Latency Feature-based Speaker Adaptation of Conformer Speech Recognition Systems | [![GitHub Page](https://img.shields.io/badge/GitHub-Page-159957.svg?style=flat)](https://jjdean321.github.io/FastAdapt/) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448488-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448488) | :heavy_minus_sign: |
-| Progressive Unsupervised Domain Adaptation for ASR Using Ensemble Models and Multi-Stage Training | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448438-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448438) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2402.04805-b31b1b.svg)](https://arxiv.org/abs/2402.04805) | :heavy_minus_sign: |
+| Progressive Unsupervised Domain Adaptation for ASR Using Ensemble Models and Multi-Stage Training | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448438-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448438) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2402.04805-b31b1b.svg)](https://arxiv.org/abs/2402.04805) | :heavy_minus_sign: |
+| Sparsely Shared LoRA on Whisper for Child Speech Recognition | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10447004-E4A42C.svg)](https://ieeexplore.ieee.org/document/10447004) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2309.11756-b31b1b.svg)](https://arxiv.org/abs/2309.11756) | :heavy_minus_sign: |
+| Cross-Speaker Encoding Network for Multi-Talker Speech Recognition | [![GitHub](https://img.shields.io/github/stars/kjw11/CSEnet-ASR?style=flat)](https://github.com/kjw11/CSEnet-ASR) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446249-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446249) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2401.04152-b31b1b.svg)](https://arxiv.org/abs/2401.04152) | :heavy_minus_sign: |
+| Max-Margin Transducer Loss: Improving Sequence-Discriminative Training Using a Large-Margin Learning Strategy | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446322-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446322) | :heavy_minus_sign: |
+| Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10447240-E4A42C.svg)](https://ieeexplore.ieee.org/document/10447240) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2309.10707-b31b1b.svg)](https://arxiv.org/abs/2309.10707) | :heavy_minus_sign: |
+| FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning | [![GitHub](https://img.shields.io/github/stars/cs20s030/fusdom?style=flat)](https://github.com/cs20s030/fusdom) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448147-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448147) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2312.13026-b31b1b.svg)](https://arxiv.org/abs/2312.13026) | :heavy_minus_sign: |
+| AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446721-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446721) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2403.11578-b31b1b.svg)](https://arxiv.org/abs/2403.11578) | :heavy_minus_sign: |