| Leveraging Pre-trained Checkpoints for Sequence Generation Tasks | Jul 29, 2019 | DecoderMachine Translation | CodeCode Available | 1 |
| Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition | Feb 24, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Actor and Action Video Segmentation from a Sentence | Mar 20, 2018 | Action SegmentationDecoder | CodeCode Available | 1 |
| A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations | May 20, 2025 | SentenceSentence Classification | CodeCode Available | 1 |
| Librispeech Transducer Model with Internal Language Model Prior Correction | Apr 7, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Like a Good Nearest Neighbor: Practical Content Moderation and Text Classification | Feb 17, 2023 | ClassificationContrastive Learning | CodeCode Available | 1 |
| Linguistic Structure Guided Context Modeling for Referring Image Segmentation | Oct 1, 2020 | Dependency ParsingImage Segmentation | CodeCode Available | 1 |
| A Plug-and-Play Method for Controlled Text Generation | Sep 20, 2021 | SentenceStory Generation | CodeCode Available | 1 |
| Listening to Sounds of Silence for Speech Denoising | Oct 22, 2020 | DenoisingSentence | CodeCode Available | 1 |
| BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation | Mar 22, 2021 | Document Level Machine TranslationMachine Translation | CodeCode Available | 1 |