| DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data | Sep 30, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI | Jul 19, 2023 | Conversational RecommendationDiversity | CodeCode Available | 2 |
| BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer | Jul 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Rank1: Test-Time Compute for Reranking in Information Retrieval | Feb 25, 2025 | Information RetrievalInstruction Following | CodeCode Available | 2 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| Re3: Generating Longer Stories With Recursive Reprompting and Revision | Oct 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Nov 21, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | DecoderImage Captioning | CodeCode Available | 2 |
| BAE: BERT-based Adversarial Examples for Text Classification | Apr 4, 2020 | Adversarial AttackAdversarial Text | CodeCode Available | 2 |
| Backtracing: Retrieving the Cause of the Query | Mar 6, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 2 |