| CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos | Mar 17, 2023 | Language Modeling | Code Available | 1 |
| Efficient recurrent architectures through activity sparsity and sparse back-propagation through time | Jun 13, 2022 | Gesture Recognition, Language Modeling | Code Available | 1 |
| ELECTRAMed: a new pre-trained language representation model for biomedical NLP | Apr 19, 2021 | Drug–drug Interaction Extraction, Language Modeling | Code Available | 1 |
| CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization | Apr 6, 2025 | Benchmarking, Combinatorial Optimization | Code Available | 1 |
| Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning | Jan 1, 2023 | Language Modeling | Code Available | 1 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language Modeling | Code Available | 1 |
| Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models | Oct 15, 2023 | CPU, GPU | Code Available | 1 |
| ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users | May 24, 2024 | Diversity, Language Modeling | Code Available | 1 |
| CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Jan 13, 2023 | Domain Generalization, image-classification | Code Available | 1 |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPU, Language Modeling | Code Available | 1 |