| Multi-Head Mixture-of-Experts | Apr 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | AttributeAutoML | CodeCode Available | 1 |
| Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices | Mar 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Dec 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bring Your Own Data! Self-Supervised Evaluation for Large Language Models | Jun 23, 2023 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| MultiMax: Sparse and Multi-Modal Attention Learning | Jun 3, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| Entity Tracking in Language Models | May 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval | Oct 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code GenerationDecision Making | CodeCode Available | 1 |
| Can Large Language Model Agents Balance Energy Systems? | Feb 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Generative Approach for Script Event Prediction via Contrastive Fine-tuning | Dec 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AcTune: Uncertainty-aware Active Self-Training for Semi-Supervised Active Learning with Pretrained Language Models | Dec 16, 2021 | Active LearningLanguage Modeling | CodeCode Available | 1 |
| Multi-Stage Document Ranking with BERT | Oct 31, 2019 | Document RankingLanguage Modeling | CodeCode Available | 1 |
| Multi-Task Learning for Knowledge Graph Completion with Pre-trained Language Models | Dec 1, 2020 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 1 |
| Building A Coding Assistant via the Retrieval-Augmented Language Model | Oct 21, 2024 | Code CompletionCode Generation | CodeCode Available | 1 |
| Epidemic Modeling with Generative Agents | Jul 11, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Deciphering the Language of Nature: A transformer-based language model for deleterious mutations in proteins | Oct 27, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications | Feb 5, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning | Sep 19, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| Enhancing RL Safety with Counterfactual LLM Reasoning | Sep 16, 2024 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning | Sep 19, 2024 | Change DetectionDecoder | CodeCode Available | 1 |