| Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |
| A Simple Long-Tailed Recognition Baseline via Vision-Language Model | Nov 29, 2021 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Keep CALM and Explore: Language Models for Action Generation in Text-based Games | Oct 6, 2020 | Action GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| A Simple Language Model for Task-Oriented Dialogue | May 2, 2020 | Dialogue State TrackingEnd-To-End Dialogue Modelling | CodeCode Available | 1 | 5 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Jump to Conclusions: Short-Cutting Transformers With Linear Transformations | Mar 16, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Copy Is All You Need | Jul 13, 2023 | AllDomain Adaptation | CodeCode Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |