| Linear Attention via Orthogonal Memory | Dec 18, 2023 | Causal Language Modeling, Computational Efficiency | —Unverified | 0 | 0 |
| DavIR: Data Selection via Implicit Reward for Large Language Models | Oct 16, 2023 | Causal Language Modeling, GSM8K | —Unverified | 0 | 0 |
| Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization | Jun 16, 2025 | Causal Language Modeling, Instruction Following | —Unverified | 0 | 0 |
| Multitask Finetuning for Improving Neural Machine Translation in Indian Languages | Dec 3, 2021 | Causal Language Modeling, Language Modeling | —Unverified | 0 | 0 |
| Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems | Oct 11, 2021 | Causal Language Modeling, Diversity | —Unverified | 0 | 0 |
| N-gram Prediction and Word Difference Representations for Language Modeling | Sep 5, 2024 | Causal Language Modeling, Language Modeling | —Unverified | 0 | 0 |
| NIFTY Financial News Headlines Dataset | May 16, 2024 | Causal Language Modeling, Language Modeling | —Unverified | 0 | 0 |
| Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning | Aug 30, 2024 | Causal Language Modeling, Continual Learning | —Unverified | 0 | 0 |
| Predictability and Causality in Spanish and English Natural Language Generation | Aug 26, 2024 | Causal Language Modeling, Language Modeling | —Unverified | 0 | 0 |
| Prix-LM: Pretraining for Multilingual Knowledge Base Construction | Nov 16, 2021 | Bilingual Lexicon Induction, Causal Language Modeling | —Unverified | 0 | 0 |