| Extending Context Window of Large Language Models from a Distributional Perspective | Oct 2, 2024 | 16k8k | CodeCode Available | 0 |
| On The Adaptation of Unlimiformer for Decoder-Only Transformers | Oct 2, 2024 | 4k8k | —Unverified | 0 |
| PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization | Sep 25, 2024 | 8kDomain Adaptation | CodeCode Available | 1 |
| PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training | Sep 23, 2024 | 8kGPU | —Unverified | 0 |
| LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models | Aug 31, 2024 | 8kGPU | CodeCode Available | 2 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k4k | CodeCode Available | 1 |
| Evaluating Large Language Models on Spatial Tasks: A Multi-Task Benchmarking Study | Aug 26, 2024 | 8kBenchmarking | —Unverified | 0 |
| Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns | Aug 24, 2024 | 8kArticles | CodeCode Available | 0 |
| SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models | Aug 21, 2024 | 8kGSM8K | CodeCode Available | 1 |
| FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Aug 21, 2024 | 8kDecoder | CodeCode Available | 1 |