| A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI) | Feb 4, 2025 | Long-Context Understanding | CodeCode Available | 0 |
| Can LLMs Maintain Fundamental Abilities under KV Cache Compression? | Feb 4, 2025 | Arithmetic ReasoningCode Generation | —Unverified | 0 |
| M+: Extending MemoryLLM with Scalable Long-Term Memory | Feb 1, 2025 | 16kGPU | CodeCode Available | 3 |
| Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks | Jan 11, 2025 | Code GenerationHumanEval | —Unverified | 0 |
| Repository Structure-Aware Training Makes SLMs Better Issue Resolver | Dec 26, 2024 | Long-Context Understanding | —Unverified | 0 |
| State Space Models are Strong Text Rerankers | Dec 18, 2024 | Long-Context UnderstandingMamba | —Unverified | 0 |
| LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning | Dec 18, 2024 | In-Context LearningLong-Context Understanding | —Unverified | 0 |
| RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios | Dec 12, 2024 | Logical ReasoningLong-Context Understanding | CodeCode Available | 1 |
| Gated Delta Networks: Improving Mamba2 with Delta Rule | Dec 9, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 4 |
| How Effective Is Self-Consistency for Long-Context Problems? | Nov 2, 2024 | Long-Context UnderstandingPosition | —Unverified | 0 |
| What is Wrong with Perplexity for Long-context Language Modeling? | Oct 31, 2024 | Document SummarizationIn-Context Learning | CodeCode Available | 2 |
| GATEAU: Selecting Influential Samples for Long Context Alignment | Oct 21, 2024 | Instruction FollowingLong-Context Understanding | CodeCode Available | 1 |
| BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression | Oct 20, 2024 | In-Context LearningLong-Context Understanding | CodeCode Available | 1 |
| L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? | Oct 3, 2024 | 8kDocument Summarization | CodeCode Available | 1 |
| HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models | Sep 24, 2024 | Long-Context UnderstandingText Generation | CodeCode Available | 2 |
| Enhancing Scientific Reproducibility Through Automated BioCompute Object Creation Using Retrieval-Augmented Generation from Publications | Sep 23, 2024 | HallucinationLong-Context Understanding | —Unverified | 0 |
| Retrieval Or Holistic Understanding? Dolce: Differentiate Our Long Context Evaluation Tasks | Sep 10, 2024 | Long-Context UnderstandingRetrieval | —Unverified | 0 |
| E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning | Sep 10, 2024 | Code GenerationDecoder | —Unverified | 0 |
| Large Language Models as Efficient Reward Function Searchers for Custom-Environment Multi-Objective Reinforcement Learning | Sep 4, 2024 | Long-Context UnderstandingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities | Jul 19, 2024 | 4k8k | —Unverified | 0 |
| Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise | Jul 16, 2024 | DiagnosticLong-Context Understanding | —Unverified | 0 |
| Mixture of In-Context Experts Enhance LLMs' Long Context Awareness | Jun 28, 2024 | Long-Context Understanding | CodeCode Available | 1 |
| Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Jun 25, 2024 | BenchmarkingLong-Context Understanding | CodeCode Available | 2 |
| Anomaly Detection of Tabular Data Using LLMs | Jun 24, 2024 | Anomaly DetectionLong-Context Understanding | —Unverified | 0 |
| MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression | Jun 21, 2024 | GPULanguage Modeling | CodeCode Available | 2 |