| NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context? | Jul 16, 2024 | 4k, 8k | Code Available | 9 |
| LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks | Dec 19, 2024 | 8k, In-Context Learning | Code Available | 5 |
| KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8k, GPU | Code Available | 5 |
| Learning to (Learn at Test Time): RNNs with Expressive Hidden States | Jul 5, 2024 | 16k, 8k | Code Available | 5 |
| LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models | Nov 8, 2023 | 8k, GPU | Code Available | 5 |
| StarCoder: may the source be with you! | May 9, 2023 | 8k, Code Generation | Code Available | 5 |
| DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework | Mar 19, 2025 | 8k, Action Recognition | Code Available | 4 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8k, GPU | Code Available | 4 |
| CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Dec 20, 2024 | 8k, GPU | Code Available | 3 |
| CAMixerSR: Only Details Need More "Attention" | Feb 29, 2024 | 2k, 8k | Code Available | 3 |
| LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens | Feb 21, 2024 | 8k | Code Available | 3 |
| BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model | Sep 20, 2023 | 8k, Language Modeling | Code Available | 3 |
| MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly | May 15, 2025 | 8k, Benchmarking | Code Available | 2 |
| GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity? | Feb 7, 2025 | 8k, Information Retrieval | Code Available | 2 |
| LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models | Aug 31, 2024 | 8k, GPU | Code Available | 2 |
| Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Jun 28, 2024 | 8k, Anomaly Detection | Code Available | 2 |
| Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Jun 19, 2024 | 8k, Hallucination | Code Available | 2 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k, 8k | Code Available | 2 |
| Self-Supervised Visual Preference Alignment | Apr 16, 2024 | 8k, MM-Vet | Code Available | 2 |
| Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Dec 28, 2023 | 8k, Feature Splatting | Code Available | 2 |
| Transformer-VQ: Linear-Time Transformers via Vector Quantization | Sep 28, 2023 | 8k, Decoder | Code Available | 2 |
| XGen-7B Technical Report | Sep 7, 2023 | 2k, 8k | Code Available | 2 |
| AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three Weeks | May 16, 2023 | 8k, Active Learning | Code Available | 2 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Feb 21, 2023 | 2k, 8k | Code Available | 2 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | Dec 28, 2022 | 8k, Coreference Resolution | Code Available | 2 |
| Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method | Dec 22, 2022 | 4k, 8k | Code Available | 2 |
| SoccerTrack: A Dataset and Tracking Algorithm for Soccer With Fish-Eye and Drone Videos | Jun 20, 2022 | 4k, 8k | Code Available | 2 |
| CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model | Mar 3, 2020 | 8k, Language Modeling | Code Available | 2 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8k, Avg | Code Available | 1 |
| SkyLadder: Better and Faster Pretraining via Context Window Scheduling | Mar 19, 2025 | 8k, Scheduling | Code Available | 1 |
| One ruler to measure them all: Benchmarking multilingual long-context language models | Mar 3, 2025 | 8k, All | Code Available | 1 |
| Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Dec 12, 2024 | 4k, 8k | Code Available | 1 |
| Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding | Nov 27, 2024 | 8k | Code Available | 1 |
| C^2: Scalable Auto-Feedback for LLM-based Chart Generation | Oct 24, 2024 | 8k, Diversity | Code Available | 1 |
| Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning | Oct 16, 2024 | 8k | Code Available | 1 |
| AuthFace: Towards Authentic Blind Face Restoration with Face-oriented Generative Diffusion Prior | Oct 13, 2024 | 8k, Blind Face Restoration | Code Available | 1 |
| L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? | Oct 3, 2024 | 8k, Document Summarization | Code Available | 1 |
| PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization | Sep 25, 2024 | 8k, Domain Adaptation | Code Available | 1 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k, 4k | Code Available | 1 |
| FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Aug 21, 2024 | 8k, Decoder | Code Available | 1 |
| SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models | Aug 21, 2024 | 8k, GSM8K | Code Available | 1 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | May 21, 2024 | 2k, 8k | Code Available | 1 |
| Fast Kernel Scene Flow | Mar 9, 2024 | 8k, Autonomous Driving | Code Available | 1 |
| Referring Expression Counting | Jan 1, 2024 | 8k, object-detection | Code Available | 1 |
| 4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters | Nov 15, 2023 | 4k, 8k | Code Available | 1 |
| A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture | Oct 30, 2023 | 8k, Object | Code Available | 1 |
| M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models | Oct 30, 2023 | 8k, Semantic Retrieval | Code Available | 1 |
| Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning | Aug 18, 2023 | 8k, Position | Code Available | 1 |
| Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection | Aug 7, 2023 | 2k, 8k | Code Available | 1 |
| VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation | Jul 28, 2023 | 3D Generation, 8k | Code Available | 1 |