| MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent | Jul 3, 2025 | 8k | —Unverified | 0 |
| UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | Jun 16, 2025 | 4k8k | —Unverified | 0 |
| Through the Valley: Path to Effective Long CoT Training for Small Language Models | Jun 9, 2025 | 8kReinforcement Learning (RL) | —Unverified | 0 |
| InterRVOS: Interaction-aware Referring Video Object Segmentation | Jun 3, 2025 | 8kObject | —Unverified | 0 |
| SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis | Jun 2, 2025 | 8kMath | —Unverified | 0 |
| LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification | Jun 2, 2025 | 8k | —Unverified | 0 |
| Efficient Neural and Numerical Methods for High-Quality Online Speech Spectrogram Inversion via Gradient Theorem | May 30, 2025 | 8k | —Unverified | 0 |
| LoLA: Low-Rank Linear Attention With Sparse Caching | May 29, 2025 | 4k8k | —Unverified | 0 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8kAvg | CodeCode Available | 1 |
| Efficient Correlation Volume Sampling for Ultra-High-Resolution Optical Flow Estimation | May 22, 2025 | 8kOptical Flow Estimation | —Unverified | 0 |
| UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | May 20, 2025 | 4k8k | —Unverified | 0 |
| MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly | May 15, 2025 | 8kBenchmarking | CodeCode Available | 2 |
| Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | May 13, 2025 | 16k8k | CodeCode Available | 0 |
| ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis | May 8, 2025 | 8kData Augmentation | —Unverified | 0 |
| Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation | Apr 26, 2025 | 8kPosition | —Unverified | 0 |
| KeyDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments | Apr 21, 2025 | 8k | —Unverified | 0 |
| FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction | Apr 8, 2025 | 8kData Augmentation | CodeCode Available | 0 |
| Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts | Apr 7, 2025 | 8k | —Unverified | 0 |
| Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables | Mar 31, 2025 | 8kComputational Efficiency | —Unverified | 0 |
| Visual Acuity Consistent Foveated Rendering towards Retinal Resolution | Mar 30, 2025 | 8k | —Unverified | 0 |
| XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation | Mar 29, 2025 | 8kSynthetic Data Generation | —Unverified | 0 |
| ESSR: An 8K@30FPS Super-Resolution Accelerator With Edge Selective Network | Mar 26, 2025 | 8kSuper-Resolution | —Unverified | 0 |
| Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding | Mar 24, 2025 | 8kGPU | —Unverified | 0 |
| KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Mar 21, 2025 | 16k4k | CodeCode Available | 0 |
| SkyLadder: Better and Faster Pretraining via Context Window Scheduling | Mar 19, 2025 | 8kScheduling | CodeCode Available | 1 |
| DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework | Mar 19, 2025 | 8kAction Recognition | CodeCode Available | 4 |
| Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack | Mar 18, 2025 | 8kBenchmarking | —Unverified | 0 |
| One ruler to measure them all: Benchmarking multilingual long-context language models | Mar 3, 2025 | 8kAll | CodeCode Available | 1 |
| Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation | Feb 26, 2025 | 16k2k | —Unverified | 0 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8kGPU | CodeCode Available | 4 |
| Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression | Feb 20, 2025 | 8k | —Unverified | 0 |
| ParallelComp: Parallel Long-Context Compressor for Length Extrapolation | Feb 20, 2025 | 4k8k | —Unverified | 0 |
| CopySpec: Accelerating LLMs with Speculative Copy-and-Paste Without Compromising Quality | Feb 13, 2025 | 8kGPU | CodeCode Available | 0 |
| GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity? | Feb 7, 2025 | 8kInformation Retrieval | CodeCode Available | 2 |
| BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics | Jan 31, 2025 | 8kImage Generation | —Unverified | 0 |
| State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence | Jan 30, 2025 | 8kARC | —Unverified | 0 |
| Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration | Jan 27, 2025 | 4k8k | —Unverified | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Jan 9, 2025 | 2k8k | —Unverified | 0 |
| Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture | Jan 1, 2025 | 3D Face Animation8k | —Unverified | 0 |
| CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Dec 20, 2024 | 8kGPU | CodeCode Available | 3 |
| LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks | Dec 19, 2024 | 8kIn-Context Learning | CodeCode Available | 5 |
| FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion | Dec 12, 2024 | 8k | —Unverified | 0 |
| Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Dec 12, 2024 | 4k8k | CodeCode Available | 1 |
| ICPR 2024 Competition on Multilingual Claim-Span Identification | Nov 29, 2024 | 8kBinary Classification | CodeCode Available | 0 |
| TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension | Nov 29, 2024 | 8kQuestion Answering | CodeCode Available | 0 |
| Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding | Nov 27, 2024 | 8k | CodeCode Available | 1 |
| Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Nov 18, 2024 | 2k4k | CodeCode Available | 0 |
| Understanding Chain-of-Thought in LLMs through Information Theory | Nov 18, 2024 | 8k | —Unverified | 0 |
| Scaling Mesh Generation via Compressive Tokenization | Nov 11, 2024 | 8k | CodeCode Available | 0 |
| Fox-1 Technical Report | Nov 8, 2024 | 2k8k | —Unverified | 0 |