| MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent | Jul 3, 2025 | 8k | —Unverified | 0 |
| UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | Jun 16, 2025 | 4k8k | —Unverified | 0 |
| Through the Valley: Path to Effective Long CoT Training for Small Language Models | Jun 9, 2025 | 8kReinforcement Learning (RL) | —Unverified | 0 |
| InterRVOS: Interaction-aware Referring Video Object Segmentation | Jun 3, 2025 | 8kObject | —Unverified | 0 |
| SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis | Jun 2, 2025 | 8kMath | —Unverified | 0 |
| LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification | Jun 2, 2025 | 8k | —Unverified | 0 |
| Efficient Neural and Numerical Methods for High-Quality Online Speech Spectrogram Inversion via Gradient Theorem | May 30, 2025 | 8k | —Unverified | 0 |
| LoLA: Low-Rank Linear Attention With Sparse Caching | May 29, 2025 | 4k8k | —Unverified | 0 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8kAvg | CodeCode Available | 1 |
| Efficient Correlation Volume Sampling for Ultra-High-Resolution Optical Flow Estimation | May 22, 2025 | 8kOptical Flow Estimation | —Unverified | 0 |
| UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | May 20, 2025 | 4k8k | —Unverified | 0 |
| MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly | May 15, 2025 | 8kBenchmarking | CodeCode Available | 2 |
| Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | May 13, 2025 | 16k8k | CodeCode Available | 0 |
| ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis | May 8, 2025 | 8kData Augmentation | —Unverified | 0 |
| Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation | Apr 26, 2025 | 8kPosition | —Unverified | 0 |
| KeyDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments | Apr 21, 2025 | 8k | —Unverified | 0 |
| FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction | Apr 8, 2025 | 8kData Augmentation | CodeCode Available | 0 |
| Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts | Apr 7, 2025 | 8k | —Unverified | 0 |
| Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables | Mar 31, 2025 | 8kComputational Efficiency | —Unverified | 0 |
| Visual Acuity Consistent Foveated Rendering towards Retinal Resolution | Mar 30, 2025 | 8k | —Unverified | 0 |
| XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation | Mar 29, 2025 | 8kSynthetic Data Generation | —Unverified | 0 |
| ESSR: An 8K@30FPS Super-Resolution Accelerator With Edge Selective Network | Mar 26, 2025 | 8kSuper-Resolution | —Unverified | 0 |
| Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding | Mar 24, 2025 | 8kGPU | —Unverified | 0 |
| KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Mar 21, 2025 | 16k4k | CodeCode Available | 0 |
| SkyLadder: Better and Faster Pretraining via Context Window Scheduling | Mar 19, 2025 | 8kScheduling | CodeCode Available | 1 |