| Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation | Jul 11, 2025 | 4k, Emotion Recognition | Code Available | 0 |
| 4KAgent: Agentic Any Image to 4K Super-Resolution | Jul 9, 2025 | 4k, Image Quality Assessment | Unverified | 0 |
| AUTOMATIC ROOM LIGHT CONTROLLER MANAGEMENT SYSTEM. | Jun 25, 2025 | 4k, CPU | Unverified | 0 |
| UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | Jun 16, 2025 | 4k, 8k | Unverified | 0 |
| ComfyUI-R1: Exploring Reasoning Models for Workflow Generation | Jun 11, 2025 | 4k | Code Available | 7 |
| TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding | Jun 11, 2025 | 4k, Language Modeling | Unverified | 0 |
| SeerAttention-R: Sparse Attention Adaptation for Long Reasoning | Jun 10, 2025 | 4k, GPU | Code Available | 2 |
| Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations | Jun 5, 2025 | 4k, Spatial Reasoning | Code Available | 1 |
| Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Jun 2, 2025 | 4k, Descriptive | Code Available | 3 |
| GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking | Jun 1, 2025 | 4k, Math | Code Available | 0 |
| Latent Wavelet Diffusion: Enabling 4K Image Synthesis for Free | May 31, 2025 | 2k, 4k | Unverified | 0 |
| Control-R: Towards controllable test-time scaling | May 30, 2025 | 4k | Unverified | 0 |
| LoLA: Low-Rank Linear Attention With Sparse Caching | May 29, 2025 | 4k, 8k | Unverified | 0 |
| Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | May 29, 2025 | 2k, 4k | Code Available | 1 |
| MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention | May 24, 2025 | 16k, 4k | Code Available | 1 |
| QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization | May 23, 2025 | 4k, Language Modeling | Unverified | 0 |
| VeriFastScore: Speeding up long-form factuality evaluation | May 22, 2025 | 4k, Form | Code Available | 0 |
| UNCLE: Uncertainty Expressions in Long-Form Generation | May 22, 2025 | 4k, Form | Unverified | 0 |
| Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL | May 21, 2025 | 4k, Multimodal Reasoning | Unverified | 0 |
| UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | May 20, 2025 | 4k, 8k | Unverified | 0 |
| Analog Foundation Models | May 14, 2025 | 4k, Quantization | Code Available | 1 |
| Leveraging Vision-Language Models for Visual Grounding and Analysis of Automotive UI | May 9, 2025 | 4k, Domain Generalization | Code Available | 0 |
| TeGA: Texture Space Gaussian Avatars for High-Resolution Dynamic Head Modeling | May 8, 2025 | 4k, Motion Estimation | Unverified | 0 |
| EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices | May 5, 2025 | 4k, Language Modeling | Unverified | 0 |
| Learning Adaptive Parallel Reasoning with Language Models | Apr 21, 2025 | 4k | Code Available | 2 |