SOTAVerified

4k

Papers

Showing 1–50 of 367 papers

Title | Status | Hype
Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation | Code | 0
4KAgent: Agentic Any Image to 4K Super-Resolution | - | 0
AUTOMATIC ROOM LIGHT CONTROLLER MANAGEMENT SYSTEM. | - | 0
UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | - | 0
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation | Code | 7
TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding | - | 0
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning | Code | 2
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations | Code | 1
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Code | 3
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking | Code | 0
Latent Wavelet Diffusion: Enabling 4K Image Synthesis for Free | - | 0
Control-R: Towards controllable test-time scaling | - | 0
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | Code | 1
LoLA: Low-Rank Linear Attention With Sparse Caching | - | 0
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention | Code | 1
QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization | - | 0
VeriFastScore: Speeding up long-form factuality evaluation | Code | 0
UNCLE: Uncertainty Expressions in Long-Form Generation | - | 0
Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL | - | 0
UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | - | 0
Analog Foundation Models | Code | 1
Leveraging Vision-Language Models for Visual Grounding and Analysis of Automotive UI | Code | 0
TeGA: Texture Space Gaussian Avatars for High-Resolution Dynamic Head Modeling | - | 0
EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices | - | 0
Learning Adaptive Parallel Reasoning with Language Models | Code | 2
Distribution-aware Dataset Distillation for Efficient Image Restoration | - | 0
Exploring Generalizable Pre-training for Real-world Change Detection via Geometric Estimation | - | 0
WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs | - | 0
Evaluation of the phi-3-mini SLM for identification of texts related to medicine, health, and sports injuries | - | 0
LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset | - | 0
Scaling Vision Pre-Training to 4K Resolution | Code | 7
Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings | Code | 2
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models | Code | 3
MaSS13K: A Matting-level Semantic Segmentation Benchmark | Code | 2
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Code | 0
iFlame: Interleaving Full and Linear Attention for Efficient Mesh Generation | - | 0
Ultra-Resolution Adaptation with Ease | Code | 2
GAEA: A Geolocation Aware Conversational Model | - | 0
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding | Code | 2
Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone Images | Code | 1
Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation | - | 0
Heterogeneous Multi-Agent Bandits with Parsimonious Hints | - | 0
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation | - | 0
CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification | - | 0
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Code | 3
Claim Extraction for Fact-Checking: Data, Models, and Automated Metrics | - | 0
Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration | - | 0
From Informal to Formal -- Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs | - | 0
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing | Code | 2
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Code | 2

No leaderboard results yet.