SOTAVerified

4k

Papers

Showing 51–100 of 367 papers

| Title | Status | Hype |
| --- | --- | --- |
| LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Code | 3 |
| Knowledge Distillation with Adapted Weight | | 0 |
| PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution | | 0 |
| "ScatSpotter" 2024 -- A Distributed Dog Poop Detection Dataset | | 0 |
| Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation | Code | 5 |
| Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields | | 0 |
| Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures | | 0 |
| Block-Based Multi-Scale Image Rescaling | | 0 |
| PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Code | 3 |
| Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Code | 1 |
| Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Code | 1 |
| RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | | 0 |
| RadPhi-3: Small Language Models for Radiology | | 0 |
| Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Code | 0 |
| Additional Tests for TV 3.0 | | 0 |
| TSFormer: A Robust Framework for Efficient UHD Image Restoration | | 0 |
| Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models | | 0 |
| Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Code | 1 |
| MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model | | 0 |
| Bias Similarity Across Large Language Models | | 0 |
| Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | Code | 7 |
| A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts | | 0 |
| On The Adaptation of Unlimiformer for Decoder-Only Transformers | | 0 |
| Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | | 0 |
| AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content | | 0 |
| AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Code | 1 |
| On the Effectiveness of LLMs for Manual Test Verifications | | 0 |
| Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models | | 0 |
| USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s | | 0 |
| Hybrid Cost Volume for Memory-Efficient Optical Flow | Code | 1 |
| HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Code | 1 |
| CyberCortex.AI: An AI-based Operating System for Autonomous Robotics and Complex Automation | | 0 |
| Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency | Code | 1 |
| MemLong: Memory-Augmented Retrieval for Long Text Modeling | Code | 2 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Code | 1 |
| Advanced atom-level representations for protein flexibility prediction utilizing graph neural networks | | 0 |
| Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections | | 0 |
| MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Code | 1 |
| What should I wear to a party in a Greek taverna? Evaluation for Conversational Agents in the Fashion Domain | | 0 |
| A complete characterization of pairs of binary phylogenetic trees with identical A_k-alignments | | 0 |
| Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method | | 0 |
| PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network | | 0 |
| Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy | | 0 |
| ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities | | 0 |
| Scaling Granite Code Models to 128K Context | Code | 4 |
| NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context? | Code | 9 |
| Uncovering Semantics and Topics Utilized by Threat Actors to Deliver Malicious Attachments and URLs | | 0 |
| HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization | | 0 |
| Meta 3D TextureGen: Fast and Consistent Texture Generation for 3D Objects | | 0 |
| VFIMamba: Video Frame Interpolation with State Space Models | Code | 2 |
Page 2 of 8

No leaderboard results yet.