| LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Jan 10, 2025 | 4kVisual Reasoning | CodeCode Available | 3 |
| Knowledge Distillation with Adapted Weight | Jan 6, 2025 | 4kFairness | —Unverified | 0 |
| PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution | Jan 1, 2025 | 4kSuper-Resolution | —Unverified | 0 |
| "ScatSpotter" 2024 -- A Distributed Dog Poop Detection Dataset | Dec 21, 2024 | 4k | —Unverified | 0 |
| Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields | Dec 18, 2024 | 3DGS3D Reconstruction | —Unverified | 0 |
| Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation | Dec 18, 2024 | 3D Reconstruction4k | CodeCode Available | 5 |
| Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures | Dec 17, 2024 | 4k | —Unverified | 0 |
| Block-Based Multi-Scale Image Rescaling | Dec 16, 2024 | 2k4k | —Unverified | 0 |
| PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Dec 16, 2024 | 3D Reconstruction4k | CodeCode Available | 3 |
| Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Dec 12, 2024 | 4kGSM8K | CodeCode Available | 1 |
| Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Dec 12, 2024 | 4k8k | CodeCode Available | 1 |
| RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Nov 20, 2024 | 4kKnowledge Distillation | —Unverified | 0 |
| RadPhi-3: Small Language Models for Radiology | Nov 19, 2024 | 4kLanguage Modeling | —Unverified | 0 |
| Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Nov 18, 2024 | 2k4k | CodeCode Available | 0 |
| Additional Tests for TV 3.0 | Nov 18, 2024 | 4k | —Unverified | 0 |
| TSFormer: A Robust Framework for Efficient UHD Image Restoration | Nov 17, 2024 | 4kComputational Efficiency | —Unverified | 0 |
| Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models | Nov 11, 2024 | 4kImage Generation | —Unverified | 0 |
| Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Nov 4, 2024 | 4kgeo-localization | CodeCode Available | 1 |
| MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model | Oct 22, 2024 | 4k8k | —Unverified | 0 |
| Bias Similarity Across Large Language Models | Oct 15, 2024 | 4kFairness | —Unverified | 0 |
| Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | Oct 10, 2024 | 4kImage Animation | CodeCode Available | 7 |
| A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts | Oct 2, 2024 | 4kGPU | —Unverified | 0 |
| On The Adaptation of Unlimiformer for Decoder-Only Transformers | Oct 2, 2024 | 4k8k | —Unverified | 0 |
| Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Sep 26, 2024 | 2k4k | —Unverified | 0 |
| AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content | Sep 25, 2024 | 4kSuper-Resolution | —Unverified | 0 |