| High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network | May 19, 2021 | 4kAttribute | CodeCode Available | 1 |
| Joint Bilateral Learning for Real-time Universal Photorealistic Style Transfer | Apr 23, 2020 | 4kStyle Transfer | CodeCode Available | 1 |
| MAILEX: Email Event and Argument Extraction | May 22, 2023 | 4k8k | CodeCode Available | 1 |
| Memory-Efficient Optical Flow via Radius-Distribution Orthogonal Cost Volume | Dec 6, 2023 | 4kOptical Flow Estimation | CodeCode Available | 1 |
| Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach | May 18, 2021 | 4kMotion Estimation | —Unverified | 0 |
| Fewshot learning on global multimodal embeddings for earth observation tasks | Sep 29, 2023 | 4kEarth Observation | —Unverified | 0 |
| Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis | May 14, 2024 | 4kGPU | —Unverified | 0 |
| FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos | Mar 17, 2022 | 4kFacial Expression Recognition | —Unverified | 0 |
| Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL | May 21, 2025 | 4kMultimodal Reasoning | —Unverified | 0 |
| A Novel Computational and Modeling Foundation for Automatic Coherence Assessment | Oct 1, 2023 | 4kLong Form Question Answering | —Unverified | 0 |