| Meticulous Object Segmentation | Dec 13, 2020 | 2k, 4k | Code Available | 1 | 5 |
| m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Mar 17, 2024 | 4k | Code Available | 1 | 5 |
| High-Resolution Optical Flow from 1D Attention and Correlation | Apr 28, 2021 | 4k, Optical Flow Estimation | Code Available | 1 | 5 |
| Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Nov 4, 2024 | 4k, geo-localization | Code Available | 1 | 5 |
| Form-NLU: Dataset for the Form Natural Language Understanding | Apr 4, 2023 | 4k, Form | Code Available | 1 | 5 |
| Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone Images | Mar 10, 2025 | 4k, Benchmarking | Code Available | 1 | 5 |
| MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Aug 15, 2024 | 4k, Computational Efficiency | Code Available | 1 | 5 |
| High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network | May 19, 2021 | 4k, Attribute | Code Available | 1 | 5 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k, 4k | Code Available | 1 | 5 |
| End-to-End Speech Recognition from Federated Acoustic Models | Apr 29, 2021 | 2k, 4k | Code Available | 1 | 5 |