| ASOD60K: An Audio-Induced Salient Object Detection Dataset for Panoramic Videos | Jul 24, 2021 | 4k, Object | Code Available | 1 | 5 |
| End-to-End Speech Recognition from Federated Acoustic Models | Apr 29, 2021 | 2k, 4k | Code Available | 1 | 5 |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input | Apr 13, 2023 | 2k, 4k | Code Available | 1 | 5 |
| m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Mar 17, 2024 | 4k | Code Available | 1 | 5 |
| Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search | Feb 12, 2024 | 4k, Conversational Search | Code Available | 1 | 5 |
| Pyramid Grafting Network for One-Stage High Resolution Saliency Detection | Apr 11, 2022 | 4k, 8k | Code Available | 1 | 5 |
| Capturing and Inferring Dense Full-Body Human-Scene Contact | Jun 20, 2022 | 4k, Contact Detection | Code Available | 1 | 5 |
| Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring | Jan 1, 2021 | 4k, Deblurring | Code Available | 1 | 5 |
| COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences | Jun 2, 2021 | 4k, Sentence | Code Available | 1 | 5 |
| Form-NLU: Dataset for the Form Natural Language Understanding | Apr 4, 2023 | 4k, Form | Code Available | 1 | 5 |