| Visual Commonsense Tests: Predict 5 property types (color, shape, material, size, and … | Multimodal & Vision-Language | 2 | 0 |
| Vocabulary-free Image Classification: Recent advances in large vision-language models have revolut… | Computer Vision | 2 | 0 |
| Vocal ensemble separation | Audio & Speech | 2 | 0 |
| Weakly Supervised Action Segmentation (Action Set): Learning an action segmentation model while the only availab… | Computer Vision | 2 | 0 |
| White Matter Fiber Tractography | Graphs & Structured Data | 2 | 0 |
| Wildly Unsupervised Domain Adaptation: Transferring knowledge from a noisy source domain to unlabel… | Computer Vision | 2 | 0 |
| Yield Mapping In Apple Orchards | Medical & Scientific | 2 | 0 |
| Zero-Shot Action Recognition on HMDB51 | Computer Vision | 2 | 0 |
| Zero-shot Cross-Lingual Document Classification | Language & Reasoning | 2 | 0 |
| Zero-Shot Cross-Lingual Visual Question Answering | Multimodal & Vision-Language | 2 | 0 |
| Zero-Shot Cross-Lingual Visual Reasoning | Multimodal & Vision-Language | 2 | 0 |
| Zero-Shot Facial Expression Recognition | Computer Vision | 2 | 0 |
| Zero-shot Long Video Global-Mode Question Answering | Multimodal & Vision-Language | 2 | 0 |
| Zero-shot Moment Retrieval | Recommendation & Retrieval | 2 | 0 |
| Zero-shot Scene Classification (unified classes) | Computer Vision | 2 | 0 |
| Zero-Shot Transfer Image Classification (CN) | Computer Vision | 2 | 0 |
| Zero-shot Video Question Answering | Multimodal & Vision-Language | 2 | 0 |
| 2-task Classification | Foundations & Efficiency | 1 | 0 |
| 3D Building Mesh Labeling | Computer Vision | 1 | 0 |
| 3D Human Pose Estimation in Limited Data | Computer Vision | 1 | 0 |
| 3D Human Pose Estimation in Limited Date | Computer Vision | 1 | 0 |
| 3D Noise Generation: 3D Noise Generation involves creating deviations on an origi… | Computer Vision | 1 | 0 |
| 3D Object Detection on View-of-Delft (val) | Computer Vision | 1 | 0 |
| 3D Object Detection (RoI) on View-of-Delft (val) | Computer Vision | 1 | 0 |
| 3D Point Cloud Part Segmentation: 3D point cloud part segmentation on datasets like ShapeNet P… | Computer Vision | 1 | 0 |
| 3D Prostate Segmentation | Computer Vision | 1 | 0 |
| 3D Source-Free Domain Adaptation: Source-free domain adaptation for 3D data. | Computer Vision | 1 | 0 |
| 3D Video Frame Interpolation | Computer Vision | 1 | 0 |
| 6D Vision | Computer Vision | 1 | 0 |
| Abbreviation Detection | Computer Vision | 1 | 0 |
| Actionable Phrase Detection | Computer Vision | 1 | 0 |
| Action Classification (zero-shot) | Computer Vision | 1 | 0 |
| Action Quality Assessment Report Generation: Generate full length action quality assessment reports conta… | Computer Vision | 1 | 0 |
| Activeness Detection: Determining activeness via images | Computer Vision | 1 | 0 |
| Active Observation Completion | Reinforcement Learning & Robotics | 1 | 0 |
| Add - PO | Foundations & Efficiency | 1 | 0 |
| Add - PQ | Foundations & Efficiency | 1 | 0 |
| Adroit door-cloned | Reinforcement Learning & Robotics | 1 | 0 |
| Adroit door-human | Reinforcement Learning & Robotics | 1 | 0 |
| Adroit hammer-cloned | Reinforcement Learning & Robotics | 1 | 0 |
| Adroit hammer-human | Reinforcement Learning & Robotics | 1 | 0 |
| Adroit pen-cloned | Reinforcement Learning & Robotics | 1 | 0 |
| Adroit pen-human | Reinforcement Learning & Robotics | 1 | 0 |
| Adroit relocate-cloned | Reinforcement Learning & Robotics | 1 | 0 |
| Adroit relocate-human | Reinforcement Learning & Robotics | 1 | 0 |
| Adversarial Defense against AutoAttack | Foundations & Efficiency | 1 | 0 |
| Adversarial Defense against PGD | Foundations & Efficiency | 1 | 0 |
| Aerial Video Saliency Prediction | Computer Vision | 1 | 0 |
| Aerial Video Semantic Segmentation | Computer Vision | 1 | 0 |
| Agent-based model inverse problem | Reinforcement Learning & Robotics | 1 | 0 |