| Protein-Ligand Affinity Prediction | Medical & Scientific | 8 | 10 |
| Open Intent Discovery Open intent discovery aims to leverage limited prior knowled… | Language & Reasoning | 7 | 10 |
| Video-to-image Affordance Grounding Given a demonstration video V and a target image I, the goal… | Computer Vision | 4 | 10 |
| Adversarial Attack An Adversarial Attack is a technique to find a perturbation … | Foundations & Efficiency | 1,808 | 9 |
| Facial Expression Recognition | Computer Vision | 532 | 9 |
| Gait Recognition | Computer Vision | 220 | 9 |
| Video Grounding Video grounding is the task of linking spoken language descr… | Multimodal & Vision-Language | 114 | 9 |
| Road Segmentation Road Segmentation is a pixel-wise binary classification in o… | Computer Vision | 82 | 9 |
| Multi-Object Tracking and Segmentation Multiple object tracking and segmentation requires detecting… | Computer Vision | 28 | 9 |
| Photo Retouching | Generative Models | 27 | 9 |
| Music Auto-Tagging | Audio & Speech | 22 | 9 |
| Audio Quality Assessment Computational audio quality assessment aims to predict the q… | Audio & Speech | 15 | 9 |
| 3D Interacting Hand Pose Estimation | Computer Vision | 13 | 9 |
| Unconditional Molecule Generation This task evaluates the ability of generative models to samp… | Medical & Scientific | 8 | 9 |
| Prediction Of Occupancy Grid Maps | Foundations & Efficiency | 7 | 9 |
| 3D Point Cloud Interpolation Point cloud interpolation is a fundamental problem for 3D co… | Computer Vision | 6 | 9 |
| Spatial Relation Recognition | Computer Vision | 5 | 9 |
| AMR Graph Similarity | Graphs & Structured Data | 3 | 9 |
| De novo molecule generation from MS/MS spectrum (bonus chemical formulae) | Medical & Scientific | 3 | 9 |
| Partial Point Cloud Matching | Computer Vision | 1 | 9 |
| Ethics | Language & Reasoning | 832 | 8 |
| Scene Classification Scene Classification is a task in which scenes from photogra… | Computer Vision | 453 | 8 |
| Unsupervised Pre-training Pre-training a neural network using unsupervised (self-super… | Foundations & Efficiency | 265 | 8 |
| Rolling Shutter Correction | Generative Models | 216 | 8 |
| Vulnerability Detection Vulnerability detection plays a crucial role in safeguarding… | Computer Vision | 216 | 8 |
| Program Repair Task of teaching ML models to modify an existing program to … | Language & Reasoning | 132 | 8 |
| Brain Decoding Motor Brain Decoding is a fundamental task for building motor … | Medical & Scientific | 118 | 8 |
| Camera Localization | Computer Vision | 116 | 8 |
| Automated Essay Scoring Automated Essay Scoring is the task of assign… | Language & Reasoning | 104 | 8 |
| Data-free Knowledge Distillation | Foundations & Efficiency | 75 | 8 |
| Recipe Generation Food recipe generation from ingredients, title and/or food i… | Language & Reasoning | 40 | 8 |
| Lung Nodule Classification | Medical & Scientific | 35 | 8 |
| Length-of-Stay prediction | Foundations & Efficiency | 28 | 8 |
| Respiratory Failure Continuous prediction of onset of respiratory failure in the… | Medical & Scientific | 26 | 8 |
| CCG Supertagging Combinatory Categorial Grammar (CCG; [Steedman, 2000](http:… | Language & Reasoning | 24 | 8 |
| Space-time Video Super-resolution | Generative Models | 24 | 8 |
| Accented Speech Recognition | Audio & Speech | 20 | 8 |
| Part-aware Panoptic Segmentation Panoptic segmentation with part-aware predictions. | Computer Vision | 8 | 8 |
| Seeing Beyond the Visible The objective of this challenge is to automate the process o… | Computer Vision | 8 | 8 |
| Cross-Lingual Bitext Mining Cross-lingual bitext mining is the task of mining sentence p… | Language & Reasoning | 6 | 8 |
| Document Text Classification | Language & Reasoning | 6 | 8 |
| Circulatory Failure Continuous prediction of onset of circulatory failure in the… | Medical & Scientific | 5 | 8 |
| Phrase Tagging A fine-grained task that aims to find all occurrences of phr… | Language & Reasoning | 3 | 8 |
| Recognizing Emotion Cause in Conversations Given an utterance U, labeled with emotion E, the task is to… | Language & Reasoning | 3 | 8 |
| Dynamic Point Removal In the field of robotics, the point cloud has become an esse… | Computer Vision | 2 | 8 |
| Molecule retrieval from MS/MS spectrum | Medical & Scientific | 2 | 8 |
| Molecule retrieval from MS/MS spectrum (bonus chemical formulae) | Medical & Scientific | 2 | 8 |
| Concept-based Classification Image classification using human-interpretable concepts | Foundations & Efficiency | 1 | 8 |
| lidar absolute pose regression | Foundations & Efficiency | 1 | 8 |
| TinyQA Benchmark++ Ultra-lightweight evaluation suite and Python package design… | Multimodal & Vision-Language | 1 | 8 |