| Hope Speech Detection Detecting speech associated with positive, uplifting, promis… | Language & Reasoning | 47 | 3 |
| Intent Discovery Given a set of labelled and unlabelled utterances, the idea … | Language & Reasoning | 42 | 3 |
| Rotated MNIST | Foundations & Efficiency | 41 | 3 |
| 3D Face Modelling | Generative Models | 31 | 3 |
| Medical Procedure Predicting medical procedures performed during a hospital ad… | Medical & Scientific | 23 | 3 |
| Audio Denoising | Audio & Speech | 20 | 3 |
| 3D Depth Estimation Image: [monodepth2](https://github.com/nianticlabs/monodepth… | Computer Vision | 19 | 3 |
| Medical Image Enhancement Aims to improve the perceptual quality of low-quality medica… | Generative Models | 18 | 3 |
| Target Sound Extraction Target Sound Extraction is the task of extracting a sound co… | Audio & Speech | 16 | 3 |
| Action Parsing Action parsing is the task of, given a video or still image,… | Language & Reasoning | 15 | 3 |
| Lung Sound Classification ICBHI-2017 dataset based lung sound classification | Medical & Scientific | 15 | 3 |
| Medical Relation Extraction Biomedical relation extraction is the task of detecting and … | Medical & Scientific | 15 | 3 |
| Polyphone disambiguation A part of the TTS-front end framework which serves to predic… | Language & Reasoning | 15 | 3 |
| 2D Panoptic Segmentation | Computer Vision | 9 | 3 |
| Generative Visual Question Answering Generating answers in free form to questions posed about ima… | Multimodal & Vision-Language | 9 | 3 |
| Japanese Word Segmentation | Computer Vision | 7 | 3 |
| Handwriting Verification The goal of handwriting verification is to find a measure of… | Computer Vision | 6 | 3 |
| ECG Digitization Digitize print-outs of ECGs. | Medical & Scientific | 5 | 3 |
| Eyeblink detection Localize eyeblinks in videos. | Computer Vision | 5 | 3 |
| Instance Shadow Detection | Computer Vision | 5 | 3 |
| Pain Intensity Regression | Foundations & Efficiency | 5 | 3 |
| Text2text Generation | Language & Reasoning | 5 | 3 |
| Attribute Mining The attribute mining task assumes an open-world setting in w… | Language & Reasoning | 4 | 3 |
| Face Quality Assessement | Computer Vision | 4 | 3 |
| Music Question Answering | Audio & Speech | 4 | 3 |
| Supervised Image Retrieval | Multimodal & Vision-Language | 4 | 3 |
| Acoustic Novelty Detection Detect novel events given acoustic signals, either in domest… | Audio & Speech | 3 | 3 |
| Auto Debugging | Language & Reasoning | 3 | 3 |
| Human-Object Interaction Anticipation Human-Object Interaction (HOI) Anticipation is the task of p… | Computer Vision | 3 | 3 |
| Human-Object Interaction Concept Discovery Discovering the reasonable HOI concepts/categories from know… | Computer Vision | 3 | 3 |
| Surgical Skills Evaluation The task is to classify surgical skills using data that is r… | Medical & Scientific | 3 | 3 |
| Training-free 3D Part Segmentation Evaluation on target datasets for 3D Part Segmentation witho… | Computer Vision | 3 | 3 |
| Unconditional Crystal Generation This task evaluates the ability of generative models to samp… | Medical & Scientific | 3 | 3 |
| Blink estimation | Computer Vision | 2 | 3 |
| Calving Front Delineation In Synthetic Aperture Radar Imagery Delineating the calving front of a marine-terminating glacie… | Computer Vision | 2 | 3 |
| Czech Text Diacritization Addition of diacritics for undiacritized Czech Wikipedia tex… | Language & Reasoning | 2 | 3 |
| De novo molecule generation from MS/MS spectrum | Medical & Scientific | 2 | 3 |
| Human Judgment Classification A task where an algorithm judges which sample is better in a… | Foundations & Efficiency | 2 | 3 |
| Negation and Speculation Cue Detection | Computer Vision | 2 | 3 |
| Open Vocabulary Action Detection | Computer Vision | 2 | 3 |
| self-supervised scene text recognition scene text recognition task self-supervised scene text recog… | Computer Vision | 2 | 3 |
| Semi Supervised Learning for Image Captioning | Multimodal & Vision-Language | 2 | 3 |
| Vietnamese Text Diacritization Addition of diacritics for undiacritized Vietnamese Wikipedi… | Language & Reasoning | 2 | 3 |
| 3D Canonical Hand Pose Estimation Image: [Lin et al](https://arxiv.org/pdf/2006.01320v1.pdf) | Computer Vision | 1 | 3 |
| IFC Entity Classification | Language & Reasoning | 1 | 3 |
| Decision Making | Foundations & Efficiency | 12,311 | 2 |
| BIG-bench Machine Learning This branch include most common machine learning fundamental… | Language & Reasoning | 10,033 | 2 |
| Benchmarking | Foundations & Efficiency | 5,548 | 2 |
| Autonomous Vehicles Autonomous vehicles is the task of making a vehicle that can… | Reinforcement Learning & Robotics | 2,605 | 2 |
| Bayesian Inference Bayesian Inference is a methodology that employs Bayes Rule … | Foundations & Efficiency | 2,226 | 2 |