| Multi-Armed Bandits Multi-armed bandits refer to a task where a fixed amount of … | Foundations & Efficiency | 1,262 | 2 |
| Quantum Machine Learning | Foundations & Efficiency | 699 | 2 |
| Interpretable Machine Learning The goal of Interpretable Machine Learning is to allow overs… | Foundations & Efficiency | 537 | 2 |
| POS Tagging Part of Speech Tagging | Language & Reasoning | 523 | 2 |
| Cultural Vocal Bursts Intensity Prediction to predict the intensity of 40 culture-specific emotions (10… | Foundations & Efficiency | 490 | 2 |
| Time Series Prediction The goal of Time Series Prediction is to infer the future va… | Time Series & Forecasting | 477 | 2 |
| Speaker Recognition Speaker Recognition is the process of identifying or confirm… | Audio & Speech | 435 | 2 |
| Image Manipulation Image Manipulation is the process of altering or transformin… | Generative Models | 427 | 2 |
| General Knowledge This task aims to evaluate the ability of a model to answer … | Medical & Scientific | 399 | 2 |
| Fault Diagnosis | Time Series & Forecasting | 375 | 2 |
| Cancer Classification | Medical & Scientific | 208 | 2 |
| Multi-hop Question Answering | Language & Reasoning | 202 | 2 |
| Term Extraction Term Extraction, or Automated Term Extraction (ATE), is abou… | Computer Vision | 160 | 2 |
| Conversational Question Answering | Language & Reasoning | 142 | 2 |
| Retrosynthesis Retrosynthetic analysis is a pivotal synthetic methodology i… | Medical & Scientific | 110 | 2 |
| Talking Face Generation Talking face generation aims to synthesize a sequence of fac… | Generative Models | 110 | 2 |
| Motion Detection Motion Detection is a process to detect the presence of any … | Computer Vision | 101 | 2 |
| Intent Recognition | Language & Reasoning | 93 | 2 |
| 3D Classification | Computer Vision | 79 | 2 |
| Graph Question Answering | Graphs & Structured Data | 78 | 2 |
| Cloze Test The cloze task refers to infilling individual words. | Language & Reasoning | 71 | 2 |
| Speech Denoising Obtain the clean speech of the target speaker by suppressing… | Audio & Speech | 65 | 2 |
| Attribute Extraction | Language & Reasoning | 59 | 2 |
| Seizure prediction | Medical & Scientific | 57 | 2 |
| Meeting Summarization Generating a summary from meeting transcriptions. A survey f… | Language & Reasoning | 53 | 2 |
| 3D human pose and shape estimation Estimate 3D human pose and shape (e.g. SMPL) from images | Computer Vision | 50 | 2 |
| Crop Yield Prediction | Foundations & Efficiency | 48 | 2 |
| Diabetic Retinopathy Grading Grading the severity of diabetic retinopathy from (ophthalmi… | Medical & Scientific | 44 | 2 |
| 3D Medical Imaging Segmentation 3D medical imaging segmentation is the task of segmenting me… | Medical & Scientific | 41 | 2 |
| Code Repair | Language & Reasoning | 39 | 2 |
| Weakly Supervised Classification | Foundations & Efficiency | 39 | 2 |
| One-Shot Segmentation ( Image credit: [One-Shot Learning for Semantic Segmentation… | Computer Vision | 35 | 2 |
| Landmark Recognition | Computer Vision | 33 | 2 |
| 3D Shape Modeling Image: [Gkioxari et al](https://arxiv.org/pdf/1906.02739v2.p… | Generative Models | 28 | 2 |
| audio-visual event localization | Multimodal & Vision-Language | 26 | 2 |
| Face Anonymization | Generative Models | 23 | 2 |
| Pose Retrieval Retrieval of similar human poses from images or videos | Computer Vision | 23 | 2 |
| Odd One Out This task tests to what extent a language model is able to i… | Language & Reasoning | 21 | 2 |
| Text-to-Code Generation Text-to-Code Generation is a task where we can generate code… | Language & Reasoning | 20 | 2 |
| Next-basket recommendation | Recommendation & Retrieval | 18 | 2 |
| Chinese Spell Checking Chinese Spell Checking (CSC) aims to detect and correct erro… | Language & Reasoning | 17 | 2 |
| Composed Image Retrieval (CoIR) Composed Image Retrieval (CoIR) is the task involves retriev… | Multimodal & Vision-Language | 14 | 2 |
| Infrared image super-resolution Aims at upsampling the IR image and create the high resoluti… | Generative Models | 11 | 2 |
| 3D Shape Reconstruction From A Single 2D Image Image: [Liao et al](https://arxiv.org/pdf/1811.12016v1.pdf) | Generative Models | 10 | 2 |
| Music Genre Recognition Recognizing the genre (e.g. rock, pop, jazz, etc.) of a piec… | Audio & Speech | 10 | 2 |
| Optic Cup Segmentation Optic cup segmentation, concentric with optic disc, useful f… | Medical & Scientific | 8 | 2 |
| Skill Mastery | Foundations & Efficiency | 8 | 2 |
| Summarization Summarization is the task of producing a shorter version of … | Language & Reasoning | 8 | 2 |
| Voice pathology detection | Medical & Scientific | 8 | 2 |
| Analogical Similarity | Language & Reasoning | 7 | 2 |