| Task | Area | Papers | Results |
|---|---|---|---|
| Point Clouds | Computer Vision | 22 | 22 |
| Causal Inference Causal inference is the task of drawing a conclusion about a… | Language & Reasoning | 1,722 | 21 |
| Semantic Similarity The main objective of Semantic Similarity is to measure the dis… | Foundations & Efficiency | 1,564 | 21 |
| Text-To-Speech Synthesis Text-To-Speech Synthesis is a machine learning task that inv… | Audio & Speech | 332 | 21 |
| Point Cloud Completion | Generative Models | 174 | 21 |
| 2D Pose Estimation | Computer Vision | 90 | 21 |
| Table-to-Text Generation | Language & Reasoning | 68 | 21 |
| Open-Domain Dialog | Language & Reasoning | 60 | 21 |
| Table Recognition Table recognition refers to the process of automatically ide… | Language & Reasoning | 50 | 21 |
| Ad-Hoc Information Retrieval Ad-hoc information retrieval refers to the task of returning… | Recommendation & Retrieval | 41 | 21 |
| Cover song identification Cover Song Identification is the task of identifying an alte… | Audio & Speech | 18 | 21 |
| CARLA longest6 longest6 is an evaluation benchmark for sensorimotor autonom… | Reinforcement Learning & Robotics | 15 | 21 |
| Automatic Speech Recognition (ASR) Automatic Speech Recognition (ASR) involves converting spoke… | Audio & Speech | 3,012 | 20 |
| Optical Character Recognition (OCR) Optical Character Recognition or Optical Character Reader (O… | Computer Vision | 1,209 | 20 |
| Image Quality Assessment | Computer Vision | 732 | 20 |
| Dialogue Generation Dialogue generation is the task of "understanding" natural l… | Language & Reasoning | 606 | 20 |
| Visual Grounding Visual Grounding (VG) aims to locate the most relevant objec… | Multimodal & Vision-Language | 571 | 20 |
| Medical Image Classification Medical Image Classification is a task in medical image anal… | Medical & Scientific | 424 | 20 |
| Boundary Detection Boundary Detection is a vital part of extracting information… | Computer Vision | 359 | 20 |
| Sarcasm Detection The goal of Sarcasm Detection is to determine whether a sent… | Language & Reasoning | 266 | 20 |
| Protein Structure Prediction | Medical & Scientific | 188 | 20 |
| Multimodal Emotion Recognition This is a leaderboard for multimodal emotion recognition on … | Multimodal & Vision-Language | 180 | 20 |
| Cell Segmentation Cell Segmentation is a task of splitting a microscopic image… | Medical & Scientific | 178 | 20 |
| 3D Object Reconstruction Image: [Choy et al](https://arxiv.org/pdf/1604.00449v1.pdf) | Generative Models | 153 | 20 |
| Key Information Extraction Key Information Extraction (KIE) is aimed at extracting stru… | Language & Reasoning | 74 | 20 |
| Extreme Summarization | Language & Reasoning | 22 | 20 |
| 3D Multi-Person Pose Estimation (root-relative) This task aims to solve root-relative 3D multi-person pose e… | Computer Vision | 18 | 20 |
| 3D Open-Vocabulary Instance Segmentation Open-vocabulary 3D instance segmentation is a computer visio… | Computer Vision | 14 | 20 |
| Text-based de novo Molecule Generation Text-based de novo molecule generation involves utilizing na… | Medical & Scientific | 14 | 20 |
| Personality Trait Recognition | Computer Vision | 13 | 20 |
| Event-based Object Segmentation | Computer Vision | 8 | 20 |
| MuJoCo Games | Reinforcement Learning & Robotics | 8 | 20 |
| Video-Adverb Retrieval The bidirectional video-adverb retrieval task aims at retrie… | Multimodal & Vision-Language | 4 | 20 |
| General Classification Algorithms trying to solve the general task of classificatio… | Foundations & Efficiency | 14,581 | 19 |
| Retrieval A methodology that involves selecting relevant data or examp… | Recommendation & Retrieval | 14,297 | 19 |
| Knowledge Graphs A knowledge graph is a structured representation of informat… | Graphs & Structured Data | 2,974 | 19 |
| Fact Checking | Language & Reasoning | 669 | 19 |
| Visual Tracking Visual Tracking is an essential and actively researched prob… | Computer Vision | 525 | 19 |
| Emotion Classification Emotion classification, or emotion categorization, is the ta… | Language & Reasoning | 458 | 19 |
| Robotic Grasping This task is composed of using Deep Learning to identify how… | Reinforcement Learning & Robotics | 246 | 19 |
| Bias Detection Bias detection is the task of detecting and measuring racism… | Language & Reasoning | 199 | 19 |
| Source-Free Domain Adaptation Source-Free Domain Adaptation (SFDA) is a domain adaptation … | Foundations & Efficiency | 188 | 19 |
| Few-Shot Text Classification Few-shot Text Classification predicts the semantic label of … | Language & Reasoning | 100 | 19 |
| Human Part Segmentation | Computer Vision | 19 | 19 |
| Biomedical Information Retrieval | Medical & Scientific | 15 | 19 |
| Single-object discovery | Computer Vision | 10 | 19 |
| Morpheme Segmentation Successful systems segment a given word or sentence into a se… | Language & Reasoning | 6 | 19 |
| Self-Supervised Learning Self-Supervised Learning is proposed for utilizing unlabeled… | Foundations & Efficiency | 5,044 | 18 |
| Emotion Recognition Emotion Recognition is an important area of research to enab… | Language & Reasoning | 2,041 | 18 |
| Topic Models A topic model is a type of statistical model for discovering… | Language & Reasoning | 881 | 18 |