SOTAVerified
Home/All Tasks

All Tasks

4,818 tasks

TaskAreaPapersResults
Image Classification

Image Classification is a fundamental task in vision recogni…

Computer Vision10,4192,912
Atari Games

The Atari 2600 Games task (and dataset) involves training an…

Reinforcement Learning & Robotics6252,519
Semantic SegmentationComputer Vision14,7631,920
Node Classification

Node Classification is a machine learning task in graph-base…

Graphs & Structured Data1,8601,793
Question Answering

Question answering can be segmented into domain-specific tas…

Language & Reasoning10,8171,784
Object DetectionComputer Vision10,957981
Few-Shot Image Classification

Few-Shot Image Classification is a computer vision task that…

Computer Vision353913
Image Generation

Image Generation (synthesis) is the task of generating new i…

Generative Models6,689871
3D Object Detection

3D Object Detection is a task in computer vision where the g…

Computer Vision1,576809
Graph Classification

Graph Classification is a task that involves classifying a g…

Graphs & Structured Data927809
Image Super-Resolution

Image Super-Resolution is a machine learning task where the …

Generative Models1,589748
Visual Question Answering (VQA)

Visual Question Answering (VQA) is a task in computer vision…

Multimodal & Vision-Language2,167727
Anomaly Detection

Anomaly Detection is a binary classification identifying unu…

Time Series & Forecasting4,856669
Action Recognition

Action Recognition is a computer vision task that involves r…

Computer Vision2,759650
Domain Generalization

The idea of Domain Generalization is to learn from one or mu…

Computer Vision1,751570
Time Series Forecasting

Time Series Forecasting is the task of fitting a model to hi…

Time Series & Forecasting1,609538
Person Re-Identification

Person Re-Identification is a computer vision task in which …

Computer Vision1,488533
Natural Language Inference

Natural language inference (NLI) is the task of determining …

Language & Reasoning1,961505
Link Prediction

Link Prediction is a task in graph and network analysis wher…

Graphs & Structured Data1,949501
Language Modelling

A language model is a model of natural language. Language mo…

Language & Reasoning17,610467
Few-Shot Semantic Segmentation

Few-shot semantic segmentation (FSS) learns to segment targe…

Computer Vision168458
Semi-Supervised Image Classification

Semi-supervised image classification leverages unlabelled da…

Computer Vision167456
3D Human Pose Estimation

3D Human Pose Estimation is a computer vision task that invo…

Computer Vision665454
Named Entity Recognition (NER)

Named Entity Recognition (NER) is a task of Natural Language…

Language & Reasoning2,874439
Machine Translation

Machine translation is the task of translating a sentence in…

Language & Reasoning10,752438
Neural Architecture Search

Neural architecture search (NAS) is a technique for automati…

Foundations & Efficiency1,915424
Image Captioning

Image Captioning is the task of describing the content of an…

Multimodal & Vision-Language1,878422
Long-tail Learning

Long-tailed learning, one of the most challenging problems i…

Foundations & Efficiency131421
Speech Recognition

Speech Recognition is the task of converting spoken language…

Audio & Speech6,433398
Common Sense Reasoning

Common sense reasoning tasks are intended to require the mod…

Language & Reasoning939397
Unsupervised Domain Adaptation

Unsupervised Domain Adaptation is a learning framework to tr…

Foundations & Efficiency1,951393
Domain Adaptation

Domain Adaptation is the task of adapting models across doma…

Foundations & Efficiency6,439391
Sentiment Analysis

Sentiment Analysis is the task of classifying the polarity o…

Language & Reasoning5,630382
Semi-Supervised Video Object Segmentation

The semi-supervised scenario assumes the user inputs a full …

Computer Vision147380
Relation Extraction

Relation Extraction is the task of predicting attributes and…

Language & Reasoning1,977377
Fine-Grained Image Classification

Fine-Grained Image Classification is a task in computer visi…

Computer Vision353377
Image Retrieval

Image Retrieval is a fundamental and long-standing computer …

Multimodal & Vision-Language2,239372
Text Classification

Text Classification is the task of assigning a sentence or d…

Language & Reasoning3,635341
Instance Segmentation

Instance Segmentation is a computer vision task that involve…

Computer Vision2,262340
Visual Question Answering

MLLM Leaderboard

Multimodal & Vision-Language2,177334
Medical Image Segmentation

Medical Image Segmentation is a computer vision task that in…

Medical & Scientific2,089333
Image Clustering

Models that partition the dataset into semantically meaningf…

Computer Vision236329
Referring Expression Segmentation

The task aims at labeling the pixels of an image or video th…

Multimodal & Vision-Language145317
Video Retrieval

The objective of video retrieval is as follows: given a text…

Multimodal & Vision-Language486309
Multi-Label Classification

multilabel graph classification with highest result

Foundations & Efficiency1,198302
Motion Forecasting

Motion forecasting is the task of predicting the location of…

Time Series & Forecasting205299
Visual Object Tracking

Visual Object Tracking is an important research topic in com…

Computer Vision341289
Semantic Textual Similarity

Semantic textual similarity deals with determining how simil…

Language & Reasoning2,381280
Out-of-Distribution Detection

Detect out-of-distribution or anomalous examples.

Foundations & Efficiency888269
Visual Place Recognition

Visual Place Recognition is the task of matching a view of a…

Multimodal & Vision-Language297265