SOTAVerified
Home/All Tasks

All Tasks

4,818 tasks

TaskAreaPapersResults
Document Summarization

Automatic Document Summarization is the task of rewriting a …

Language & Reasoning76050
Unsupervised Anomaly Detection

The objective of Unsupervised Anomaly Detection is to detect…

Time Series & Forecasting50650
Multi-Label Text Classification

According to Wikipedia "In machine learning, multi-label cla…

Language & Reasoning17150
Text Spotting

Scene Text Spotting is the combination of Scene Text Detecti…

Language & Reasoning11250
Medical Code Prediction

Context: Prediction of medical codes from clinical notes is …

Medical & Scientific2750
Speaker Verification

Speaker verification is the verifying the identity of a pers…

Audio & Speech74649
Semantic correspondence

The task of semantic correspondence aims to establish reliab…

Computer Vision17549
Document Image Classification

Document image classification is the task of classifying doc…

Multimodal & Vision-Language5049
Part-Of-Speech Tagging

Part-of-speech tagging (POS tagging) is the task of tagging …

Language & Reasoning99048
Image Inpainting

Remove the preview tag from template

Generative Models70848
Hate Speech Detection

Hate speech detection is the task of detecting if communicat…

Language & Reasoning50748
Speech Emotion Recognition

Speech Emotion Recognition is a task of speech processing an…

Audio & Speech43148
2D Human Pose Estimation

What is Human Pose Estimation? Human pose estimation is the …

Computer Vision11847
Salient Object DetectionComputer Vision52746
Handwritten Text Recognition

Handwritten Text Recognition (HTR) is the task of automatica…

Language & Reasoning13946
Fraud Detection

Fraud Detection is a vital topic that applies to many indust…

Computer Vision54745
Robot Navigation

The fundamental objective of mobile Robot Navigation is to a…

Reinforcement Learning & Robotics54245
OpenAI Gym

An open-source toolkit from OpenAI that implements several R…

Reinforcement Learning & Robotics38245
Image Matting

Image Matting is the process of accurately estimating the fo…

Generative Models22545
Action Recognition In Videos

Action Recognition in Videos is a task in computer vision an…

Computer Vision12445
Action Quality Assessment

Assessing/analyzing/quantifying how well an action was perfo…

Computer Vision5245
Full reference image quality assessment

The goal is to calculate an objective quality score for a gi…

Computer Vision5045
3D Reconstruction

3D Reconstruction is the task of creating a 3D model or repr…

Computer Vision2,32644
Object Tracking

Object tracking is the task of taking an initial set of obje…

Computer Vision1,96644
Paraphrase Identification

The goal of Paraphrase Identification is to determine whethe…

Language & Reasoning17244
Multivariate Time Series ForecastingTime Series & Forecasting24543
Music Source Separation

Music source separation is the task of decomposing music int…

Audio & Speech10743
Natural Language Understanding

Natural Language Understanding is an important field of Natu…

Language & Reasoning1,97842
Question Generation

The goal of Question Generation is to generate a valid and f…

Language & Reasoning66442
2D Semantic SegmentationComputer Vision7942
Zero-Shot Semantic SegmentationComputer Vision6042
Text based Person RetrievalMultimodal & Vision-Language4942
Mathematical ReasoningLanguage & Reasoning80541
Fact Verification

Fact verification, also called "fact checking", is a process…

Language & Reasoning21641
Cross-Lingual Document Classification

Cross-lingual document classification refers to the task of …

Language & Reasoning2541
Semantic Role Labeling

Semantic role labeling aims to model the predicate-argument …

Language & Reasoning62040
Speaker Diarization

Speaker Diarization is the task of segmenting and co-indexin…

Audio & Speech32840
Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g…

Computer Vision31640
Constituency Parsing

Constituency parsing aims to extract a constituency-based pa…

Language & Reasoning20440
Visual Prompt Tuning

Visual Prompt Tuning(VPT) only introduces a small amount of …

Multimodal & Vision-Language7040
Action Anticipation

Next action anticipation is defined as observing 1, ... , T …

Computer Vision11039
Long-Context UnderstandingLanguage & Reasoning8139
Blind Face Restoration

Blind face restoration aims at recovering high-quality faces…

Generative Models5539
Molecule Captioning

Molecular description generation entails the creation of a d…

Multimodal & Vision-Language2539
Facial Landmark Detection

Facial Landmark Detection is a computer vision task that inv…

Computer Vision13938
Chart Question Answering

Question Answering task on charts images

Multimodal & Vision-Language5038
Animal Pose Estimation

Animal pose estimation is the task of identifying the pose o…

Computer Vision4838
Multispectral Object Detection

Only using RGB cameras for automatic outdoor scene analysis …

Computer Vision3938
Quantization

Quantization is a promising technique to reduce the computat…

Foundations & Efficiency4,92537
Video Inpainting

The goal of Video Inpainting is to fill in missing regions o…

Generative Models13037