SOTAVerified

All Tasks

4,818 tasks

Task | Area | Papers | Results
Edge Detection

Edge Detection is a fundamental image processing technique w…

Computer Vision | 490 | 27
6D Pose Estimation

Image: [Zeng et al](https://arxiv.org/pdf/1609.09475v3.pdf)

Computer Vision | 255 | 27
Age And Gender Classification

Age and gender classification is a dual-task of identifying …

Computer Vision | 34 | 27
Social Media Popularity Prediction

Social Media Popularity Prediction (SMPP) aims to predict th…

Foundations & Efficiency | 7 | 27
Unsupervised Anomaly Detection with Specified Settings -- 1% anomaly | Time Series & Forecasting | 6 | 27
Data Augmentation

Data augmentation involves techniques used for increasing th…

Generative Models | 8,378 | 26
Image Enhancement

Image Enhancement is basically improving the interpretabilit…

Generative Models | 983 | 26
Video Semantic Segmentation

The goal of video semantic segmentation is to assign a prede…

Computer Vision | 895 | 26
Gesture Recognition

Gesture Recognition is an active field of research with appl…

Computer Vision | 572 | 26
Audio Generation

Audio generation (synthesis) is the task of generating raw a…

Audio & Speech | 270 | 26
Video Reconstruction

Source: [Deep-SloMo](https://github.com/avinashpaliwal/Deep-…

Generative Models | 145 | 26
Multi-Label Image Classification

The Multi-Label Image Classification focuses on predicting l…

Computer Vision | 124 | 26
Line Segment Detection | Computer Vision | 37 | 26
Emotion Interpretation | Language & Reasoning | 6 | 26
Point Cloud Classification

Point Cloud Classification is a task involving the classific…

Computer Vision | 265 | 25
Sound Event Detection

Sound Event Detection (SED) is the task of recognizing the s…

Audio & Speech | 194 | 25
Entity Typing

Entity Typing is an important task in text analysis. Assigni…

Language & Reasoning | 170 | 25
Single-View 3D Reconstruction | Generative Models | 98 | 25
Time Series Regression

Predicting one or more scalars for an entire time series exa…

Time Series & Forecasting | 82 | 25
Clustering Algorithms Evaluation | Foundations & Efficiency | 12 | 25
Description-guided molecule generation

The significance of description-based molecule generation li…

Medical & Scientific | 2 | 25
Chunking

Chunking, also known as shallow parsing, identifies continuo…

Language & Reasoning | 447 | 24
Virtual Try-on

Virtual try-on of clothing or other items such as glasses an…

Generative Models | 276 | 24
Audio-Visual Speech Recognition

Audio-visual speech recognition is the task of transcribing …

Multimodal & Vision-Language | 100 | 24
Temporal Relation Extraction

Temporal relation extraction systems aim to identify and cla…

Time Series & Forecasting | 88 | 24
Visual Relationship Detection

Visual relationship detection (VRD) is one newly developed c…

Computer Vision | 82 | 24
Emotional Intelligence

Emotional Intelligence (EI) is a measure of "The ability to …

Language & Reasoning | 77 | 24
Dense Video Captioning

Most natural videos contain numerous events. For example, in…

Multimodal & Vision-Language | 76 | 24
Subjectivity Analysis

A related task to sentiment analysis is the subjectivity ana…

Language & Reasoning | 63 | 24
Meme Classification

Meme classification refers to the task of classifying intern…

Language & Reasoning | 59 | 24
Image Attribution

Image attribution algorithms aim to identify important regio…

Computer Vision | 26 | 24
Protein Secondary Structure Prediction

Protein secondary structure prediction is a vital task in bi…

Medical & Scientific | 26 | 24
3D Point Cloud Linear Classification

Training a linear classifier (e.g. SVM) on the embeddings/rep…

Computer Vision | 21 | 24
Autonomous Driving

Autonomous driving is the task of driving a vehicle without …

Reinforcement Learning & Robotics | 6,092 | 23
Time Series Anomaly Detection | Time Series & Forecasting | 264 | 23
Sign Language Translation

Given a video containing sign language, the task is to predi…

Multimodal & Vision-Language | 153 | 23
Stock Market Prediction | Time Series & Forecasting | 104 | 23
Hypernym Discovery

Given a corpus and a target term (hyponym), the task of hype…

Language & Reasoning | 33 | 23
Human Interaction Recognition

Human Interaction Recognition (HIR) is a field of study that…

Computer Vision | 22 | 23
Video-based Generative Performance Benchmarking

The benchmark evaluates a generative Video Conversational Mo…

Generative Models | 20 | 23
Multi-tissue Nucleus Segmentation | Medical & Scientific | 14 | 23
Semantic entity labeling

- One of Form Understanding task (Word grouping, Semantic en…

Language & Reasoning | 14 | 23
Unsupervised Panoptic Segmentation

Unsupervised Panoptic Segmentation aims to partition an imag…

Computer Vision | 4 | 23
Image Compression

Image Compression is an application of data compression for …

Generative Models | 1,008 | 22
Image Registration

Image registration is the process of transforming different …

Computer Vision | 953 | 22
Point Processes | Foundations & Efficiency | 541 | 22
Gaze Estimation

Gaze Estimation is a task to predict where a person is looki…

Computer Vision | 248 | 22
Scene Flow Estimation

Optical flow is a two-dimensional motion field in the image …

Computer Vision | 152 | 22
Extractive Text Summarization

Given a document, selecting a subset of the words or sentenc…

Language & Reasoning | 95 | 22
Text-to-Music Generation | Audio & Speech | 37 | 22