SOTAVerified
All Tasks

4,818 tasks

Task | Area | Papers | Results
Spoken Language Understanding | Language & Reasoning | 550 | 36
Text-to-Video Generation

My grandmother told me that when she was a student, …

Multimodal & Vision-Language | 201 | 36
No-Reference Image Quality Assessment

An Image Quality Assessment approach where no reference imag…

Computer Vision | 155 | 36
Highlight Detection

https://youtu.be/pJ0auP7dbcY?si=vSiZevfJ57YUKC2q

Computer Vision | 78 | 36
Tabular Data Generation

Generation of tabular data using generative models

Generative Models | 73 | 36
Image Reconstruction | Generative Models | 2,143 | 35
Stochastic Optimization

Stochastic Optimization is the task of optimizing certain ob…

Foundations & Efficiency | 1,387 | 35
Network Pruning

Network Pruning is a popular approach to reduce a heavy netw…

Foundations & Efficiency | 534 | 35
Malware Classification

Malware Classification is the process of assigning a malware…

Language & Reasoning | 146 | 35
Single-step retrosynthesis | Medical & Scientific | 34 | 35
Image Segmentation

Image Segmentation is a computer vision task that involves d…

Computer Vision | 5,035 | 34
Text Clustering

Grouping a set of texts in such a way that objects in the sa…

Language & Reasoning | 123 | 34
Referring expression generation

Generate referring expressions

Multimodal & Vision-Language | 84 | 34
3D Anomaly Detection

3D-only Anomaly Detection. Structures out of normal distribu…

Time Series & Forecasting | 36 | 34
Adversarial Robustness

Adversarial Robustness evaluates the vulnerabilities of mach…

Foundations & Efficiency | 1,746 | 33
Relation Classification

Relation Classification is the task of identifying the seman…

Language & Reasoning | 445 | 33
Visual Storytelling

( Image credit: [No Metrics Are Perfect](https://github.com/…

Multimodal & Vision-Language | 115 | 33
Image-to-Text Retrieval

Image-text retrieval is the process of retrieving relevant i…

Multimodal & Vision-Language | 59 | 33
Graph Clustering

Graph Clustering is the process of grouping the nodes of the…

Graphs & Structured Data | 393 | 32
Depth Completion

The Depth Completion task is a sub-problem of depth estimati…

Computer Vision | 242 | 32
Reflection Removal

Remove reflections from an image, such as those captured when photographing through glass

Generative Models | 81 | 32
Shadow Detection | Computer Vision | 79 | 32
Object Recognition

Object recognition is a computer vision technique for detect…

Computer Vision | 2,042 | 31
Speech Synthesis

Speech synthesis is the task of generating speech from some …

Audio & Speech | 1,249 | 31
Fake News Detection

Fake News Detection is a natural language processing task th…

Language & Reasoning | 490 | 31
Video Classification

Video Classification is the task of producing a label that i…

Computer Vision | 455 | 31
Visual Localization

Visual Localization is the problem of estimating the camera …

Computer Vision | 402 | 31
Scene Graph Generation

A scene graph is a structured representation of an image, wh…

Graphs & Structured Data | 318 | 31
Speech-to-Text Translation

Translate audio signals of speech in one language into text …

Audio & Speech | 146 | 31
Person Search

Person Search is a task which aims at matching a specific pe…

Computer Vision | 139 | 31
Music Transcription

Music transcription is the task of converting an acoustic mu…

Audio & Speech | 96 | 31
Collaborative Filtering | Recommendation & Retrieval | 1,309 | 30
DeepFake Detection

DeepFake Detection is the task of detecting fake videos or i…

Language & Reasoning | 580 | 30
Document Layout Analysis

"Document Layout Analysis is performed to determine physical…

Language & Reasoning | 99 | 30
Phrase Grounding

Given an image and a corresponding caption, the Phrase Groun…

Multimodal & Vision-Language | 88 | 30
Pedestrian Attribute Recognition

Pedestrian attribute recognition is the task of recognizin…

Computer Vision | 56 | 30
Weather Forecasting

Weather Forecasting is the prediction of future weather cond…

Time Series & Forecasting | 420 | 29
Audio captioning

Audio Captioning is the task of describing audio using text.…

Audio & Speech | 119 | 29
Surface Normals Estimation

Surface normal estimation deals with the task of predicting …

Computer Vision | 39 | 29
Bird's-Eye View Semantic Segmentation | Reinforcement Learning & Robotics | 26 | 29
Parking Space Occupancy

Image credit: [https://github.com/martin-marek/parking-space…

Computer Vision | 5 | 29
Multiple Instance Learning

Multiple Instance Learning is a type of weakly supervised le…

Foundations & Efficiency | 744 | 28
Adversarial Defense

Competitions with currently unpublished results: - [TrojAI](…

Foundations & Efficiency | 403 | 28
Stance Detection

Stance detection is the extraction of a subject's reaction t…

Language & Reasoning | 343 | 28
Code Completion | Language & Reasoning | 212 | 28
Text to Audio Retrieval | Audio & Speech | 20 | 28
Unsupervised Anomaly Detection with Specified Settings -- 0.1% anomaly | Time Series & Forecasting | 6 | 28
Unsupervised Anomaly Detection with Specified Settings -- 10% anomaly | Time Series & Forecasting | 6 | 28
Unsupervised Anomaly Detection with Specified Settings -- 20% anomaly | Time Series & Forecasting | 6 | 28
Unsupervised Anomaly Detection with Specified Settings -- 30% anomaly | Time Series & Forecasting | 6 | 28