SOTAVerified
Home/All Tasks

All Tasks

4,818 tasks

TaskAreaPapersResults
Training-free 3D Point Cloud Classification

Evaluation on target datasets for 3D Point Cloud Classificat…

Computer Vision813
Generative 3D Object Classification

The task of generative 3D object classification involves pro…

Computer Vision513
Online Beat TrackingAudio & Speech413
Style Transfer

Style Transfer is a technique in computer vision and graphic…

Generative Models1,66112
Feature ImportanceFoundations & Efficiency89012
Small Object Detection

Small Object Detection is a computer vision task that involv…

Computer Vision15212
Partial Label LearningFoundations & Efficiency8512
General Reinforcement LearningMedical & Scientific8412
Attribute Value Extraction

Attribute Value Extraction is the task of extracting values …

Language & Reasoning3512
3D Face Animation

Image: [Cudeiro et al](https://arxiv.org/pdf/1905.03079v1.pd…

Generative Models3412
10-shot image generation

Generate 10 image of famous arts

Generative Models2912
Image EditingGenerative Models1012
Representation Learning

Representation Learning is a process in machine learning whe…

Foundations & Efficiency10,58011
AutoML

Automated Machine Learning (AutoML) is a general concept whi…

Foundations & Efficiency64111
Word Alignment

Word Alignment is the task of finding the correspondence bet…

Language & Reasoning55111
Point Cloud Segmentation

3D point cloud segmentation is the process of classifying po…

Computer Vision27211
Motion Segmentation

Motion Segmentation is an essential task in many application…

Computer Vision21211
Sparse LearningFoundations & Efficiency18511
Dialogue EvaluationLanguage & Reasoning9711
Type predictionFoundations & Efficiency9611
Audio Tagging

Audio tagging is a task to predict the tags of audio clips. …

Audio & Speech8111
Source Code Summarization

Code Summarization is a task that tries to comprehend code a…

Language & Reasoning5811
Stereo Disparity EstimationComputer Vision2911
Scene Change Detection

Scene change detection (SCD) refers to the task of localizin…

Computer Vision2511
Repetitive Action Counting

Repetitive action counting aims to count the number of repet…

Computer Vision1411
Visual Social Relationship RecognitionComputer Vision1011
ValNov

Given a textual premise and conclusion candidate, the Argume…

Foundations & Efficiency211
Transfer Learning

Transfer Learning is a machine learning technique where a mo…

Foundations & Efficiency10,30710
FairnessFoundations & Efficiency5,67610
Activity Recognition

Human Activity Recognition is the problem of identifying eve…

Computer Vision1,32210
parameter-efficient fine-tuning

Parameter-Efficient Fine-Tuning (PEFT) is a technique used t…

Foundations & Efficiency93510
Response Generation

A task where an agent should play the $DE$ role and generate…

Language & Reasoning91410
Intent Classification

Intent Classification is the task of correctly labeling a na…

Language & Reasoning34410
Natural Language QueriesLanguage & Reasoning33710
Knowledge Tracing

Knowledge Tracing is the task of modelling student knowledge…

Language & Reasoning21510
Text Style Transfer

Text Style Transfer is the task of controlling certain attri…

Language & Reasoning18610
Protein Design

Formally, given the design requirements of users, models are…

Medical & Scientific17510
Image ColorizationGenerative Models12710
Medical Image Generation

Medical image generation is the task of synthesising new med…

Generative Models7810
Pneumonia DetectionMedical & Scientific6310
Kinship Verification

Kinship verification aims to find out whether there is a kin…

Computer Vision4210
Voice Anti-spoofing

Discriminate genuine speech and spoofing attacks

Audio & Speech2310
Spatio-Temporal Video Grounding

Spatio-temporal video grounding is a computer vision and nat…

Multimodal & Vision-Language2210
Relationship Extraction (Distant Supervised)

Relationship extraction is the task of extracting semantic r…

Computer Vision1910
Abnormal Event Detection In Video

Abnormal Event Detection In Video is a challenging task in c…

Computer Vision1710
Image Paragraph Captioning

Image paragraph captioning involves generating a detailed, m…

Multimodal & Vision-Language1710
Open Vocabulary Panoptic SegmentationComputer Vision1710
Drivable Area Detection

The drivable area detection is a subset topic of object dete…

Reinforcement Learning & Robotics1210
Event data classificationFoundations & Efficiency1210
Mathematical Question Answering

Building systems that automatically answer mathematical ques…

Language & Reasoning1110