All Tasks

4,818 tasks

Task	Area	Papers
Traffic Data Imputation	Time Series & Forecasting	23
3D Point Cloud Reconstruction Encoding and reconstruction of 3D point clouds.	Computer Vision	22
3D Question Answering (3D-QA) A 3D-QA task requires models to answer a question when given…	Multimodal & Vision-Language	22
Aerial Scene Classification	Computer Vision	22
Building change detection for remote sensing images	Computer Vision	22
Constituency Grammar Induction Inducing a constituency-based phrase structure grammar.	Language & Reasoning	22
Conversational Response Generation Given an input conversation, generate a natural-looking text…	Language & Reasoning	22
Coronary Artery Segmentation	Computer Vision	22
Cross Document Coreference Resolution	Language & Reasoning	22
Cross-Lingual Entity Linking Cross-lingual entity linking is the task of using data and m…	Language & Reasoning	22
De-aliasing De-aliasing is the problem of recovering the original high-f…	Computer Vision	22
Design Synthesis	Medical & Scientific	22
Evidence Selection	Recommendation & Retrieval	22
Facial expression generation	Computer Vision	22
Human Agent Collaboration	Computer Vision	22
Image Rescaling Image rescaling is a bidirectional operation, which first do…	Computer Vision	22
Image-to-Image Regression	Computer Vision	22
Image to Point Cloud Registration Given a query image and a scene of point cloud, get the came…	Computer Vision	22
Linguistic steganography	Graphs & Structured Data	22
ListOps	Foundations & Efficiency	22
LLM-generated Text Detection Classifying human-written and LLM-generated texts.	Language & Reasoning	22
Long Term Action Anticipation	Computer Vision	22
Natural Language Moment Retrieval	Language & Reasoning	22
Predicate Classification	Language & Reasoning	22
Raindrop Removal	Computer Vision	22
Real-Time Visual Tracking	Computer Vision	22
SAR Image Despeckling Despeckling is the task of suppressing speckle from Syntheti…	Computer Vision	22
Sequence-To-Sequence Speech Recognition	Audio & Speech	22
Short-Text Conversation Given a short text, finding an appropriate response (Source:…	Language & Reasoning	22
Unsupervised Few-Shot Learning In contrast to supervised few-shot learning, only the unlabe…	Time Series & Forecasting	22
Unsupervised Image Registration	Computer Vision	22
3D Inpainting 3D Inpainting is the removal of unwanted objects from a 3D s…	Computer Vision	21
Chord Recognition	Audio & Speech	21
Data Interaction Measure and value the train data interaction to interpret th…	Language & Reasoning	21
Egocentric Pose Estimation	Computer Vision	21
Graphon Estimation	Graphs & Structured Data	21
Low-latency processing	Foundations & Efficiency	21
Models Alignment Models Alignment is the process of ensuring that multiple mo…	Language & Reasoning	21
Motion Interpolation Interpolation and completion of 3D human motion sequences.	Computer Vision	21
Neural Network simulation Simulation of abstract or biophysical neural networks in sil…	Foundations & Efficiency	21
Panoptic Scene Graph Generation PSG task abstracts the given image with a scene graph, where…	Graphs & Structured Data	21
Predicting Patient Outcomes	Medical & Scientific	21
Raspberry Pi 3	Medical & Scientific	21
Speech Tokenization Speech tokenization is the task of representing speech signa…	Audio & Speech	21
Subject Transfer	Foundations & Efficiency	21
text-guided-generation	Generative Models	21
Text-Independent Speaker Recognition	Audio & Speech	21
Zero-shot Slot Filling	Language & Reasoning	21
Aesthetics Quality Assessment Automatic assessment of aesthetic-related subjective ratings…	Computer Vision	20
Audio-Visual Question Answering (AVQA)	Audio & Speech	20