SOTAVerified

Computer Vision

1,775 tasks

Papers in this area

Showing 1-10 of 10 papers

| Title | Status | Hype |
| --- | --- | --- |
| YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information | Code | 16 |
| MinerU: An Open-Source Solution for Precise Document Content Extraction | Code | 16 |
| ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools | Code | 14 |
| Qwen2 Technical Report | Code | 13 |
| Open-Sora: Democratizing Efficient Video Production for All | Code | 13 |
| MiniCPM-V: A GPT-4V Level MLLM on Your Phone | Code | 12 |
| SAM 2: Segment Anything in Images and Videos | Code | 12 |
| SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics | Code | 12 |
| NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security | Code | 11 |
| HunyuanVideo: A Systematic Framework For Large Video Generative Models | Code | 11 |

| Task | Description | Papers | Results |
| --- | --- | --- | --- |
| Image Classification | Image Classification is a fundamental task in vision recogni… | 10,419 | 2,912 |
| Semantic Segmentation | | 14,763 | 1,920 |
| Object Detection | | 10,957 | 981 |
| Few-Shot Image Classification | Few-Shot Image Classification is a computer vision task that… | 353 | 913 |
| 3D Object Detection | 3D Object Detection is a task in computer vision where the g… | 1,576 | 809 |
| Action Recognition | Action Recognition is a computer vision task that involves r… | 2,759 | 650 |
| Domain Generalization | The idea of Domain Generalization is to learn from one or mu… | 1,751 | 570 |
| Person Re-Identification | Person Re-Identification is a computer vision task in which … | 1,488 | 533 |
| Few-Shot Semantic Segmentation | Few-shot semantic segmentation (FSS) learns to segment targe… | 168 | 458 |
| Semi-Supervised Image Classification | Semi-supervised image classification leverages unlabelled da… | 167 | 456 |
| 3D Human Pose Estimation | 3D Human Pose Estimation is a computer vision task that invo… | 665 | 454 |
| Semi-Supervised Video Object Segmentation | The semi-supervised scenario assumes the user inputs a full … | 147 | 380 |
| Fine-Grained Image Classification | Fine-Grained Image Classification is a task in computer visi… | 353 | 377 |
| Instance Segmentation | Instance Segmentation is a computer vision task that involve… | 2,262 | 340 |
| Image Clustering | Models that partition the dataset into semantically meaningf… | 236 | 329 |
| Visual Object Tracking | Visual Object Tracking is an important research topic in com… | 341 | 289 |
| Pose Estimation | Pose Estimation is a computer vision task where the goal is … | 4,228 | 236 |
| Multi-Object Tracking | Multi-Object Tracking is a task in computer vision that invo… | 671 | 227 |
| 3D Point Cloud Classification | | 202 | 216 |
| Panoptic Segmentation | Panoptic Segmentation is a computer vision task that combine… | 462 | 208 |
| Point Cloud Registration | Point Cloud Registration is a fundamental problem in 3D comp… | 447 | 190 |
| Scene Text Recognition | See [Scene Text Detection](https://paperswithcode.com/task/s… | 269 | 190 |
| Video Quality Assessment | Video Quality Assessment is a computer vision task aiming to… | 216 | 186 |
| Facial Expression Recognition (FER) | Facial Expression Recognition (FER) is a computer vision tas… | 492 | 167 |
| RGB Salient Object Detection | RGB Salient object detection is a task-based on a visual att… | 222 | 159 |
| Change Detection | Change Detection is a computer vision task that involves det… | 919 | 146 |
| Scene Text Detection | Scene Text Detection is a computer vision task that involves… | 213 | 146 |
| 3D Semantic Segmentation | 3D Semantic Segmentation is a computer vision task that invo… | 348 | 145 |
| 3D Multi-Object Tracking | Image: [Weng et al](https://arxiv.org/pdf/1907.03961v4.pdf) | 101 | 140 |
| Face Detection | Face Detection is a computer vision task that involves autom… | 536 | 139 |
| Optical Flow Estimation | Optical Flow Estimation is a computer vision task that invol… | 2,184 | 133 |
| Video Instance Segmentation | The goal of video instance segmentation is simultaneous dete… | 148 | 133 |
| Face Verification | Face Verification is a machine learning task in computer vis… | 360 | 130 |
| 3D Object Tracking | 3D Object Tracking is a computer vision task dedicated to mo… | 67 | 127 |
| Open Vocabulary Semantic Segmentation | | 113 | 124 |
| Action Segmentation | Action Segmentation is a challenging problem in high-level v… | 219 | 120 |
| Few-Shot Object Detection | Few-Shot Object Detection is a computer vision task that inv… | 179 | 111 |
| Semi-Supervised Object Detection | Semi-supervised object detection uses both labeled data and … | 115 | 110 |
| Age Estimation | Age Estimation is the task of estimating the age of a person… | 254 | 109 |
| Weakly Supervised Action Localization | In this task, the training data consists of videos with a li… | 55 | 107 |
| Human-Object Interaction Detection | Human-Object Interaction (HOI) detection is a task of identi… | 449 | 103 |
| Zero-Shot Action Recognition | | 83 | 103 |
| 3D Hand Pose Estimation | Image: [Zimmermann et al](https://arxiv.org/pdf/1705.01389v3.… | 178 | 102 |
| Unsupervised Semantic Segmentation | Models that learn to segment each image (i.e. assign a class… | 95 | 97 |
| Video Object Segmentation | Video object segmentation is a binary labeling problem aimin… | 551 | 96 |
| Zero-Shot Transfer Image Classification | | 19 | 95 |
| Multi-Person Pose Estimation | Multi-person pose estimation is the task of estimating the p… | 151 | 94 |
| Pedestrian Detection | Pedestrian detection is the task of detecting pedestrians fr… | 438 | 92 |
| Face Recognition | Facial Recognition is the task of making a positive identifi… | 2,329 | 88 |
| Action Detection | Action Detection aims to find both where and when an action … | 817 | 81 |