| Task | Area | Papers | Results |
|---|---|---|---|
| Training-free 3D Point Cloud Classification: Evaluation on target datasets for 3D Point Cloud Classificat… | Computer Vision | 8 | 13 |
| Generative 3D Object Classification: The task of generative 3D object classification involves pro… | Computer Vision | 5 | 13 |
| Online Beat Tracking | Audio & Speech | 4 | 13 |
| Style Transfer: Style Transfer is a technique in computer vision and graphic… | Generative Models | 1,661 | 12 |
| Feature Importance | Foundations & Efficiency | 890 | 12 |
| Small Object Detection: Small Object Detection is a computer vision task that involv… | Computer Vision | 152 | 12 |
| Partial Label Learning | Foundations & Efficiency | 85 | 12 |
| General Reinforcement Learning | Medical & Scientific | 84 | 12 |
| Attribute Value Extraction: Attribute Value Extraction is the task of extracting values … | Language & Reasoning | 35 | 12 |
| 3D Face Animation | Generative Models | 34 | 12 |
| 10-shot image generation: Generate 10 images of famous artworks | Generative Models | 29 | 12 |
| Image Editing | Generative Models | 10 | 12 |
| Representation Learning: Representation Learning is a process in machine learning whe… | Foundations & Efficiency | 10,580 | 11 |
| AutoML: Automated Machine Learning (AutoML) is a general concept whi… | Foundations & Efficiency | 641 | 11 |
| Word Alignment: Word Alignment is the task of finding the correspondence bet… | Language & Reasoning | 551 | 11 |
| Point Cloud Segmentation: 3D point cloud segmentation is the process of classifying po… | Computer Vision | 272 | 11 |
| Motion Segmentation: Motion Segmentation is an essential task in many application… | Computer Vision | 212 | 11 |
| Sparse Learning | Foundations & Efficiency | 185 | 11 |
| Dialogue Evaluation | Language & Reasoning | 97 | 11 |
| Type prediction | Foundations & Efficiency | 96 | 11 |
| Audio Tagging: Audio tagging is a task to predict the tags of audio clips. … | Audio & Speech | 81 | 11 |
| Source Code Summarization: Code Summarization is a task that tries to comprehend code a… | Language & Reasoning | 58 | 11 |
| Stereo Disparity Estimation | Computer Vision | 29 | 11 |
| Scene Change Detection: Scene change detection (SCD) refers to the task of localizin… | Computer Vision | 25 | 11 |
| Repetitive Action Counting: Repetitive action counting aims to count the number of repet… | Computer Vision | 14 | 11 |
| Visual Social Relationship Recognition | Computer Vision | 10 | 11 |
| ValNov: Given a textual premise and conclusion candidate, the Argume… | Foundations & Efficiency | 2 | 11 |
| Transfer Learning: Transfer Learning is a machine learning technique where a mo… | Foundations & Efficiency | 10,307 | 10 |
| Fairness | Foundations & Efficiency | 5,676 | 10 |
| Activity Recognition: Human Activity Recognition is the problem of identifying eve… | Computer Vision | 1,322 | 10 |
| parameter-efficient fine-tuning: Parameter-Efficient Fine-Tuning (PEFT) is a technique used t… | Foundations & Efficiency | 935 | 10 |
| Response Generation: A task where an agent should play the $DE$ role and generate… | Language & Reasoning | 914 | 10 |
| Intent Classification: Intent Classification is the task of correctly labeling a na… | Language & Reasoning | 344 | 10 |
| Natural Language Queries | Language & Reasoning | 337 | 10 |
| Knowledge Tracing: Knowledge Tracing is the task of modelling student knowledge… | Language & Reasoning | 215 | 10 |
| Text Style Transfer: Text Style Transfer is the task of controlling certain attri… | Language & Reasoning | 186 | 10 |
| Protein Design: Formally, given the design requirements of users, models are… | Medical & Scientific | 175 | 10 |
| Image Colorization | Generative Models | 127 | 10 |
| Medical Image Generation: Medical image generation is the task of synthesising new med… | Generative Models | 78 | 10 |
| Pneumonia Detection | Medical & Scientific | 63 | 10 |
| Kinship Verification: Kinship verification aims to find out whether there is a kin… | Computer Vision | 42 | 10 |
| Voice Anti-spoofing: Discriminate genuine speech and spoofing attacks | Audio & Speech | 23 | 10 |
| Spatio-Temporal Video Grounding: Spatio-temporal video grounding is a computer vision and nat… | Multimodal & Vision-Language | 22 | 10 |
| Relationship Extraction (Distant Supervised): Relationship extraction is the task of extracting semantic r… | Computer Vision | 19 | 10 |
| Abnormal Event Detection In Video: Abnormal Event Detection In Video is a challenging task in c… | Computer Vision | 17 | 10 |
| Image Paragraph Captioning: Image paragraph captioning involves generating a detailed, m… | Multimodal & Vision-Language | 17 | 10 |
| Open Vocabulary Panoptic Segmentation | Computer Vision | 17 | 10 |
| Drivable Area Detection: The drivable area detection is a subset topic of object dete… | Reinforcement Learning & Robotics | 12 | 10 |
| Event data classification | Foundations & Efficiency | 12 | 10 |
| Mathematical Question Answering: Building systems that automatically answer mathematical ques… | Language & Reasoning | 11 | 10 |