| OpenAI Vision | Reinforcement Learning & Robotics | 1 | 0 |
| Open Knowledge Graph Embedding | Graphs & Structured Data | 1 | 0 |
| Open-Set Multi-Target Domain Adaptation Open Set Multi-Target Domain Adaptation (OSMTDA) aims to tra… | Computer Vision | 1 | 0 |
| Open Set Video Captioning | Multimodal & Vision-Language | 1 | 0 |
| Open-vocab Temporal Action Detection | Time Series & Forecasting | 1 | 0 |
| Open-Vocabulary Panoramic Semantic Segmentation | Computer Vision | 1 | 0 |
| Open-World Social Event Classification With the rapid development of Internet and multimedia, popul… | Language & Reasoning | 1 | 0 |
| Optical Tweezers Simulations A sub-task for optical tweezers related physical simulations… | Computer Vision | 1 | 0 |
| Optimize the trajectory of UAV which plays a BS in communication system | Foundations & Efficiency | 1 | 0 |
| OrangeSum | Language & Reasoning | 1 | 0 |
| Oriented Object Detcti | Computer Vision | 1 | 0 |
| Oriented Object Detctio | Computer Vision | 1 | 0 |
| Corpus Video Moment Retrieval | Computer Vision | 1 | 0 |
| Outcome Prediction In Multimodal MRI | Medical & Scientific | 1 | 0 |
| Out-of-Sight Trajectory Prediction Predicting visual trajectories of out-of-sight objects using… | Computer Vision | 1 | 0 |
| Overlapped 14-1 | Language & Reasoning | 1 | 0 |
| Overlapped 25-25 | Language & Reasoning | 1 | 0 |
| Overlapping Mention Recognition Overlapping mention recognition is the task of correctly ide… | Language & Reasoning | 1 | 0 |
| Overlapping Pose Estimation Pose estimation with overlapping poses. | Computer Vision | 1 | 0 |
| Panop | Computer Vision | 1 | 0 |
| Panoptic Segmentation (PanopticNDT instances) | Computer Vision | 1 | 0 |
| Panorama Pose Estimation (N-view) | Computer Vision | 1 | 0 |
| Paper generation (abstract-to-conclusion) | Generative Models | 1 | 0 |
| Paper generation (Conclusion-to-title) | Generative Models | 1 | 0 |
| Paper generation (Title-to-abstract) | Generative Models | 1 | 0 |
| Paraphrase Identification within Bi-Encoder | Language & Reasoning | 1 | 0 |
| Paraphrase Mining | Language & Reasoning | 1 | 0 |
| Parkinson Detection from Speech Detecting Parkinson’s Disease using speech features. | Audio & Speech | 1 | 0 |
| Patent Figure Description Generation Given a patent figure, automatically generate its descriptio… | Generative Models | 1 | 0 |
| Patient QA | Medical & Scientific | 1 | 0 |
| Pedestrian Density Estimation Pedestrian density estimation is the task of estimating the … | Computer Vision | 1 | 0 |
| Pedestrian Image Caption | Multimodal & Vision-Language | 1 | 0 |
| Persona Dialogue in Story Building persona dialogue in a story | Language & Reasoning | 1 | 0 |
| Person Identification (zero-shot) | Computer Vision | 1 | 0 |
| person reposing Person reposing describes the task of changing the pose of a… | Computer Vision | 1 | 0 |
| Persuasive Writing Strategy Detection | Computer Vision | 1 | 0 |
| Persuasive Writing Strategy Detection Level-1 | Computer Vision | 1 | 0 |
| Persuasive Writing Strategy Detection Level-2 | Computer Vision | 1 | 0 |
| Persuasive Writing Strategy Detection Level-3 | Computer Vision | 1 | 0 |
| Persuasive Writing Strategy Detection Level-4 | Computer Vision | 1 | 0 |
| PHMbench | Medical & Scientific | 1 | 0 |
| PHM-Vibench | Medical & Scientific | 1 | 0 |
| Phrase Extraction and Grounding (PEG) PEG requires a model to extract phrases from text and locate… | Computer Vision | 1 | 0 |
| Phrase Vector Embedding Just like the generation of word (1-gram) vector embedding, … | Foundations & Efficiency | 1 | 0 |
| Piano Music Modeling | Audio & Speech | 1 | 0 |
| Pill Classification (Both Sides) | Medical & Scientific | 1 | 0 |
| Plan2Scene Converting floorplans + RGB photos to textured 3D mesh model… | Computer Vision | 1 | 0 |
| Point cloud classification dataset | Computer Vision | 1 | 0 |
| Point Cloud Representation Learning | Computer Vision | 1 | 0 |
| Point Cloud Semantic Completion | Computer Vision | 1 | 0 |