| Unsupervised 3D Semantic Segmentation | Computer Vision | 3 | 0 |
| Unsupervised Domain Expansion | Foundations & Efficiency | 3 | 0 |
| Unsupervised Zero-Shot Instance Segmentation | Computer Vision | 3 | 0 |
| Vehicle Color Recognition Vehicle Color Recognition (VCR) involves developing a system… | Computer Vision | 3 | 0 |
| Video Narrative Grounding Video Narrative Grounding is the task of linking video narra… | Computer Vision | 3 | 0 |
| Video Relationship | Computer Vision | 3 | 0 |
| Video-to-Shop | Computer Vision | 3 | 0 |
| Vietnamese Aspect-Based Sentiment Analysis UIT-ViSFD: A Vietnamese Smartphone Feedback Dataset for Aspe… | Language & Reasoning | 3 | 0 |
| Vietnamese Natural Language Understanding | Language & Reasoning | 3 | 0 |
| Vietnamese Scene Text | Computer Vision | 3 | 0 |
| Vietnamese Sentiment Analysis | Language & Reasoning | 3 | 0 |
| Visual Sentiment Prediction | Multimodal & Vision-Language | 3 | 0 |
| Webcam (RGB) image classification | Computer Vision | 3 | 0 |
| Webpage Object Detection | Computer Vision | 3 | 0 |
| Whole Mammogram Classification | Medical & Scientific | 3 | 0 |
| Wikipedia Summarization | Language & Reasoning | 3 | 0 |
| Zero-day intrusion detection Detection of zero-day or new attacks | Computer Vision | 3 | 0 |
| Zero-Shot Action Recognition on UCF101 | Computer Vision | 3 | 0 |
| Zero-Shot Audio Retrieval | Audio & Speech | 3 | 0 |
| Zero-Shot Cross-Lingual Image-to-Text Retrieval | Multimodal & Vision-Language | 3 | 0 |
| Zero-Shot Cross-Lingual Text-to-Image Retrieval | Multimodal & Vision-Language | 3 | 0 |
| Zero-Shot Cross-Lingual Visual Natural Language Inference | Multimodal & Vision-Language | 3 | 0 |
| zero-shot long video breakpoint-mode question answering | Multimodal & Vision-Language | 3 | 0 |
| zero-shot long video global-model question answering | Multimodal & Vision-Language | 3 | 0 |
| zero-shot long video question answering | Multimodal & Vision-Language | 3 | 0 |
| Zero-Shot Scene Graph Generation | Graphs & Structured Data | 3 | 0 |
| Zero-Shot Visual Question Answering | Multimodal & Vision-Language | 3 | 0 |
| 2D Tiny Object Detection | Computer Vision | 2 | 0 |
| 3D Mesh Denoising 3D Mesh Denoising involves removing distortions—referred to … | Computer Vision | 2 | 0 |
| 3D Object Super-Resolution 3D object super-resolution is the task of up-sampling 3D obj… | Computer Vision | 2 | 0 |
| 3D Wireframe Reconstruction | Computer Vision | 2 | 0 |
| 4-ary Relation Extraction | Language & Reasoning | 2 | 0 |
| 4-task Classification | Foundations & Efficiency | 2 | 0 |
| Abstention Prediction | Language & Reasoning | 2 | 0 |
| Actin Detection | Computer Vision | 2 | 0 |
| Action Classification (1-shot) | Computer Vision | 2 | 0 |
| Action Recognition on HMDB-51 | Computer Vision | 2 | 0 |
| Acute Stroke Lesion Segmentation | Computer Vision | 2 | 0 |
| ADP Prediction | Time Series & Forecasting | 2 | 0 |
| Adversarial Attack on Video Classification Use an optimizer to add perturbations to video frames to fool the vid… | Computer Vision | 2 | 0 |
| Adversarial Defense against FGSM Attack | Foundations & Efficiency | 2 | 0 |
| Aesthetic Image Captioning | Multimodal & Vision-Language | 2 | 0 |
| Age/Bias-conflicting | Computer Vision | 2 | 0 |
| Airbubbles Detection | Computer Vision | 2 | 0 |
| ALS Detection | Computer Vision | 2 | 0 |
| Amodal Layout Estimation Amodal scene layout estimation involves estimating the stati… | Computer Vision | 2 | 0 |
| Amodal Panoptic Segmentation The goal of this task is to simultaneously predict the pixel… | Computer Vision | 2 | 0 |
| Amodal Tracking | Computer Vision | 2 | 0 |
| Analogical questions | Language & Reasoning | 2 | 0 |
| Analytic Entailment | Language & Reasoning | 2 | 0 |