| Plane Instance Segmentation | Computer Vision | 4 | 0 |
| Point-interactive Image Colorization Point-interactive colorization is a task of colorizing image… | Computer Vision | 4 | 0 |
| Printed Text Recognition | Language & Reasoning | 4 | 0 |
| Protein Annotation | Graphs & Structured Data | 4 | 0 |
| Quantum Chemistry Regression | Medical & Scientific | 4 | 0 |
| Reasoning Video Object Segmentation | Computer Vision | 4 | 0 |
| Reliable Intelligence Identification | Language & Reasoning | 4 | 0 |
| RF-based Action Recognition | Computer Vision | 4 | 0 |
| Self-Supervised Person Re-Identification Currently, self-supervised representation learning is mainly… | Computer Vision | 4 | 0 |
| Semantic Image-Text Similarity | Multimodal & Vision-Language | 4 | 0 |
| Semantic Role Labeling (predicted predicates) PropBank semantic role labeling with predicted predicates. | Language & Reasoning | 4 | 0 |
| Semi-Supervised Action Detection | Computer Vision | 4 | 0 |
| Sequential Place Learning State-of-the-art algorithms for route-based place recognitio… | Time Series & Forecasting | 4 | 0 |
| Single-cell modeling Single Cell RNA sequencing (scRNAseq) revolutionized our und… | Computer Vision | 4 | 0 |
| Single Choice Question | Language & Reasoning | 4 | 0 |
| Sleep Quality Prediction ( Image credit: [DeepSleep](https://github.com/GuanLab/DeepS… | Medical & Scientific | 4 | 0 |
| SNES Games The task is to train an agent to play SNES games such as Sup… | Reinforcement Learning & Robotics | 4 | 0 |
| Sonnet Generation Generating a poetry in the form of a sonnet. | Generative Models | 4 | 0 |
| Sound Prompted Semantic Segmentation Sound prompted semantic segmentation aims to predict a segme… | Audio & Speech | 4 | 0 |
| Soundscape evaluation Evaluation of soundscape in accordance to ISO/TS 12913-2 | Audio & Speech | 4 | 0 |
| Source Code Authorship Attribution | Foundations & Efficiency | 4 | 0 |
| Span-Extraction MRC | Language & Reasoning | 4 | 0 |
| spatial-aware image editing Spatial-aware image editing is an image editing technique th… | Computer Vision | 4 | 0 |
| Spatial Token Mixer Spatial Token Mixer (STM) is a module for vision transformer… | Computer Vision | 4 | 0 |
| Speaker-Specific Lip to Speech Synthesis How accurately can we infer an individual’s speech style and… | Audio & Speech | 4 | 0 |
| Speculation Detection Identifying information in text that is speculative as oppos… | Computer Vision | 4 | 0 |
| Speech Prompted Semantic Segmentation Speech prompted semantic segmentation aims to predict semant… | Audio & Speech | 4 | 0 |
| Steganographics | Graphs & Structured Data | 4 | 0 |
| Subject-driven Video Generation | Computer Vision | 4 | 0 |
| Suspicous (BIRADS 4,5)-no suspicous (BIRADS 1,2,3) per image classification Per image binary classification for mammography images based… | Computer Vision | 4 | 0 |
| Terrain Estimation | Computer Vision | 4 | 0 |
| Text2Sparql Conversion from natural language questions to SPARQL queries | Graphs & Structured Data | 4 | 0 |
| Text-Based Stock Prediction Make stock predictions based on text (e.g., news articles, t… | Time Series & Forecasting | 4 | 0 |
| Text Classification (Sentiment Analysis) | Language & Reasoning | 4 | 0 |
| Text-Line Extraction | Computer Vision | 4 | 0 |
| Text-to-video search | Multimodal & Vision-Language | 4 | 0 |
| Text-Variation Generate variations of the input text | Language & Reasoning | 4 | 0 |
| Timedial | Language & Reasoning | 4 | 0 |
| Time-interval Prediction | Time Series & Forecasting | 4 | 0 |
| Time Series Denoising | Time Series & Forecasting | 4 | 0 |
| Time Series Streams | Time Series & Forecasting | 4 | 0 |
| Trademark Retrieval | Recommendation & Retrieval | 4 | 0 |
| Transparent Object Depth Estimation Estimating the 3D shape of transparent objects | Computer Vision | 4 | 0 |
| Tweet Retrieval | Recommendation & Retrieval | 4 | 0 |
| Uncertainty-Aware Panoptic Segmentation The ternary difficulty levels assigned to each pixel during … | Computer Vision | 4 | 0 |
| Unconstrained Lip-synchronization Given a video of an arbitrary person, and an arbitrary drivi… | Computer Vision | 4 | 0 |
| Underwater 3D Scene Reconstruction | Computer Vision | 4 | 0 |
| Unsupervised semantic parsing | Language & Reasoning | 4 | 0 |
| Unsupervised Sentence Compression Producing a shorter sentence by removing redundant informati… | Foundations & Efficiency | 4 | 0 |
| Utterance-level pronounciation scoring Total pronunciation score of an utterance. | Audio & Speech | 4 | 0 |