| Cross-domain 3D Human Pose Estimation | Computer Vision | 2 | 0 |
| Cross Encoder Reranking | Recommendation & Retrieval | 2 | 0 |
| Cross-environment ASR | Audio & Speech | 2 | 0 |
| Cross-lingual Text-to-Image Generation | Multimodal & Vision-Language | 2 | 0 |
| Cross-Part Crowd Counting | Computer Vision | 2 | 0 |
| CW Attack Detection | Computer Vision | 2 | 0 |
| Damaged Tissue Detection | Computer Vision | 2 | 0 |
| Dial Meter Reading | Computer Vision | 2 | 0 |
| Distributed Voting | Foundations & Efficiency | 2 | 0 |
| Document-level Closed Information Extraction Document-level closed information extraction (DocIE) is a su… | Language & Reasoning | 2 | 0 |
| Drawing Pictures | Generative Models | 2 | 0 |
| Driver Authentication | Computer Vision | 2 | 0 |
| Drone Pose Estimation | Computer Vision | 2 | 0 |
| Drum Transcription in Music (DTM) | Audio & Speech | 2 | 0 |
| Dynamic Region Segmentation | Computer Vision | 2 | 0 |
| ECG based Sleep Staging Sleep Staging from only ECG signal | Medical & Scientific | 2 | 0 |
| ECG QRS Detection | Medical & Scientific | 2 | 0 |
| ECG Wave Delineation Delineation of the waveforms P, T and QRS complexes from ECG… | Medical & Scientific | 2 | 0 |
| Email Thread Summarization Image credit: [EmailSum: Abstractive Email Thread Summarizat… | Language & Reasoning | 2 | 0 |
| EMG Signal Prediction | Medical & Scientific | 2 | 0 |
| ENF (Electric Network Frequency) Detection Detect if into a source (for example audio, video or raw val… | Computer Vision | 2 | 0 |
| Engaging Comment Classification | Language & Reasoning | 2 | 0 |
| Entailed Polarity | Language & Reasoning | 2 | 0 |
| Evaluating Information Essentiality | Language & Reasoning | 2 | 0 |
| Event-Driven Trading Making stock trading decisions based on events. | Time Series & Forecasting | 2 | 0 |
| Exception type | Language & Reasoning | 2 | 0 |
| Extended Summarization | Language & Reasoning | 2 | 0 |
| Face Dubbing | Computer Vision | 2 | 0 |
| Fake Image Attribution Attribute the origin (model/architecture) of fake images. | Computer Vision | 2 | 0 |
| Fake Song Detection | Computer Vision | 2 | 0 |
| Fantasy Reasoning | Language & Reasoning | 2 | 0 |
| Federated Lifelong Person ReID | Computer Vision | 2 | 0 |
| Few-shot Age Estimation Only choose 1/2/4/8/16/32/64 samples in the training set fro… | Foundations & Efficiency | 2 | 0 |
| Few-Shot Video Object Detection Few-Shot Video Object Detection (FSVOD): given only a few su… | Computer Vision | 2 | 0 |
| Figure Of Speech Detection | Audio & Speech | 2 | 0 |
| Fine-Grained Facial Editing | Computer Vision | 2 | 0 |
| Folded Tissue Detection | Computer Vision | 2 | 0 |
| Football Action Valuation | Time Series & Forecasting | 2 | 0 |
| Frame Duplication Detection | Computer Vision | 2 | 0 |
| General Action Video Anomaly Detection Detecting if an entire short clip of a any action features a… | Time Series & Forecasting | 2 | 0 |
| General Classification Selection | Language & Reasoning | 2 | 0 |
| Generalized Few-Shot Classification | Foundations & Efficiency | 2 | 0 |
| Generalized Zero-Shot Learning - Unseen The average of the normalized top-1 prediction scores of uns… | Time Series & Forecasting | 2 | 0 |
| Generative Semantic Nursing Generative Semantic Nursing is a task of intervening the gen… | Language & Reasoning | 2 | 0 |
| Geometrical View | Language & Reasoning | 2 | 0 |
| Geometry-based operator learning Deep Operator Networks for multi-geometry problems | Time Series & Forecasting | 2 | 0 |
| Gesture Synchronization | Computer Vision | 2 | 0 |
| Global 3D Human Pose Estimation Global 3D Human Pose Estimation is extending RGB-based human… | Computer Vision | 2 | 0 |
| GRE Reading Comprehension | Language & Reasoning | 2 | 0 |
| Grounded Multiple Object Tracking | Computer Vision | 2 | 0 |