| Semi-Supervised Human Pose Estimation Semi-supervised human pose estimation aims to leverage the u… | Computer Vision | 5 | 0 |
| Semi-Supervised Person Re-Identification | Computer Vision | 5 | 0 |
| Sequence-to-sequence Language Modeling | Language & Reasoning | 5 | 0 |
| Sequential skip prediction | Audio & Speech | 5 | 0 |
| Sleep Micro-event detection | Computer Vision | 5 | 0 |
| Smile Recognition Smile recognition is the task of recognising a smiling face … | Computer Vision | 5 | 0 |
| Source-free Domain Generalization Source-free Domain Generalization aims to improve model's ge… | Computer Vision | 5 | 0 |
| Space group classification Predicting the space group of a material/nanomaterial graph. | Medical & Scientific | 5 | 0 |
| Sports Ball Detection and Tracking Detect the (x, y) coordinate of the ball location from each image … | Computer Vision | 5 | 0 |
| Stress-Strain Relation Data-driven techniques for finding stress-strain relation in… | Medical & Scientific | 5 | 0 |
| TDC ADMET Benchmarking Group ADMET is a cornerstone of small molecule drug discovery, def… | Foundations & Efficiency | 5 | 0 |
| Text Effects Transfer Text effects transfer refers to the task of transferring typ… | Language & Reasoning | 5 | 0 |
| Therapeutics Data Commons Therapeutics Data Commons is a coordinated initiative to acc… | Medical & Scientific | 5 | 0 |
| Time-Series Few-Shot Learning with Heterogeneous Channels | Time Series & Forecasting | 5 | 0 |
| Tropical Cyclone Intensity Forecasting | Time Series & Forecasting | 5 | 0 |
| Unsupervised face recognition | Computer Vision | 5 | 0 |
| Unsupervised Part-Of-Speech Tagging Marking up a word in a text (corpus) as corresponding to a p… | Language & Reasoning | 5 | 0 |
| Vector Quantization (k-means problem) Given a data set $X$ of $d$-dimensional numeric vectors and a … | Foundations & Efficiency | 5 | 0 |
| Video Deinterlacing | Computer Vision | 5 | 0 |
| Video Harmonization | Computer Vision | 5 | 0 |
| Vietnamese Image Captioning | Multimodal & Vision-Language | 5 | 0 |
| Vietnamese Multimodal Learning | Computer Vision | 5 | 0 |
| Violence and Weaponized Violence Detection | Computer Vision | 5 | 0 |
| Virtual Try-Off Virtual Try-Off (VTOFF) is a novel computer vision task that… | Computer Vision | 5 | 0 |
| Visual Abductive Reasoning | Computer Vision | 5 | 0 |
| Visual Keyword Spotting Spot a given query keyword in a silent talking face video | Audio & Speech | 5 | 0 |
| Visual Question Answering (VQA) Split A | Multimodal & Vision-Language | 5 | 0 |
| Visual Question Answering (VQA) Split B | Multimodal & Vision-Language | 5 | 0 |
| Weakly Supervised Defect Detection | Computer Vision | 5 | 0 |
| Weakly-Supervised Object Segmentation | Computer Vision | 5 | 0 |
| Weakly Supervised Referring Expression Segmentation RES with a smaller percentage of ground-truth annotations | Multimodal & Vision-Language | 5 | 0 |
| Winowhy | Language & Reasoning | 5 | 0 |
| Word-level pronunciation scoring Total score of a word's pronunciation. | Audio & Speech | 5 | 0 |
| Zero-shot Classification (unified classes) | Foundations & Efficiency | 5 | 0 |
| Zero Shot on BEIR (Inference Free Model) Zero-shot performance on BEIR with inference-free models | Language & Reasoning | 5 | 0 |
| Zero-shot Relation Triplet Extraction Given an input sentence, the task is to extract triplets con… | Graphs & Structured Data | 5 | 0 |
| Zero-shot Text-to-Video Generation | Multimodal & Vision-Language | 5 | 0 |
| 3D Character Animation From A Single Photo Image: [Weng et al](https://arxiv.org/pdf/1812.02246v1.pdf) | Computer Vision | 4 | 0 |
| 3D Holography The images that are presented here are multiplanar images th… | Graphs & Structured Data | 4 | 0 |
| 3D Parameter-Efficient Fine-Tuning for Classification | Computer Vision | 4 | 0 |
| 3D Pedestrian Tracking | Computer Vision | 4 | 0 |
| 3D Unsupervised Domain Adaptation | Computer Vision | 4 | 0 |
| Abstract Anaphora Resolution Abstract Anaphora Resolution aims to resolve nominal express… | Computer Vision | 4 | 0 |
| Activation Function Synthesis | Foundations & Efficiency | 4 | 0 |
| AI and Safety | Reinforcement Learning & Robotics | 4 | 0 |
| All-day Semantic Segmentation Semantic segmentation under all-day conditions | Computer Vision | 4 | 0 |
| Animal Action Recognition Cross-species (intra-class, inter-class) action recognition | Computer Vision | 4 | 0 |
| Anomaly Forecasting Anomaly forecasting is a critical aspect of modern data anal… | Time Series & Forecasting | 4 | 0 |
| Audio Effects Modeling Modeling of audio effects such as reverberation, compression… | Audio & Speech | 4 | 0 |
| Audio Fingerprint | Audio & Speech | 4 | 0 |