| Cross-Lingual Abstractive Summarization | Language & Reasoning | 15 | 0 |
| Exposure Fairness | Foundations & Efficiency | 15 | 0 |
| Face Image Retrieval Face image retrieval is the task of retrieving faces similar… | Computer Vision | 15 | 0 |
| Few-Shot Imitation Learning | Reinforcement Learning & Robotics | 15 | 0 |
| Finger Vein Recognition | Computer Vision | 15 | 0 |
| Generalized Referring Expression Segmentation Generalized Referring Expression Segmentation (GRES), introd… | Multimodal & Vision-Language | 15 | 0 |
| highlight removal Highlight removal refers to the process of eliminating or re… | Computer Vision | 15 | 0 |
| Key Detection | Computer Vision | 15 | 0 |
| Known Unknowns Language models have a tendency to generate text containing … | Language & Reasoning | 15 | 0 |
| Lightweight Deployment | Foundations & Efficiency | 15 | 0 |
| Logo Recognition | Computer Vision | 15 | 0 |
| Myocardial infarction detection | Computer Vision | 15 | 0 |
| Position regression Prediction of the absolute 3D position of nodes in molecular… | Foundations & Efficiency | 15 | 0 |
| Procedural Text Understanding | Language & Reasoning | 15 | 0 |
| Procedure Learning Given a set of videos of the same task, the goal is to ident… | Time Series & Forecasting | 15 | 0 |
| Production Forecasting | Time Series & Forecasting | 15 | 0 |
| Proper Noun | Language & Reasoning | 15 | 0 |
| Prosody Prediction Predicting prosodic prominence from text. This is a 2-way cl… | Audio & Speech | 15 | 0 |
| Pupil Detection | Computer Vision | 15 | 0 |
| Radar waveform design | Audio & Speech | 15 | 0 |
| Real-World Adversarial Attack Adversarial attacks that are presented in the real world | Foundations & Efficiency | 15 | 0 |
| Saliency Ranking | Computer Vision | 15 | 0 |
| SAR Ship Detection Ship detection based on SAR images | Computer Vision | 15 | 0 |
| Sentence ReWriting | Language & Reasoning | 15 | 0 |
| Survey Sampling | Foundations & Efficiency | 15 | 0 |
| Table Search | Language & Reasoning | 15 | 0 |
| Temporal Tagging Identification of the extent of a temporal expression (timex… | Time Series & Forecasting | 15 | 0 |
| TGIF-Frame | Multimodal & Vision-Language | 15 | 0 |
| Tomographic Reconstructions | Graphs & Structured Data | 15 | 0 |
| Unconditional Video Generation | Computer Vision | 15 | 0 |
| Unsupervised 3D Human Pose Estimation | Computer Vision | 15 | 0 |
| Unsupervised Facial Landmark Detection Facial landmark detection in the unsupervised setting popula… | Computer Vision | 15 | 0 |
| Variable misuse | Language & Reasoning | 15 | 0 |
| Video-based Generative Performance Benchmarking (Consistency) The benchmark evaluates a generative Video Conversational Mo… | Computer Vision | 15 | 0 |
| Video-based Generative Performance Benchmarking (Detail Orientation)) The benchmark evaluates a generative Video Conversational Mo… | Computer Vision | 15 | 0 |
| Video-based Generative Performance Benchmarking (Temporal Understanding) The benchmark evaluates a generative Video Conversational Mo… | Time Series & Forecasting | 15 | 0 |
| Video Visual Relation Detection Video Visual Relation Detection (VidVRD) aims to detect inst… | Computer Vision | 15 | 0 |
| Writer Retrieval | Recommendation & Retrieval | 15 | 0 |
| Zero-shot Text-to-Image Retrieval | Multimodal & Vision-Language | 15 | 0 |
| 3D Object Detection From Stereo Images Estimating oriented 3D bounding boxes from Stereo Cameras on… | Computer Vision | 14 | 0 |
| Active Object Detection Active Learning for Object Detection | Computer Vision | 14 | 0 |
| Air Pollution Prediction | Medical & Scientific | 14 | 0 |
| Alzheimer's Detection | Computer Vision | 14 | 0 |
| Arabic Speech Recognition | Audio & Speech | 14 | 0 |
| Argument Retrieval | Recommendation & Retrieval | 14 | 0 |
| Aspect Term Extraction and Sentiment Classification Extracting the aspect terms as well as the corresponding sen… | Language & Reasoning | 14 | 0 |
| Contact mechanics | Computer Vision | 14 | 0 |
| Continual Self-Supervised Learning | Computer Vision | 14 | 0 |
| controllable image captioning generate image captions conditioned on control signals | Multimodal & Vision-Language | 14 | 0 |
| COVID-19 Image Segmentation | Medical & Scientific | 14 | 0 |