| Traffic Data Imputation | Time Series & Forecasting | 23 | 0 |
| 3D Point Cloud Reconstruction Encoding and reconstruction of 3D point clouds. | Computer Vision | 22 | 0 |
| 3D Question Answering (3D-QA) A 3D-QA task requires models to answer a question when given… | Multimodal & Vision-Language | 22 | 0 |
| Aerial Scene Classification | Computer Vision | 22 | 0 |
| Building change detection for remote sensing images | Computer Vision | 22 | 0 |
| Constituency Grammar Induction Inducing a constituency-based phrase structure grammar. | Language & Reasoning | 22 | 0 |
| Conversational Response Generation Given an input conversation, generate a natural-looking text… | Language & Reasoning | 22 | 0 |
| Coronary Artery Segmentation | Computer Vision | 22 | 0 |
| Cross Document Coreference Resolution | Language & Reasoning | 22 | 0 |
| Cross-Lingual Entity Linking Cross-lingual entity linking is the task of using data and m… | Language & Reasoning | 22 | 0 |
| De-aliasing De-aliasing is the problem of recovering the original high-f… | Computer Vision | 22 | 0 |
| Design Synthesis | Medical & Scientific | 22 | 0 |
| Evidence Selection | Recommendation & Retrieval | 22 | 0 |
| Facial expression generation | Computer Vision | 22 | 0 |
| Human Agent Collaboration | Computer Vision | 22 | 0 |
| Image Rescaling Image rescaling is a bidirectional operation, which first do… | Computer Vision | 22 | 0 |
| Image-to-Image Regression | Computer Vision | 22 | 0 |
| Image to Point Cloud Registration Given a query image and a scene of point cloud, get the came… | Computer Vision | 22 | 0 |
| Linguistic steganography | Graphs & Structured Data | 22 | 0 |
| ListOps | Foundations & Efficiency | 22 | 0 |
| LLM-generated Text Detection Classifying human-written and LLM-generated texts. | Language & Reasoning | 22 | 0 |
| Long Term Action Anticipation | Computer Vision | 22 | 0 |
| Natural Language Moment Retrieval | Language & Reasoning | 22 | 0 |
| Predicate Classification | Language & Reasoning | 22 | 0 |
| Raindrop Removal | Computer Vision | 22 | 0 |
| Real-Time Visual Tracking | Computer Vision | 22 | 0 |
| Sar Image Despeckling Despeckling is the task of suppressing speckle from Syntheti… | Computer Vision | 22 | 0 |
| Sequence-To-Sequence Speech Recognition | Audio & Speech | 22 | 0 |
| Short-Text Conversation Given a short text, finding an appropriate response (Source:… | Language & Reasoning | 22 | 0 |
| Unsupervised Few-Shot Learning In contrast to supervised few-shot learning, only the unlabe… | Time Series & Forecasting | 22 | 0 |
| Unsupervised Image Registration | Computer Vision | 22 | 0 |
| 3D Inpainting 3D Inpainting is the removal of unwanted objects from a 3D s… | Computer Vision | 21 | 0 |
| Chord Recognition | Audio & Speech | 21 | 0 |
| Data Interaction Measure and value the train data interaction to interpret th… | Language & Reasoning | 21 | 0 |
| Egocentric Pose Estimation | Computer Vision | 21 | 0 |
| Graphon Estimation | Graphs & Structured Data | 21 | 0 |
| Low-latency processing | Foundations & Efficiency | 21 | 0 |
| Models Alignment Models Alignment is the process of ensuring that multiple mo… | Language & Reasoning | 21 | 0 |
| Motion Interpolation 3D human motion sequences interpolation and completion | Computer Vision | 21 | 0 |
| Neural Network simulation Simulation of abstract or biophysical neural networks in sil… | Foundations & Efficiency | 21 | 0 |
| Panoptic Scene Graph Generation PSG task abstracts the given image with a scene graph, where… | Graphs & Structured Data | 21 | 0 |
| Predicting Patient Outcomes | Medical & Scientific | 21 | 0 |
| Raspberry Pi 3 | Medical & Scientific | 21 | 0 |
| Speech Tokenization Speech tokenization is the task of representing speech signa… | Audio & Speech | 21 | 0 |
| Subject Transfer | Foundations & Efficiency | 21 | 0 |
| text-guided-generation | Generative Models | 21 | 0 |
| Text-Independent Speaker Recognition | Audio & Speech | 21 | 0 |
| Zero-shot Slot Filling | Language & Reasoning | 21 | 0 |
| Aesthetics Quality Assessment Automatic assessment of aesthetic-related subjective ratings… | Computer Vision | 20 | 0 |
| Audio-Visual Question Answering (AVQA) | Audio & Speech | 20 | 0 |