| Instrument Playing Technique Detection | 3 | 0 |
| Intraoperative Tracking | 3 | 0 |
| Lossy-Compression Artifact Reduction | 3 | 0 |
| Manner Of Articulation Detection | 3 | 0 |
| Meme Captioning Automatic generation of natural language descriptions of the… | 3 | 0 |
| Multi-Focus Microscopical Images Fusion | 3 | 0 |
| Multimodal fashion image editing Given a target model image, a sketch and a textual descripti… | 3 | 0 |
| Multiple Action Detection | 3 | 0 |
| Multispectral Image Super-resolution | 3 | 0 |
| Neural Radiance Caching Involves the task of predicting photorealistic pixel colors … | 3 | 0 |
| Obfuscation Detection | 3 | 0 |
| One-Shot Instance Segmentation ( Image credit: [Siamese Mask R-CNN ](https://github.com/bet… | 3 | 0 |
| Open Vocabulary Keypoint Detection | 3 | 0 |
| Open-Vocabulary Video Segmentation | 3 | 0 |
| Open-World Video Segmentation | 3 | 0 |
| Patient-Specific Segmentation | 3 | 0 |
| Political Salient Issue Orientation Detection | 3 | 0 |
| Prostate Zones Segmentation | 3 | 0 |
| Real-Time 3D Semantic Segmentation | 3 | 0 |
| Reasoning About Colored Objects | 3 | 0 |
| Recognizing And Localizing Human Actions | 3 | 0 |
| Referring Image Matting Extracting the meticulous alpha matte of the specific object… | 3 | 0 |
| Referring Image Matting (Expression-based) Expression-based referring image matting, taking an image an… | 3 | 0 |
| Referring Image Matting (Keyword-based) Keyword-based referring image matting, taking an image and a… | 3 | 0 |
| Referring Image Matting (RefMatte-RW100) Expression-based referring image matting on natural images a… | 3 | 0 |
| Relational Captioning | 3 | 0 |
| Replay Grounding Replay grounding is introduced in SoccerNet-v2 in the case o… | 3 | 0 |
| Reverse Style Transfer | 3 | 0 |
| Root Joint Localization | 3 | 0 |
| Scene Classification (unified classes) | 3 | 0 |
| Seismic Detection When recording seismic ground motion in multiple sites using… | 3 | 0 |
| Semi-Supervised Video Classification | 3 | 0 |
| Single-shot HDR Reconstruction SVE-based HDR imaging, also known as single-shot HDR imaging… | 3 | 0 |
| Social Media Mental Health Detection | 3 | 0 |
| Specular Reflection Mitigation | 3 | 0 |
| Speculation Scope Resolution Identifiy the scope of a speculation cue that indicates unce… | 3 | 0 |
| Stenosis Segmentation | 3 | 0 |
| Style change detection | 3 | 0 |
| Thermal Image Denoising | 3 | 0 |
| Training-free Object Counting | 3 | 0 |
| Transparency Separation | 3 | 0 |
| Unsupervised 3D Semantic Segmentation Unsupervised 3D Semantic Segmentation | 3 | 0 |
| Unsupervised Zero-Shot Instance Segmentation | 3 | 0 |
| Vehicle Color Recognition Vehicle Color Recognition (VCR) involves developing a system… | 3 | 0 |
| Video Narrative Grounding Video Narrative Grounding is the task of linking video narra… | 3 | 0 |
| Video Relationship | 3 | 0 |
| Video-to-Shop | 3 | 0 |
| Vietnamese Scene Text | 3 | 0 |
| Webcam (RGB) image classification | 3 | 0 |
| Webpage Object Detection | 3 | 0 |