| GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models | Jan 2, 2025 | Scene Understandingtext annotation | CodeCode Available | 4 |
| ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks | Mar 27, 2023 | text annotationText Classification | CodeCode Available | 4 |
| Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity | Dec 9, 2024 | Anomaly Detectiontext annotation | CodeCode Available | 2 |
| Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset | Jul 3, 2023 | Human Mesh RecoveryMotion Generation | CodeCode Available | 2 |
| POTATO: The Portable Text Annotation Tool | Dec 16, 2022 | Active Learningtext annotation | CodeCode Available | 2 |
| FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion | Oct 27, 2022 | Data Augmentationtext annotation | CodeCode Available | 2 |
| LViT: Language meets Vision Transformer in Medical Image Segmentation | Jun 29, 2022 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Fine-grained Image Captioning with CLIP Reward | May 26, 2022 | Caption GenerationDescriptive | CodeCode Available | 2 |
| DoTAT: A Domain-oriented Text Annotation Tool | May 1, 2022 | text annotation | CodeCode Available | 2 |
| Probably Approximately Correct Labels | Jun 12, 2025 | Protein Foldingtext annotation | CodeCode Available | 1 |