| Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation | May 7, 2025 | 3D GenerationAttribute | CodeCode Available | 2 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | AttributeCommon Sense Reasoning | CodeCode Available | 2 |
| Faceptor: A Generalist Model for Face Perception | Mar 14, 2024 | Age EstimationAttribute | CodeCode Available | 2 |
| EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents | Jan 21, 2025 | AttributeQuestion Answering | CodeCode Available | 2 |
| Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation | Mar 26, 2025 | AttributeSemantic Segmentation | CodeCode Available | 2 |
| FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping | Oct 19, 2022 | AttributeDecoder | CodeCode Available | 2 |
| Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models | Jan 25, 2025 | AttributeContrastive Learning | CodeCode Available | 2 |
| DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Apr 20, 2025 | AttributeFace Swapping | CodeCode Available | 2 |
| DigiFace-1M: 1 Million Digital Face Images for Face Recognition | Oct 5, 2022 | AttributeFace Recognition | CodeCode Available | 2 |
| Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring | Jun 11, 2024 | AttributeDomain Generalization | CodeCode Available | 2 |