| Subobject-level Image Tokenization | Feb 22, 2024 | AttributeLanguage Modeling | CodeCode Available | 2 |
| Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition | Jan 4, 2024 | AttributeAudio Classification | CodeCode Available | 2 |
| Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding | Jan 1, 2024 | Attribute | CodeCode Available | 2 |
| When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation | Jan 1, 2024 | AttributeDisentanglement | CodeCode Available | 2 |
| SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing | Dec 20, 2023 | AttributeCross-Modal Retrieval | CodeCode Available | 2 |
| Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers | Dec 13, 2023 | 3D Question Answering (3D-QA)Attribute | CodeCode Available | 2 |
| RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models | Dec 7, 2023 | AttributeVideo Editing | CodeCode Available | 2 |
| GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment | Oct 17, 2023 | AttributeObject | CodeCode Available | 2 |
| HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending | Oct 16, 2023 | Attribute | CodeCode Available | 2 |
| BlendFace: Re-designing Identity Encoders for Face-Swapping | Jul 20, 2023 | AttributeDisentanglement | CodeCode Available | 2 |
| T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation | Jul 12, 2023 | AttributeImage Generation | CodeCode Available | 2 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | AttributeCommon Sense Reasoning | CodeCode Available | 2 |
| StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation | May 30, 2023 | 3D GenerationAttribute | CodeCode Available | 2 |
| Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models | May 29, 2023 | Attribute | CodeCode Available | 2 |
| Link Prediction without Graph Neural Networks | May 23, 2023 | AttributeGraph Learning | CodeCode Available | 2 |
| Hierarchical Fine-Grained Image Forgery Detection and Localization | Mar 30, 2023 | AttributeClassification | CodeCode Available | 2 |
| HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining | Mar 10, 2023 | AttributeAutonomous Driving | CodeCode Available | 2 |
| StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces | Mar 10, 2023 | AttributeSuper-Resolution | CodeCode Available | 2 |
| PACO: Parts and Attributes of Common Objects | Jan 4, 2023 | 2D Object DetectionAttribute | CodeCode Available | 2 |
| Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models | Dec 31, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Hard Sample Aware Network for Contrastive Deep Graph Clustering | Dec 16, 2022 | AttributeClustering | CodeCode Available | 2 |
| NMS Strikes Back | Dec 12, 2022 | Attributeobject-detection | CodeCode Available | 2 |
| Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis | Dec 9, 2022 | AttributeImage Generation | CodeCode Available | 2 |
| Spatio-Temporal Self-Supervised Learning for Traffic Flow Prediction | Dec 7, 2022 | AttributePrediction | CodeCode Available | 2 |
| High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization | Nov 28, 2022 | AttributeGenerative Adversarial Network | CodeCode Available | 2 |
| MARLIN: Masked Autoencoder for facial video Representation LearnINg | Nov 12, 2022 | Action ClassificationAttribute | CodeCode Available | 2 |
| FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping | Oct 19, 2022 | AttributeDecoder | CodeCode Available | 2 |
| DigiFace-1M: 1 Million Digital Face Images for Face Recognition | Oct 5, 2022 | AttributeFace Recognition | CodeCode Available | 2 |
| Omnigrok: Grokking Beyond Algorithmic Data | Oct 3, 2022 | AttributeRepresentation Learning | CodeCode Available | 2 |
| A Survey of Machine Unlearning | Sep 6, 2022 | AttributeMachine Unlearning | CodeCode Available | 2 |
| CelebV-HQ: A Large-Scale Video Facial Attributes Dataset | Jul 25, 2022 | AttributeDiversity | CodeCode Available | 2 |
| Point-to-Box Network for Accurate Object Detection via Single Point Supervision | Jul 14, 2022 | AttributeMultiple Instance Learning | CodeCode Available | 2 |
| CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification | Apr 29, 2022 | AttributeClassification | CodeCode Available | 2 |
| Uncertainty-Informed Deep Learning Models Enable High-Confidence Predictions for Digital Histopathology | Apr 9, 2022 | AttributeUncertainty Quantification | CodeCode Available | 2 |
| Video Polyp Segmentation: A Deep Learning Perspective | Mar 27, 2022 | AttributeDeep Learning | CodeCode Available | 2 |
| Respecting causality is all you need for training physics-informed neural networks | Mar 14, 2022 | AllAttribute | CodeCode Available | 2 |
| Restoring and attributing ancient texts using deep neural networks | Mar 9, 2022 | Ancient Text RestorationAttribute | CodeCode Available | 2 |
| MetaFormer: A Unified Meta Framework for Fine-Grained Recognition | Mar 5, 2022 | AttributeFine-Grained Image Classification | CodeCode Available | 2 |
| Tiny Object Tracking: A Large-scale Dataset and A Baseline | Feb 11, 2022 | AttributeKnowledge Distillation | CodeCode Available | 2 |
| Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond | Jan 10, 2022 | AttributeAutonomous Driving | CodeCode Available | 2 |
| StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation | Nov 25, 2020 | AttributeImage Generation | CodeCode Available | 2 |
| Modular Primitives for High-Performance Differentiable Rendering | Nov 6, 2020 | AttributeInverse Rendering | CodeCode Available | 2 |
| StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows | Aug 6, 2020 | Attribute | CodeCode Available | 2 |
| Closed-Form Factorization of Latent Semantics in GANs | Jul 13, 2020 | AttributeForm | CodeCode Available | 2 |
| InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs | May 18, 2020 | AttributeFace Generation | CodeCode Available | 2 |
| MMFashion: An Open-Source Toolbox for Visual Fashion Analysis | May 18, 2020 | AttributeRetrieval | CodeCode Available | 2 |
| Plug and Play Language Models: A Simple Approach to Controlled Text Generation | Dec 4, 2019 | AttributeLanguage Modelling | CodeCode Available | 2 |
| Interpreting the Latent Space of GANs for Semantic Face Editing | Jul 25, 2019 | AttributeDisentanglement | CodeCode Available | 2 |
| Toward Controlled Generation of Text | Mar 2, 2017 | AttributeSentence | CodeCode Available | 2 |
| Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers | Jun 9, 2025 | Attribute | CodeCode Available | 1 |