SOTAVerified

Disentanglement

Disentanglement is an approach to solving a diverse set of tasks in a data-efficient manner by disentangling (or isolating) the underlying structure of the main problem into disjoint parts of its representation. This can be done by focusing on the "transformation" properties of the world (the main problem).
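As a rough illustration of the idea of disjoint parts and transformations, the toy sketch below assumes a hypothetical world with two independent factors (position and colour). All names (`encode`, `shift_position`, `shift_colour`, `changed_slots`) are invented for this example and do not come from any of the listed papers. It only shows that, in a disentangled representation, each transformation of the world changes a single slot of the code.

```python
# Toy sketch under stated assumptions: two independent factors on a cyclic grid.
# Every name here is hypothetical and chosen only for illustration.
N = 5  # number of values each factor can take


def encode(position, colour):
    """Disentangled encoder: each factor is stored in its own slot."""
    return (position % N, colour % N)


def shift_position(z, k):
    """World transformation that acts only on the position slot."""
    pos, col = z
    return ((pos + k) % N, col)


def shift_colour(z, k):
    """World transformation that acts only on the colour slot."""
    pos, col = z
    return (pos, (col + k) % N)


def changed_slots(z_before, z_after):
    """Indices of the representation slots that differ between two codes."""
    return [i for i, (a, b) in enumerate(zip(z_before, z_after)) if a != b]


z = encode(position=2, colour=4)
moved = shift_position(z, 1)
recoloured = shift_colour(z, 1)

print(changed_slots(z, moved))       # [0]: only the position slot changed
print(changed_slots(z, recoloured))  # [1]: only the colour slot changed
```

In an entangled representation, the same transformations would typically change several slots at once, which is what makes the factors harder to reuse across tasks.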

Papers

Showing 1–50 of 1854 papers

Title | Status | Hype
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis | Code | 11
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone | Code | 6
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model | Code | 6
UniK3D: Universal Camera Monocular 3D Estimation | Code | 4
ControlVAE: Tuning, Analytical Properties, and Performance Analysis | Code | 4
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Code | 3
UCF: Uncovering Common Features for Generalizable Deepfake Detection | Code | 3
Sigmoid Loss for Language Image Pre-Training | Code | 3
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation | Code | 2
DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis | Code | 2
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation | Code | 2
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Code | 2
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control | Code | 2
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder | Code | 2
Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models | Code | 2
Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional Representation | Code | 2
DiffArtist: Towards Structure and Appearance Controllable Image Stylization | Code | 2
SaMoye: Zero-shot Singing Voice Conversion Model Based on Feature Disentanglement and Enhancement | Code | 2
ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement | Code | 2
Memory Mosaics | Code | 2
Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks | Code | 2
Preserving Fairness Generalization in Deepfake Detection | Code | 2
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis | Code | 2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment | Code | 2
Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction | Code | 2
When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation | Code | 2
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter | Code | 2
BlendFace: Re-designing Identity Encoders for Face-Swapping | Code | 2
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation | Code | 2
A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images | Code | 2
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing | Code | 2
Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement | Code | 2
eVAE: Evolutionary Variational Autoencoder | Code | 2
Fine-Grained Face Swapping via Regional GAN Inversion | Code | 2
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives | Code | 2
3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation | Code | 2
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers | Code | 2
MotionCLIP: Exposing Human Motion Generation to CLIP Space | Code | 2
Third Time's the Charm? Image and Video Editing with StyleGAN3 | Code | 2
Dual Spoof Disentanglement Generation for Face Anti-spoofing with Depth Uncertainty Learning | Code | 2
Compositional Transformers for Scene Generation | Code | 2
Generative Adversarial Transformers | Code | 2
Learning an Animatable Detailed 3D Face Model from In-The-Wild Images | Code | 2
Stylized Neural Painting | Code | 2
CausalVAE: Structured Causal Disentanglement in Variational Autoencoder | Code | 2
Adversarial Latent Autoencoders | Code | 2
Interpreting the Latent Space of GANs for Semantic Face Editing | Code | 2
Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations | Code | 2
A Style-Based Generator Architecture for Generative Adversarial Networks | Code | 2

No leaderboard results yet.