SOTAVerified

Disentanglement

This is an approach to solve a diverse set of tasks in a data efficient manner by disentangling (or isolating ) the underlying structure of the main problem into disjoint parts of its representations. This disentanglement can be done by focussing on the "transformation" properties of the world(main problem)

Papers

Showing 150 of 1854 papers

TitleStatusHype
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head SynthesisCode11
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelCode6
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from BackboneCode6
UniK3D: Universal Camera Monocular 3D EstimationCode4
ControlVAE: Tuning, Analytical Properties, and Performance AnalysisCode4
Sigmoid Loss for Language Image Pre-TrainingCode3
UCF: Uncovering Common Features for Generalizable Deepfake DetectionCode3
DEADiff: An Efficient Stylization Diffusion Model with Disentangled RepresentationsCode3
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlCode2
ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape DisentanglementCode2
Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival PredictionCode2
SaMoye: Zero-shot Singing Voice Conversion Model Based on Feature Disentanglement and EnhancementCode2
Stylized Neural PaintingCode2
Challenging Common Assumptions in the Unsupervised Learning of Disentangled RepresentationsCode2
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style AdapterCode2
Memory MosaicsCode2
Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional RepresentationCode2
Generative Adversarial TransformersCode2
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model DisentanglementCode2
Fine-Grained Face Swapping via Regional GAN InversionCode2
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image GenerationCode2
Learning an Animatable Detailed 3D Face Model from In-The-Wild ImagesCode2
Preserving Fairness Generalization in Deepfake DetectionCode2
Realistic and Efficient Face Swapping: A Unified Approach with Diffusion ModelsCode2
Dual Spoof Disentanglement Generation for Face Anti-spoofing with Depth Uncertainty LearningCode2
CausalVAE: Structured Causal Disentanglement in Variational AutoencoderCode2
DLF: Disentangled-Language-Focused Multimodal Sentiment AnalysisCode2
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text EncoderCode2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentCode2
Third Time's the Charm? Image and Video Editing with StyleGAN3Code2
Compositional Transformers for Scene GenerationCode2
Compositional Transformers for Scene GenerationCode2
Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized TasksCode2
A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild ImagesCode2
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical PerspectivesCode2
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image SynthesisCode2
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D GenerationCode2
DPE: Disentanglement of Pose and Expression for General Video Portrait EditingCode2
Adversarial Latent AutoencodersCode2
DiffArtist: Towards Structure and Appearance Controllable Image StylizationCode2
eVAE: Evolutionary Variational AutoencoderCode2
3DFaceShop: Explicitly Controllable 3D-Aware Portrait GenerationCode2
Generative Time Series Forecasting with Diffusion, Denoise, and DisentanglementCode2
A Style-Based Generator Architecture for Generative Adversarial NetworksCode2
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling SpeakersCode2
Interpreting the Latent Space of GANs for Semantic Face EditingCode2
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face AnimationCode2
MotionCLIP: Exposing Human Motion Generation to CLIP SpaceCode2
BlendFace: Re-designing Identity Encoders for Face-SwappingCode2
When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image GenerationCode2
Show:102550
← PrevPage 1 of 38Next →

No leaderboard results yet.