SOTAVerified

Disentanglement

This is an approach to solve a diverse set of tasks in a data efficient manner by disentangling (or isolating ) the underlying structure of the main problem into disjoint parts of its representations. This disentanglement can be done by focussing on the "transformation" properties of the world(main problem)

Papers

Showing 76100 of 1854 papers

TitleStatusHype
Human-aligned Deep Learning: Explainability, Causality, and Biological Inspiration0
ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos0
DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention0
VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification0
Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention0
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation0
Steering CLIP's vision transformer with sparse autoencoders0
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical ImagingCode1
VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing0
Learning Sparse Disentangled Representations for Multimodal Exclusion Retrieval0
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion TransferCode0
Efficient Model Editing with Task-Localized Sparse Fine-tuningCode0
EagleVision: Object-level Attribute Multimodal LLM for Remote SensingCode1
Language-Guided Trajectory Traversal in Disentangled Stable Diffusion Latent Space for Factorized Medical Image Generation0
Unsupervised Feature Disentanglement and Augmentation Network for One-class Face Anti-spoofing0
DIFFER: Disentangling Identity Features via Semantic Cues for Clothes-Changing Person Re-IDCode1
CMD-HAR: Cross-Modal Disentanglement for Wearable Human Activity Recognition0
NeuroLIP: Interpretable and Fair Cross-Modal Alignment of fMRI and Phenotypic Text0
Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage0
SLIP: Spoof-Aware One-Class Face Anti-Spoofing with Language Image PretrainingCode0
DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model0
Fast and Physically-based Neural Explicit Surface for Relightable Human Avatars0
Seeing Speech and Sound: Distinguishing and Locating Audios in Visual Scenes0
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model0
Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation0
Show:102550
← PrevPage 4 of 75Next →

No leaderboard results yet.