SOTAVerified

Attribute

Papers

Showing 101125 of 5387 papers

TitleStatusHype
DynRefer: Delving into Region-level Multimodal Tasks via Dynamic ResolutionCode2
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic SegmentationCode2
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image InpaintingCode2
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group LearningCode2
Modular Primitives for High-Performance Differentiable RenderingCode2
MVGamba: Unify 3D Content Generation as State Space Sequence ModelingCode2
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic ResolutionCode2
Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition MonitoringCode2
On the Role of Attention Heads in Large Language Model SafetyCode2
OpenFACADES: An Open Framework for Architectural Caption and Attribute Data Enrichment via Street View ImageryCode2
DigiFace-1M: 1 Million Digital Face Images for Face RecognitionCode2
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language ModelsCode2
Point-to-Box Network for Accurate Object Detection via Single Point SupervisionCode2
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language PromptCode2
FaceDancer: Pose- and Occlusion-Aware High Fidelity Face SwappingCode2
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept MatchingCode2
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic DirectionsCode2
BlendFace: Re-designing Identity Encoders for Face-SwappingCode2
Closed-Form Factorization of Latent Semantics in GANsCode2
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion ModelingCode2
ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape DisentanglementCode2
RouteFinder: Towards Foundation Models for Vehicle Routing ProblemsCode2
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMsCode2
Oceanship: A Large-Scale Dataset for Underwater Audio Target RecognitionCode2
CLIP-Art: Contrastive Pre-training for Fine-Grained Art ClassificationCode2
Show:102550
← PrevPage 5 of 216Next →

No leaderboard results yet.