SOTAVerified

Attribute

Papers

Showing 126150 of 5387 papers

TitleStatusHype
Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet EncodingsCode2
Is CLIP ideal? No. Can we fix it? Yes!Code2
Link Prediction without Graph Neural NetworksCode2
GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement LearningCode2
GPT4RoI: Instruction Tuning Large Language Model on Region-of-InterestCode2
Faceptor: A Generalist Model for Face PerceptionCode2
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language ModelsCode2
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion ModelingCode2
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic SegmentationCode2
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image InpaintingCode2
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group LearningCode2
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic ResolutionCode2
FaceDancer: Pose- and Occlusion-Aware High Fidelity Face SwappingCode2
DigiFace-1M: 1 Million Digital Face Images for Face RecognitionCode2
Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition MonitoringCode2
GenEval: An Object-Focused Framework for Evaluating Text-to-Image AlignmentCode2
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D GenerationCode2
DynRefer: Delving into Region-level Multimodal Tasks via Dynamic ResolutionCode2
Hard Sample Aware Network for Contrastive Deep Graph ClusteringCode2
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic DirectionsCode2
Hierarchical Fine-Grained Image Forgery Detection and LocalizationCode2
EmbodiedEval: Evaluate Multimodal LLMs as Embodied AgentsCode2
A Synthetic Dataset for Personal Attribute InferenceCode2
Point-to-Box Network for Accurate Object Detection via Single Point SupervisionCode2
COLA: A Benchmark for Compositional Text-to-image RetrievalCode1
Show:102550
← PrevPage 6 of 216Next →

No leaderboard results yet.