SOTAVerified

Attribute

Papers

Showing 101150 of 5387 papers

TitleStatusHype
Subobject-level Image TokenizationCode2
Oceanship: A Large-Scale Dataset for Underwater Audio Target RecognitionCode2
Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language UnderstandingCode2
When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image GenerationCode2
SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote SensingCode2
Chat-Scene: Bridging 3D Scene and Large Language Models with Object IdentifiersCode2
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion ModelsCode2
GenEval: An Object-Focused Framework for Evaluating Text-to-Image AlignmentCode2
HairCLIPv2: Unifying Hair Editing via Proxy Feature BlendingCode2
BlendFace: Re-designing Identity Encoders for Face-SwappingCode2
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image GenerationCode2
GPT4RoI: Instruction Tuning Large Language Model on Region-of-InterestCode2
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar GenerationCode2
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion ModelsCode2
Link Prediction without Graph Neural NetworksCode2
Hierarchical Fine-Grained Image Forgery Detection and LocalizationCode2
HumanBench: Towards General Human-centric Perception with Projector Assisted PretrainingCode2
StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned FacesCode2
PACO: Parts and Attributes of Common ObjectsCode2
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language ModelsCode2
Hard Sample Aware Network for Contrastive Deep Graph ClusteringCode2
NMS Strikes BackCode2
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image SynthesisCode2
Spatio-Temporal Self-Supervised Learning for Traffic Flow PredictionCode2
High-fidelity 3D GAN Inversion by Pseudo-multi-view OptimizationCode2
MARLIN: Masked Autoencoder for facial video Representation LearnINgCode2
FaceDancer: Pose- and Occlusion-Aware High Fidelity Face SwappingCode2
DigiFace-1M: 1 Million Digital Face Images for Face RecognitionCode2
Omnigrok: Grokking Beyond Algorithmic DataCode2
A Survey of Machine UnlearningCode2
CelebV-HQ: A Large-Scale Video Facial Attributes DatasetCode2
Point-to-Box Network for Accurate Object Detection via Single Point SupervisionCode2
CLIP-Art: Contrastive Pre-training for Fine-Grained Art ClassificationCode2
Uncertainty-Informed Deep Learning Models Enable High-Confidence Predictions for Digital HistopathologyCode2
Video Polyp Segmentation: A Deep Learning PerspectiveCode2
Respecting causality is all you need for training physics-informed neural networksCode2
Restoring and attributing ancient texts using deep neural networksCode2
MetaFormer: A Unified Meta Framework for Fine-Grained RecognitionCode2
Tiny Object Tracking: A Large-scale Dataset and A BaselineCode2
Pedestrian Detection: Domain Generalization, CNNs, Transformers and BeyondCode2
StyleSpace Analysis: Disentangled Controls for StyleGAN Image GenerationCode2
Modular Primitives for High-Performance Differentiable RenderingCode2
StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing FlowsCode2
Closed-Form Factorization of Latent Semantics in GANsCode2
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANsCode2
MMFashion: An Open-Source Toolbox for Visual Fashion AnalysisCode2
Plug and Play Language Models: A Simple Approach to Controlled Text GenerationCode2
Interpreting the Latent Space of GANs for Semantic Face EditingCode2
Toward Controlled Generation of TextCode2
Rethinking Cross-Modal Interaction in Multimodal Diffusion TransformersCode1
Show:102550
← PrevPage 3 of 108Next →

No leaderboard results yet.