SOTAVerified

Attribute

Papers

Showing 301350 of 5387 papers

TitleStatusHype
Masked Attribute Description Embedding for Cloth-Changing Person Re-identificationCode1
AG-ReID.v2: Bridging Aerial and Ground Views for Person Re-identificationCode1
Boosting Spike Camera Image Reconstruction from a Perspective of Dealing with Spike FluctuationsCode1
Symbolic Cognitive Diagnosis via Hybrid Optimization for Intelligent Education SystemsCode1
Embedded feature selection in LSTM networks with multi-objective evolutionary ensemble learning for time series forecastingCode1
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image DetectionCode1
C2T-Net: Channel-Aware Cross-Fused Transformer-Style Networks for Pedestrian Attribute RecognitionCode1
Enhancing User Intent Capture in Session-Based Recommendation with Attribute PatternsCode1
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object DetectionCode1
TagAlign: Improving Vision-Language Alignment with Multi-Tag ClassificationCode1
HyperEditor: Achieving Both Authenticity and Cross-Domain Capability in Image Editing via HypernetworksCode1
Upper Bounding Barlow Twins: A Novel Filter for Multi-Relational ClusteringCode1
Decoupled Textual Embeddings for Customized Image GenerationCode1
AVA: Inconspicuous Attribute Variation-based Adversarial Attack bypassing DeepFake DetectionCode1
Proxy-based Item Representation for Attribute and Context-aware RecommendationCode1
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric PerceptionCode1
Open-Vocabulary Segmentation with Semantic-Assisted CalibrationCode1
Guided Reconstruction with Conditioned Diffusion Models for Unsupervised Anomaly Detection in Brain MRIsCode1
Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian ParticleCode1
Does Vector Quantization Fail in Spatio-Temporal Forecasting? Exploring a Differentiable Sparse Soft-Vector Quantization ApproachCode1
Identifying Spurious Correlations using Counterfactual AlignmentCode1
Hypergraph Contrastive Learning for Drug Trafficking Community DetectionCode1
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language UnderstandingCode1
When StyleGAN Meets Stable Diffusion: a W_+ Adapter for Personalized Image GenerationCode1
Critical Influence of Overparameterization on Sharpness-aware MinimizationCode1
EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language ModelsCode1
Self-correcting LLM-controlled Diffusion ModelsCode1
Benchmarking Robustness of Text-Image Composed RetrievalCode1
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction DataCode1
LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual DescriptionsCode1
Exploring Variational Auto-Encoder Architectures, Configurations, and Datasets for Generative Music Explainable AICode1
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination EvaluationCode1
SEMQA: Semi-Extractive Multi-Source Question AnsweringCode1
A Simple and Efficient Baseline for Data Attribution on ImagesCode1
Towards Machine Unlearning Benchmarks: Forgetting the Personal Identities in Facial Recognition SystemsCode1
HAP: Structure-Aware Masked Image Modeling for Human-Centric PerceptionCode1
GaitFormer: Learning Gait Representations with Noisy Multi-Task LearningCode1
Chain-of-Choice Hierarchical Policy Learning for Conversational RecommendationCode1
Causality-Inspired Fair Representation Learning for Multimodal RecommendationCode1
Salient Object Detection in RGB-D VideosCode1
MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute ControlCode1
GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?Code1
Learning with Unmasked Tokens Drives Stronger Vision LearnersCode1
ExtractGPT: Exploring the Potential of Large Language Models for Product Attribute Value ExtractionCode1
Multi‑camera trajectory matching based on hierarchical clustering and constraintsCode1
ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face SwappingCode1
Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?Code1
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward ModelCode1
Multimodal Variational Auto-encoder based Audio-Visual SegmentationCode1
Sentence-level Prompts Benefit Composed Image RetrievalCode1
Show:102550
← PrevPage 7 of 108Next →

No leaderboard results yet.