SOTAVerified

Attribute

Papers

Showing 151200 of 5387 papers

TitleStatusHype
SEED: A Benchmark Dataset for Sequential Facial Attribute Editing with Diffusion ModelsCode1
One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing FrameworkCode1
Introducing voice timbre attribute detectionCode1
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention RoutingCode1
Learning to Attribute with AttentionCode1
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical ImagingCode1
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token PredictionCode1
Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction TuningCode1
Do Theory of Mind Benchmarks Need Explicit Human-like Reasoning in Language Models?Code1
EagleVision: Object-level Attribute Multimodal LLM for Remote SensingCode1
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous DrivingCode1
FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMsCode1
Demand Estimation with Text and Image DataCode1
Fine-grained Textual Inversion Network for Zero-Shot Composed Image RetrievalCode1
Attention IoU: Examining Biases in CelebA using Attention MapsCode1
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image RetrievalCode1
Exploring Contextual Attribute Density in Referring Expression CountingCode1
Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty?Code1
NullFace: Training-Free Localized Face AnonymizationCode1
Generating Novel Brain Morphology by Deforming Learned TemplatesCode1
ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap LayoutsCode1
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation ModelsCode1
Aligning LLMs to Ask Good Questions A Case Study in Clinical ReasoningCode1
Model Generalization on Text Attribute Graphs: Principles with Large Language ModelsCode1
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video GroundingCode1
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal ModelsCode1
Learning Clustering-based Prototypes for Compositional Zero-shot LearningCode1
CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modallyCode1
Controllable Protein Sequence Generation with LLM Preference OptimizationCode1
Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech SynthesisCode1
Super-class guided Transformer for Zero-Shot Attribute ClassificationCode1
RecKG: Knowledge Graph for Recommender SystemsCode1
Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue DiagnosisCode1
Chebyshev Attention Depth Permutation Texture Network with Latent Texture Attribute LossCode1
OW-OVD: Unified Open World and Open Vocabulary Object DetectionCode1
Exploring Contextual Attribute Density in Referring Expression CountingCode1
Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart UnderstandingCode1
Sign-IDD: Iconicity Disentangled Diffusion for Sign Language ProductionCode1
CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute EditingCode1
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image CaptioningCode1
Efficient 3D Recognition with Event-driven Spike Sparse ConvolutionCode1
Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract ReasoningCode1
Grounding Descriptions in Images informs Zero-Shot Visual RecognitionCode1
MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language ModelCode1
GeoAI-Enhanced Community Detection on Spatial Networks with Graph Deep LearningCode1
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot LearningCode1
Att2CPC: Attention-Guided Lossy Attribute Compression of Point CloudsCode1
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward ModelsCode1
Scalable Influence and Fact Tracing for Large Language Model PretrainingCode1
Progressive Compositionality In Text-to-Image Generative ModelsCode1
Show:102550
← PrevPage 4 of 108Next →

No leaderboard results yet.