| Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs | Jan 2, 2025 | Adversarial AttackAttribute | —Unverified | 0 |
| Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction | Jan 2, 2025 | AttributeAttribute Value Extraction | CodeCode Available | 0 |
| Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability | Jan 2, 2025 | AttributeLanguage Modeling | —Unverified | 0 |
| RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance | Jan 1, 2025 | AttributePosition | —Unverified | 0 |
| Rethinking Spiking Self-Attention Mechanism: Implementing a-XNOR Similarity Calculation in Spiking Transformers | Jan 1, 2025 | Attribute | —Unverified | 0 |
| Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment | Jan 1, 2025 | Attributecross-modal alignment | —Unverified | 0 |
| Navigating the Unseen: Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features | Jan 1, 2025 | AttributeGraph Generation | —Unverified | 0 |
| Z-Magic: Zero-shot Multiple Attributes Guided Image Creator | Jan 1, 2025 | AttributeImage Generation | —Unverified | 0 |
| LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning | Jan 1, 2025 | AttributeCompositional Zero-Shot Learning | —Unverified | 0 |
| Harnessing Global-Local Collaborative Adversarial Perturbation for Anti-Customization | Jan 1, 2025 | AttributeImage Generation | —Unverified | 0 |
| FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing | Jan 1, 2025 | AttributeDenoising | —Unverified | 0 |
| Model Diagnosis and Correction via Linguistic and Implicit Attribute Editing | Jan 1, 2025 | Attributecounterfactual | —Unverified | 0 |
| FluxSpace: Disentangled Semantic Editing in Rectified Flow Models | Jan 1, 2025 | AttributeDisentanglement | —Unverified | 0 |
| Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning | Jan 1, 2025 | Action RecognitionAttribute | —Unverified | 0 |
| AttriReBoost: A Gradient-Free Propagation Optimization Method for Cold Start Mitigation in Attribute Missing Graphs | Jan 1, 2025 | AttributeComputational Efficiency | CodeCode Available | 0 |
| Enhanced Visual-Semantic Interaction with Tailored Prompts for Pedestrian Attribute Recognition | Jan 1, 2025 | AttributePedestrian Attribute Recognition | —Unverified | 0 |
| DaCapo: Score Distillation as Stacked Bridge for Fast and High-quality 3D Editing | Jan 1, 2025 | 3D scene EditingAttribute | —Unverified | 0 |
| Attribute-Missing Multi-view Graph Clustering | Jan 1, 2025 | AttributeClustering | —Unverified | 0 |
| Simplification Is All You Need against Out-of-Distribution Overconfidence | Jan 1, 2025 | AllAttribute | —Unverified | 0 |
| OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes | Jan 1, 2025 | AttributeImage Generation | —Unverified | 0 |
| GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model | Jan 1, 2025 | AttributeLanguage Modeling | —Unverified | 0 |
| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models | Jan 1, 2025 | AttributeDiagnostic | —Unverified | 0 |
| Generative Gaussian Splatting for Unbounded 3D City Generation | Jan 1, 2025 | 3D GenerationAttribute | —Unverified | 0 |
| Detecting Open World Objects via Partial Attribute Assignment | Jan 1, 2025 | Attributeobject-detection | —Unverified | 0 |
| Compositional Caching for Training-free Open-vocabulary Attribute Detection | Jan 1, 2025 | AttributeOpen Vocabulary Attribute Detection | —Unverified | 0 |
| Composing Parts for Expressive Object Generation | Jan 1, 2025 | AttributeDenoising | —Unverified | 0 |
| Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution | Jan 1, 2025 | AttributeBlind Super-Resolution | —Unverified | 0 |
| CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection | Dec 31, 2024 | Anomaly DetectionAttribute | —Unverified | 0 |
| Two Birds with One Stone: Improving Rumor Detection by Addressing the Unfairness Issue | Dec 30, 2024 | AttributeFairness | —Unverified | 0 |
| PERSE: Personalized 3D Generative Avatars from A Single Portrait | Dec 30, 2024 | Attribute | —Unverified | 0 |
| MAFT: Efficient Model-Agnostic Fairness Testing for Deep Neural Networks via Zero-Order Gradient Search | Dec 28, 2024 | AttributeDecision Making | —Unverified | 0 |
| MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation | Dec 28, 2024 | AttributeComputational Efficiency | —Unverified | 0 |
| FashionFAE: Fine-grained Attributes Enhanced Fashion Vision-Language Pre-training | Dec 28, 2024 | AttributeImage Reconstruction | —Unverified | 0 |
| Focusing Image Generation to Mitigate Spurious Correlations | Dec 27, 2024 | AttributeData Augmentation | —Unverified | 0 |
| FACEMUG: A Multimodal Generative and Fusion Framework for Local Facial Editing | Dec 26, 2024 | AttributeFacial Editing | —Unverified | 0 |
| A Review of Resilience Enhancement Measures for Hydrogen-penetrated Multi-energy Systems | Dec 26, 2024 | Attribute | —Unverified | 0 |
| Multi-Attribute Constraint Satisfaction via Language Model Rewriting | Dec 26, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| Imperceptible Adversarial Attacks on Point Clouds Guided by Point-to-Surface Field | Dec 26, 2024 | Adversarial RobustnessAttribute | —Unverified | 0 |
| DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions | Dec 25, 2024 | Attribute | —Unverified | 0 |
| Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition | Dec 25, 2024 | Attributespeech-recognition | —Unverified | 0 |
| Same Company, Same Signal: The Role of Identity in Earnings Call Transcripts | Dec 23, 2024 | Attribute | —Unverified | 0 |
| SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval | Dec 23, 2024 | AttributeRetrieval | —Unverified | 0 |
| Semantic Hierarchical Prompt Tuning for Parameter-Efficient Fine-Tuning | Dec 22, 2024 | Attributeparameter-efficient fine-tuning | CodeCode Available | 0 |
| Visual Prompting with Iterative Refinement for Design Critique Generation | Dec 22, 2024 | AttributeVisual Prompting | —Unverified | 0 |
| FAP-CD: Fairness-Driven Age-Friendly Community Planning via Conditional Diffusion Generation | Dec 21, 2024 | AttributeDenoising | CodeCode Available | 0 |
| Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities | Dec 21, 2024 | AttributeClassification | —Unverified | 0 |
| DualGFL: Federated Learning with a Dual-Level Coalition-Auction Game | Dec 20, 2024 | AttributeFederated Learning | —Unverified | 0 |
| SemDP: Semantic-level Differential Privacy Protection for Face Datasets | Dec 20, 2024 | AttributeImage Generation | —Unverified | 0 |
| Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage | Dec 20, 2024 | AttributeBenchmarking | —Unverified | 0 |
| AdaCred: Adaptive Causal Decision Transformers with Feature Crediting | Dec 19, 2024 | AttributeImitation Learning | —Unverified | 0 |