| Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs | Jan 2, 2025 | Adversarial Attack, Attribute | Unverified | 0 |
| Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability | Jan 2, 2025 | Attribute, Language Modeling | Unverified | 0 |
| ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer | Jan 2, 2025 | Attribute, Diagnostic | Code Available | 0 |
| FluxSpace: Disentangled Semantic Editing in Rectified Flow Models | Jan 1, 2025 | Attribute, Disentanglement | Unverified | 0 |
| Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment | Jan 1, 2025 | Attribute, cross-modal alignment | Unverified | 0 |
| Harnessing Global-Local Collaborative Adversarial Perturbation for Anti-Customization | Jan 1, 2025 | Attribute, Image Generation | Unverified | 0 |
| OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes | Jan 1, 2025 | Attribute, Image Generation | Unverified | 0 |
| Rethinking Spiking Self-Attention Mechanism: Implementing a-XNOR Similarity Calculation in Spiking Transformers | Jan 1, 2025 | Attribute | Unverified | 0 |
| Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning | Jan 1, 2025 | Action Recognition, Attribute | Unverified | 0 |
| Detecting Open World Objects via Partial Attribute Assignment | Jan 1, 2025 | Attribute, object-detection | Unverified | 0 |
| FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing | Jan 1, 2025 | Attribute, Denoising | Unverified | 0 |
| Simplification Is All You Need against Out-of-Distribution Overconfidence | Jan 1, 2025 | All, Attribute | Unverified | 0 |
| Z-Magic: Zero-shot Multiple Attributes Guided Image Creator | Jan 1, 2025 | Attribute, Image Generation | Unverified | 0 |
| AttriReBoost: A Gradient-Free Propagation Optimization Method for Cold Start Mitigation in Attribute Missing Graphs | Jan 1, 2025 | Attribute, Computational Efficiency | Code Available | 0 |
| Navigating the Unseen: Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features | Jan 1, 2025 | Attribute, Graph Generation | Unverified | 0 |
| Model Diagnosis and Correction via Linguistic and Implicit Attribute Editing | Jan 1, 2025 | Attribute, counterfactual | Unverified | 0 |
| DaCapo: Score Distillation as Stacked Bridge for Fast and High-quality 3D Editing | Jan 1, 2025 | 3D scene Editing, Attribute | Unverified | 0 |
| Attribute-Missing Multi-view Graph Clustering | Jan 1, 2025 | Attribute, Clustering | Unverified | 0 |
| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models | Jan 1, 2025 | Attribute, Diagnostic | Unverified | 0 |
| Enhanced Visual-Semantic Interaction with Tailored Prompts for Pedestrian Attribute Recognition | Jan 1, 2025 | Attribute, Pedestrian Attribute Recognition | Unverified | 0 |
| RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance | Jan 1, 2025 | Attribute, Position | Unverified | 0 |
| GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model | Jan 1, 2025 | Attribute, Language Modeling | Unverified | 0 |
| Generative Gaussian Splatting for Unbounded 3D City Generation | Jan 1, 2025 | 3D Generation, Attribute | Unverified | 0 |
| Compositional Caching for Training-free Open-vocabulary Attribute Detection | Jan 1, 2025 | Attribute, Open Vocabulary Attribute Detection | Unverified | 0 |
| Composing Parts for Expressive Object Generation | Jan 1, 2025 | Attribute, Denoising | Unverified | 0 |