| Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs | Jan 2, 2025 | Adversarial Attack, Attribute | Unverified | 0 |
| Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability | Jan 2, 2025 | Attribute, Language Modeling | Unverified | 0 |
| ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer | Jan 2, 2025 | Attribute, Diagnostic | Code Available | 0 |
| FluxSpace: Disentangled Semantic Editing in Rectified Flow Models | Jan 1, 2025 | Attribute, Disentanglement | Unverified | 0 |
| Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment | Jan 1, 2025 | Attribute, cross-modal alignment | Unverified | 0 |
| LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning | Jan 1, 2025 | Attribute, Compositional Zero-Shot Learning | Unverified | 0 |
| Detecting Open World Objects via Partial Attribute Assignment | Jan 1, 2025 | Attribute, object-detection | Unverified | 0 |
| Rethinking Spiking Self-Attention Mechanism: Implementing a-XNOR Similarity Calculation in Spiking Transformers | Jan 1, 2025 | Attribute | Unverified | 0 |
| Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning | Jan 1, 2025 | Action Recognition, Attribute | Unverified | 0 |
| Generative Gaussian Splatting for Unbounded 3D City Generation | Jan 1, 2025 | 3D Generation, Attribute | Unverified | 0 |