| SEED: A Benchmark Dataset for Sequential Facial Attribute Editing with Diffusion Models | May 31, 2025 | AttributeFacial Editing | CodeCode Available | 1 |
| One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework | May 16, 2025 | AttributeImage Generation | CodeCode Available | 1 |
| Introducing voice timbre attribute detection | May 14, 2025 | Attribute | CodeCode Available | 1 |
| MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing | May 5, 2025 | Attribute | CodeCode Available | 1 |
| Learning to Attribute with Attention | Apr 18, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging | Apr 11, 2025 | AttributeComputational Efficiency | CodeCode Available | 1 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning | Apr 2, 2025 | AttributeImage Quality Assessment | CodeCode Available | 1 |
| Do Theory of Mind Benchmarks Need Explicit Human-like Reasoning in Language Models? | Apr 2, 2025 | AttributeReinforcement Learning (RL) | CodeCode Available | 1 |
| EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Mar 30, 2025 | AttributeDisentanglement | CodeCode Available | 1 |
| FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs | Mar 27, 2025 | AttributeBenchmarking | CodeCode Available | 1 |
| Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving | Mar 27, 2025 | AttributeAutonomous Driving | CodeCode Available | 1 |
| Demand Estimation with Text and Image Data | Mar 26, 2025 | Attributecounterfactual | CodeCode Available | 1 |
| Attention IoU: Examining Biases in CelebA using Attention Maps | Mar 25, 2025 | Attribute | CodeCode Available | 1 |
| Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval | Mar 25, 2025 | AttributeImage Retrieval | CodeCode Available | 1 |
| Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval | Mar 21, 2025 | AttributeImage Retrieval | CodeCode Available | 1 |
| Exploring Contextual Attribute Density in Referring Expression Counting | Mar 16, 2025 | AttributeReferring Expression | CodeCode Available | 1 |
| Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? | Mar 14, 2025 | Attribute | CodeCode Available | 1 |
| NullFace: Training-Free Localized Face Anonymization | Mar 11, 2025 | AttributeFace Anonymization | CodeCode Available | 1 |
| Generating Novel Brain Morphology by Deforming Learned Templates | Mar 4, 2025 | AttributeDecoder | CodeCode Available | 1 |
| ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts | Mar 3, 2025 | AttributeImage Generation | CodeCode Available | 1 |
| SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Feb 28, 2025 | AttributeAutonomous Driving | CodeCode Available | 1 |
| Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning | Feb 20, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| Model Generalization on Text Attribute Graphs: Principles with Large Language Models | Feb 17, 2025 | AttributeGraph Learning | CodeCode Available | 1 |
| Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding | Feb 16, 2025 | AttributeObject | CodeCode Available | 1 |