| SEED: A Benchmark Dataset for Sequential Facial Attribute Editing with Diffusion Models | May 31, 2025 | AttributeFacial Editing | CodeCode Available | 1 |
| One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework | May 16, 2025 | AttributeImage Generation | CodeCode Available | 1 |
| Introducing voice timbre attribute detection | May 14, 2025 | Attribute | CodeCode Available | 1 |
| MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing | May 5, 2025 | Attribute | CodeCode Available | 1 |
| Learning to Attribute with Attention | Apr 18, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging | Apr 11, 2025 | AttributeComputational Efficiency | CodeCode Available | 1 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning | Apr 2, 2025 | AttributeImage Quality Assessment | CodeCode Available | 1 |
| Do Theory of Mind Benchmarks Need Explicit Human-like Reasoning in Language Models? | Apr 2, 2025 | AttributeReinforcement Learning (RL) | CodeCode Available | 1 |
| EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Mar 30, 2025 | AttributeDisentanglement | CodeCode Available | 1 |
| Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving | Mar 27, 2025 | AttributeAutonomous Driving | CodeCode Available | 1 |
| FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs | Mar 27, 2025 | AttributeBenchmarking | CodeCode Available | 1 |
| Demand Estimation with Text and Image Data | Mar 26, 2025 | Attributecounterfactual | CodeCode Available | 1 |
| Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval | Mar 25, 2025 | AttributeImage Retrieval | CodeCode Available | 1 |
| Attention IoU: Examining Biases in CelebA using Attention Maps | Mar 25, 2025 | Attribute | CodeCode Available | 1 |
| Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval | Mar 21, 2025 | AttributeImage Retrieval | CodeCode Available | 1 |
| Exploring Contextual Attribute Density in Referring Expression Counting | Mar 16, 2025 | AttributeReferring Expression | CodeCode Available | 1 |
| Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? | Mar 14, 2025 | Attribute | CodeCode Available | 1 |
| NullFace: Training-Free Localized Face Anonymization | Mar 11, 2025 | AttributeFace Anonymization | CodeCode Available | 1 |
| Generating Novel Brain Morphology by Deforming Learned Templates | Mar 4, 2025 | AttributeDecoder | CodeCode Available | 1 |
| ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts | Mar 3, 2025 | AttributeImage Generation | CodeCode Available | 1 |
| SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Feb 28, 2025 | AttributeAutonomous Driving | CodeCode Available | 1 |
| Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning | Feb 20, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| Model Generalization on Text Attribute Graphs: Principles with Large Language Models | Feb 17, 2025 | AttributeGraph Learning | CodeCode Available | 1 |
| Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding | Feb 16, 2025 | AttributeObject | CodeCode Available | 1 |
| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models | Feb 12, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| Learning Clustering-based Prototypes for Compositional Zero-shot Learning | Feb 10, 2025 | AttributeClustering | CodeCode Available | 1 |
| CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally | Feb 5, 2025 | Attributecross-modal alignment | CodeCode Available | 1 |
| Controllable Protein Sequence Generation with LLM Preference Optimization | Jan 25, 2025 | AttributeProtein Design | CodeCode Available | 1 |
| Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech Synthesis | Jan 11, 2025 | AttributeBenchmarking | CodeCode Available | 1 |
| Super-class guided Transformer for Zero-Shot Attribute Classification | Jan 10, 2025 | AttributeClassification | CodeCode Available | 1 |
| RecKG: Knowledge Graph for Recommender Systems | Jan 7, 2025 | AttributeData Integration | CodeCode Available | 1 |
| Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis | Jan 6, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| Chebyshev Attention Depth Permutation Texture Network with Latent Texture Attribute Loss | Jan 1, 2025 | AttributeMaterial Recognition | CodeCode Available | 1 |
| OW-OVD: Unified Open World and Open Vocabulary Object Detection | Jan 1, 2025 | AttributeIncremental Learning | CodeCode Available | 1 |
| Exploring Contextual Attribute Density in Referring Expression Counting | Jan 1, 2025 | AttributeReferring Expression | CodeCode Available | 1 |
| Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding | Dec 21, 2024 | AttributeQuestion Answering | CodeCode Available | 1 |
| Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production | Dec 18, 2024 | AttributeDisentanglement | CodeCode Available | 1 |
| CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing | Dec 18, 2024 | Attribute | CodeCode Available | 1 |
| Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Dec 11, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| Efficient 3D Recognition with Event-driven Spike Sparse Convolution | Dec 10, 2024 | Attribute | CodeCode Available | 1 |
| Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract Reasoning | Dec 7, 2024 | Attribute | CodeCode Available | 1 |
| Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Dec 5, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language Model | Dec 5, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| GeoAI-Enhanced Community Detection on Spatial Networks with Graph Deep Learning | Nov 23, 2024 | AttributeCommunity Detection | CodeCode Available | 1 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | AttributeCompositional Zero-Shot Learning | CodeCode Available | 1 |
| Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds | Oct 23, 2024 | Attribute | CodeCode Available | 1 |
| MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models | Oct 23, 2024 | AttributeFairness | CodeCode Available | 1 |
| Scalable Influence and Fact Tracing for Large Language Model Pretraining | Oct 22, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Progressive Compositionality In Text-to-Image Generative Models | Oct 22, 2024 | AttributeContrastive Learning | CodeCode Available | 1 |