| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models | Feb 12, 2025 | Attribute, Diagnostic | Code Available | 1 |
| Learning Clustering-based Prototypes for Compositional Zero-shot Learning | Feb 10, 2025 | Attribute, Clustering | Code Available | 1 |
| CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally | Feb 5, 2025 | Attribute, cross-modal alignment | Code Available | 1 |
| Controllable Protein Sequence Generation with LLM Preference Optimization | Jan 25, 2025 | Attribute, Protein Design | Code Available | 1 |
| Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech Synthesis | Jan 11, 2025 | Attribute, Benchmarking | Code Available | 1 |
| Super-class guided Transformer for Zero-Shot Attribute Classification | Jan 10, 2025 | Attribute, Classification | Code Available | 1 |
| RecKG: Knowledge Graph for Recommender Systems | Jan 7, 2025 | Attribute, Data Integration | Code Available | 1 |
| Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis | Jan 6, 2025 | Attribute, Diagnostic | Code Available | 1 |
| Chebyshev Attention Depth Permutation Texture Network with Latent Texture Attribute Loss | Jan 1, 2025 | Attribute, Material Recognition | Code Available | 1 |
| OW-OVD: Unified Open World and Open Vocabulary Object Detection | Jan 1, 2025 | Attribute, Incremental Learning | Code Available | 1 |
| Exploring Contextual Attribute Density in Referring Expression Counting | Jan 1, 2025 | Attribute, Referring Expression | Code Available | 1 |
| Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding | Dec 21, 2024 | Attribute, Question Answering | Code Available | 1 |
| Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production | Dec 18, 2024 | Attribute, Disentanglement | Code Available | 1 |
| CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing | Dec 18, 2024 | Attribute | Code Available | 1 |
| Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Dec 11, 2024 | Attribute, Benchmarking | Code Available | 1 |
| Efficient 3D Recognition with Event-driven Spike Sparse Convolution | Dec 10, 2024 | Attribute | Code Available | 1 |
| Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract Reasoning | Dec 7, 2024 | Attribute | Code Available | 1 |
| Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Dec 5, 2024 | Attribute, Benchmarking | Code Available | 1 |
| MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language Model | Dec 5, 2024 | Attribute, Language Modeling | Code Available | 1 |
| GeoAI-Enhanced Community Detection on Spatial Networks with Graph Deep Learning | Nov 23, 2024 | Attribute, Community Detection | Code Available | 1 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | Attribute, Compositional Zero-Shot Learning | Code Available | 1 |
| Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds | Oct 23, 2024 | Attribute | Code Available | 1 |
| MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models | Oct 23, 2024 | Attribute, Fairness | Code Available | 1 |
| Scalable Influence and Fact Tracing for Large Language Model Pretraining | Oct 22, 2024 | Attribute, Language Modeling | Code Available | 1 |
| Progressive Compositionality In Text-to-Image Generative Models | Oct 22, 2024 | Attribute, Contrastive Learning | Code Available | 1 |