| ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts | Mar 3, 2025 | AttributeImage Generation | CodeCode Available | 1 |
| SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Feb 28, 2025 | AttributeAutonomous Driving | CodeCode Available | 1 |
| Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning | Feb 20, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| Model Generalization on Text Attribute Graphs: Principles with Large Language Models | Feb 17, 2025 | AttributeGraph Learning | CodeCode Available | 1 |
| Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding | Feb 16, 2025 | AttributeObject | CodeCode Available | 1 |
| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models | Feb 12, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| Learning Clustering-based Prototypes for Compositional Zero-shot Learning | Feb 10, 2025 | AttributeClustering | CodeCode Available | 1 |
| CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally | Feb 5, 2025 | Attributecross-modal alignment | CodeCode Available | 1 |
| Controllable Protein Sequence Generation with LLM Preference Optimization | Jan 25, 2025 | AttributeProtein Design | CodeCode Available | 1 |
| Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech Synthesis | Jan 11, 2025 | AttributeBenchmarking | CodeCode Available | 1 |