| OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction | Mar 5, 2025 | Vision-Language-ActionZero-shot Generalization | —Unverified | 0 |
| RAILGUN: A Unified Convolutional Policy for Multi-Agent Path Finding Across Different Environments and Tasks | Mar 4, 2025 | Multi-Agent Path FindingZero-shot Generalization | —Unverified | 0 |
| Re-Imagining Multimodal Instruction Tuning: A Representation View | Mar 2, 2025 | Instruction FollowingMME | CodeCode Available | 0 |
| Contrastive Learning of English Language and Crystal Graphs for Multimodal Representation of Materials Knowledge | Feb 23, 2025 | Contrastive LearningZero-shot Generalization | —Unverified | 0 |
| Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models | Feb 20, 2025 | Reinforcement Learning (RL)Zero-shot Generalization | —Unverified | 0 |
| GeLLMO: Generalizing Large Language Models for Multi-property Molecule Optimization | Feb 19, 2025 | Zero-shot Generalization | CodeCode Available | 0 |
| WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing | Feb 17, 2025 | Anomaly DetectionImage Segmentation | —Unverified | 0 |
| Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning | Feb 12, 2025 | Zero-shot Generalization | —Unverified | 0 |
| Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | Feb 7, 2025 | Zero-shot Generalization | —Unverified | 0 |
| SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation | Feb 5, 2025 | Spike SortingZero-shot Generalization | —Unverified | 0 |