| Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | Feb 7, 2025 | Zero-shot Generalization | —Unverified | 0 |
| DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation | Apr 8, 2024 | Zero-shot Generalization | —Unverified | 0 |
| I-PHYRE: Interactive Physical Reasoning | Dec 4, 2023 | Zero-shot Generalization | —Unverified | 0 |
| Language Models are General-Purpose Interfaces | Jun 13, 2022 | Causal Language ModelingFew-Shot Learning | —Unverified | 0 |
| In the Era of Prompt Learning with Vision-Language Models | Nov 7, 2024 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | Nov 30, 2023 | Autonomous VehiclesCommon Sense Reasoning | —Unverified | 0 |
| A Recipe for Improving Remote Sensing VLM Zero Shot Generalization | Mar 10, 2025 | Cross-Modal RetrievalZero-Shot Cross-Modal Retrieval | —Unverified | 0 |
| Interaction Modeling with Multiplex Attention | Aug 23, 2022 | Social NavigationTrajectory Forecasting | —Unverified | 0 |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | Oct 11, 2023 | 4kDecoder | —Unverified | 0 |
| DEUX: Active Exploration for Learning Unsupervised Depth Perception | Sep 16, 2023 | Depth CompletionDepth Estimation | —Unverified | 0 |