| Vision Language Models are In-Context Value Learners | Nov 7, 2024 | In-Context Learning, World Knowledge | Unverified | 0 |
| Pre-trained Visual Dynamics Representations for Efficient Policy Learning | Nov 5, 2024 | Reinforcement Learning (RL), Video Prediction | Unverified | 0 |
| ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model | Nov 4, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| On the Exploration of LM-Based Soft Modular Robot Design | Nov 1, 2024 | World Knowledge | Unverified | 0 |
| Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation | Nov 1, 2024 | Epidemiology, Knowledge Distillation | Unverified | 0 |
| EMMA: End-to-End Multimodal Model for Autonomous Driving | Oct 30, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| GRADE: Quantifying Sample Diversity in Text-to-Image Models | Oct 29, 2024 | Attribute, Diversity | Unverified | 0 |
| ADAM: An Embodied Causal Agent in Open-World Environments | Oct 29, 2024 | Lifelong learning, Minecraft | Unverified | 0 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | Oct 29, 2024 | Data Poisoning, Language Modeling | Unverified | 0 |
| ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval | Oct 24, 2024 | Image Retrieval, Retrieval | Code Available | 0 |