| 3D-VLA: A 3D Vision-Language-Action Generative World Model | Mar 14, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis | Mar 14, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Can We Talk Models Into Seeing the World Differently? | Mar 14, 2024 | Image CaptioningImage Classification | CodeCode Available | 1 |
| VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework | Mar 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CodingTeachLLM: Empowering LLM's Coding Ability via AST Prior Knowledge | Mar 13, 2024 | Dialogue EvaluationHumanEval | —Unverified | 0 |
| Do Large Language Models Solve ARC Visual Analogies Like People Do? | Mar 13, 2024 | ARCLanguage Modeling | CodeCode Available | 0 |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Mar 13, 2024 | Decision MakingIn-Context Learning | —Unverified | 0 |
| Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization | Mar 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Boosting Disfluency Detection with Large Language Model as Disfluency Generator | Mar 13, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Is Context Helpful for Chat Translation Evaluation? | Mar 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |