| Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning | Feb 19, 2025 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 | 0 |
| Vision-centric Token Compression in Large Language Model | Feb 2, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 | 0 |
| VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework | Mar 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation | Feb 6, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Vision-Language Models Represent Darker-Skinned Black Individuals as More Homogeneous than Lighter-Skinned Black Individuals | Dec 12, 2024 | Image CaptioningImage Generation | —Unverified | 0 | 0 |
| VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection | Dec 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| [Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI | Nov 5, 2024 | ChatbotLanguage Modeling | —Unverified | 0 | 0 |
| VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions | Jul 17, 2024 | Autonomous VehiclesLanguage Modeling | —Unverified | 0 | 0 |
| Visual Adversarial Attack on Vision-Language Models for Autonomous Driving | Nov 27, 2024 | Adversarial AttackAutonomous Driving | —Unverified | 0 | 0 |
| Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning | Jun 3, 2022 | Image Paragraph CaptioningLanguage Modeling | —Unverified | 0 | 0 |