| RAGAR, Your Falsehood Radar: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models | Apr 18, 2024 | Fact Checking, Language Modeling | —Unverified | 0 |
| RespLLM: Unifying Audio and Text with Multimodal LLMs for Generalized Respiratory Health Prediction | Oct 7, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | May 30, 2025 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | Jan 1, 2025 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 |
| Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation | May 27, 2024 | Instruction Following, Language Modeling | —Unverified | 0 |
| SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Apr 4, 2024 | Grasp Generation, Language Modeling | —Unverified | 0 |
| SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model | Dec 2, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection | Mar 5, 2024 | Concept Alignment, Explanation Generation | —Unverified | 0 |
| SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability | Mar 18, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| ST^3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming | Dec 28, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| StreetviewLLM: Extracting Geographic Information Using a Chain-of-Thought Multimodal Large Language Model | Nov 19, 2024 | Decision Making, Language Modeling | —Unverified | 0 |
| Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization | Mar 13, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults | Dec 22, 2024 | Data Augmentation, Fault Diagnosis | —Unverified | 0 |
| TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model | Jul 8, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| The NTNU System at the S&I Challenge 2025 SLA Open Track | Jun 5, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| The Solution for CVPR2024 Foundational Few-Shot Object Detection Challenge | Jun 18, 2024 | Few-Shot Object Detection, Language Modeling | —Unverified | 0 |
| Think Before You Diffuse: LLMs-Guided Physics-Aware Video Generation | May 27, 2025 | Large Language Model, Multimodal Large Language Model | —Unverified | 0 |
| Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model | Nov 29, 2024 | Autonomous Vehicles, Language Modeling | —Unverified | 0 |
| A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization | Dec 27, 2024 | Face Swapping, Image Segmentation | —Unverified | 0 |
| AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability | May 23, 2024 | cross-modal alignment, Language Modelling | —Unverified | 0 |
| A Medical Multimodal Large Language Model for Pediatric Pneumonia | Sep 4, 2024 | Diagnostic, Language Modeling | —Unverified | 0 |
| A Neural Matrix Decomposition Recommender System Model based on the Multimodal Large Language Model | Jul 12, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions | Jun 4, 2025 | Data Augmentation, Diversity | —Unverified | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | Jun 17, 2025 | Hallucination, Language Modeling | —Unverified | 0 |
| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |