Title | Date | Tasks | Code status
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making | May 6, 2025 | Decision Making General Knowledge | Unverified
Sim2Real Transfer for Vision-Based Grasp Verification | May 5, 2025 | Object object-detection | Code Available
Structure Causal Models and LLMs Integration in Medical Visual Question Answering | May 5, 2025 | Causal Inference Medical Visual Question Answering | Unverified
Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks | May 5, 2025 | Question Answering Semantic Communication | Unverified
Compositional Image-Text Matching and Retrieval by Grounding Entities | May 4, 2025 | Image Captioning Image-text matching | Code Available
OODTE: A Differential Testing Engine for the ONNX Optimizer | May 3, 2025 | object-detection Object Detection | Unverified
Knowledge-Augmented Language Models Interpreting Structured Chest X-Ray Findings | May 3, 2025 | Question Answering Visual Question Answering | Unverified
Adaptive Token Boundaries: Integrating Human Chunking Mechanisms into Multimodal LLMs | May 3, 2025 | Chunking Question Answering | Unverified
TRAVELER: A Benchmark for Evaluating Temporal Reasoning across Vague, Implicit and Explicit References | May 2, 2025 | Natural Language Understanding Question Answering | Unverified
Transferable Adversarial Attacks on Black-Box Vision-Language Models | May 2, 2025 | Image Captioning Object Recognition | Unverified
Grounding Task Assistance with Multimodal Cues from a Single Demonstration | May 2, 2025 | Question Answering Visual Question Answering | Unverified
Beyond Attention: Toward Machines with Intrinsic Higher Mental States | May 2, 2025 | Question Answering | Unverified
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation | May 1, 2025 | Question Answering Specificity | Code Available
CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass | May 1, 2025 | Contrastive Learning Information Retrieval | Unverified
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | May 1, 2025 | Extractive Question-Answering Hallucination | Unverified
AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care | May 1, 2025 | Language Modeling Language Modelling | Code Available
Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA | Apr 30, 2025 | Information Retrieval Medical Question Answering | Code Available
Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding | Apr 30, 2025 | Medical Question Answering Question Answering | Unverified
ConSens: Assessing context grounding in open-book question answering | Apr 30, 2025 | Question Answering | Unverified
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM | Apr 30, 2025 | Image Captioning Object Recognition | Unverified
LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs | Apr 29, 2025 | Benchmarking Face Generation | Unverified
SetKE: Knowledge Editing for Knowledge Elements Overlap | Apr 29, 2025 | Incremental Learning knowledge editing | Unverified
LLM Enhancer: Merged Approach using Vector Embedding for Reducing Large Language Model Hallucinations with External Knowledge | Apr 29, 2025 | Language Modeling Language Modelling | Unverified
m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training | Apr 28, 2025 | Question Answering | Unverified
Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models | Apr 28, 2025 | Mathematical Reasoning Meta-Learning | Code Available
OpenTCM: A GraphRAG-Empowered LLM-based System for Traditional Chinese Medicine Knowledge Retrieval and Diagnosis | Apr 28, 2025 | Diagnostic Information Retrieval | Unverified
Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom | Apr 28, 2025 | Domain Adaptation Knowledge Distillation | Unverified
SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning | Apr 28, 2025 | Question Answering Spatial Reasoning | Unverified
Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning | Apr 26, 2025 | In-Context Learning Philosophy | Code Available
An Empirical Study of Evaluating Long-form Question Answering | Apr 25, 2025 | Form Informativeness | Code Available
Pushing the boundary on Natural Language Inference | Apr 25, 2025 | Fact Checking Information Retrieval | Unverified
A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task | Apr 24, 2025 | Question Answering Retrieval | Unverified
Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction | Apr 24, 2025 | Conformal Prediction Hallucination | Unverified
TraveLLaMA: Facilitating Multi-modal Large Language Models to Understand Urban Scenes and Provide Travel Assistance | Apr 23, 2025 | Question Answering Scene Understanding | Unverified
FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation | Apr 22, 2025 | Question Answering RAG | Unverified
Towards Understanding Camera Motions in Any Video | Apr 21, 2025 | Question Answering Text Retrieval | Unverified
Efficient Document Retrieval with G-Retriever | Apr 21, 2025 | graph construction Question Answering | Code Available
The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models | Apr 21, 2025 | Question Answering RAG | Unverified
Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding | Apr 20, 2025 | Autonomous Driving Image Captioning | Code Available
FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering | Apr 20, 2025 | counterfactual Fairness | Unverified
A Hierarchical Framework for Measuring Scientific Paper Innovation via Large Language Models | Apr 20, 2025 | Question Answering | Unverified
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering | Apr 20, 2025 | Question Answering RAG | Unverified
Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability | Apr 20, 2025 | Question Answering Visual Question Answering | Unverified
CoLoTa: A Dataset for Entity-based Commonsense Reasoning over Long-Tail Knowledge | Apr 20, 2025 | Claim Verification Graph Question Answering | Unverified
SConU: Selective Conformal Uncertainty in Large Language Models | Apr 19, 2025 | Conformal Prediction Question Answering | Unverified
Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts | Apr 19, 2025 | Conversational Question Answering Language Modeling | Unverified
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval | Apr 19, 2025 | Information Retrieval Question Answering | Unverified
Long-context Non-factoid Question Answering in Indic Languages | Apr 18, 2025 | coreference-resolution Coreference Resolution | Code Available
ChartQA-X: Generating Explanations for Charts | Apr 17, 2025 | Decision Making Explanation Generation | Unverified
WebLists: Extracting Structured Information From Complex Interactive Websites Using Executable LLM Agents | Apr 17, 2025 | Navigate Question Answering | Unverified