ReLoop: "Seeing Twice and Thinking Backwards" via Closed-loop Training to Mitigate Hallucinations in Multimodal understanding Jul 7, 2025 Hallucination Question Answering
— Unverified 0AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions Jul 4, 2025 Question Answering RAG
— Unverified 0Agent-Based Detection and Resolution of Incompleteness and Ambiguity in Interactions with Large Language Models Jul 4, 2025 Question Answering
— Unverified 0OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering Jul 2, 2025 Language Modeling Language Modelling
Code Code Available 0Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language Models Jun 28, 2025 image-classification Image Classification
Code Code Available 0SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning Jun 26, 2025 In-Context Learning Medical Visual Question Answering
— Unverified 0IPFormer-VideoLLM: Enhancing Multi-modal Video Understanding for Multi-shot Scenes Jun 26, 2025 Attribute Question Answering
— Unverified 0DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document Images Jun 26, 2025 document understanding Optical Character Recognition (OCR)
Code Code Available 0ComRAG: Retrieval-Augmented Generation with Dynamic Vector Stores for Real-time Community Question Answering in Industry Jun 26, 2025 Community Question Answering Question Answering
— Unverified 0Large Language Model Agent for Modular Task Execution in Drug Discovery Jun 26, 2025 Drug Discovery Language Modeling
— Unverified 0Response Quality Assessment for Retrieval-Augmented Generation via Conditional Conformal Factuality Jun 26, 2025 Conformal Prediction Question Answering
Code Code Available 0Towards Probabilistic Question Answering Over Tabular Data Jun 25, 2025 Natural Language Queries Question Answering
— Unverified 0SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models Jun 25, 2025 Code Generation In-Context Learning
— Unverified 0MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering Jun 25, 2025 Multimodal Reasoning Question Answering
— Unverified 0Memento: Note-Taking for Your Future Self Jun 25, 2025 Multi-hop Question Answering Question Answering
— Unverified 0Knowledge-Aware Diverse Reranking for Cross-Source Question Answering Jun 25, 2025 Question Answering RAG
— Unverified 0Semantic-enhanced Modality-asymmetric Retrieval for Online E-commerce Search Jun 25, 2025 Question Answering Retrieval
— Unverified 0HRIBench: Benchmarking Vision-Language Models for Real-Time Human Perception in Human-Robot Interaction Jun 25, 2025 Benchmarking Person Identification
Code Code Available 0FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering Jun 25, 2025 Question Answering Visual Question Answering
— Unverified 0ITFormer: Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset Jun 25, 2025 Computational Efficiency Question Answering
— Unverified 0COIN: Uncertainty-Guarding Selective Question Answering for Foundation Models with Provable Risk Guarantees Jun 25, 2025 Conformal Prediction Question Answering
— Unverified 0Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs Jun 24, 2025 Information Retrieval Knowledge Graphs
— Unverified 0KunLunBaizeRAG: Reinforcement Learning Driven Inference Performance Leap for Large Language Models Jun 24, 2025 Multi-hop Question Answering Question Answering
— Unverified 0ToSA: Token Merging with Spatial Awareness Jun 24, 2025 Embodied Question Answering Question Answering
Code Code Available 0Enhancing Biosecurity in Tamper-Resistant Large Language Models With Quantum Gradient Descent Jun 23, 2025 Question Answering Sensitivity
— Unverified 0Semantic similarity estimation for domain specific data using BERT and other techniques Jun 23, 2025 Information Retrieval Machine Translation
— Unverified 0Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives Jun 22, 2025 Multi-hop Question Answering Question Answering
— Unverified 0GEMeX-ThinkVG: Towards Thinking with Visual Grounding in Medical VQA via Reinforcement Learning Jun 22, 2025 Answer Generation Decision Making
— Unverified 0Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations Jun 21, 2025 Question Answering Scene Understanding
— Unverified 0General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting Jun 20, 2025 Embodied Question Answering Question Answering
— Unverified 0UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making Jun 20, 2025 Decision Making Question Answering
Code Code Available 0Can Common VLMs Rival Medical VLMs? Evaluation and Strategic Insights Jun 19, 2025 Question Answering Visual Question Answering
— Unverified 0How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering? Jun 19, 2025 Multiple-choice Question Answering
— Unverified 0From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents Jun 18, 2025 Language Modeling Language Modelling
— Unverified 0MEGC2025: Micro-Expression Grand Challenge on Spot Then Recognize and Visual Question Answering Jun 18, 2025 Multimodal Reasoning Question Answering
— Unverified 0WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts Jun 18, 2025 document understanding Multiple-choice
— Unverified 0Adapting Lightweight Vision Language Models for Radiological Visual Question Answering Jun 17, 2025 Diagnostic Question Answering
Code Code Available 0GenerationPrograms: Fine-grained Attribution with Executable Programs Jun 17, 2025 Document Summarization Long Form Question Answering
Code Code Available 0Re-Initialization Token Learning for Tool-Augmented Large Language Models Jun 17, 2025 GSM8K Question Answering
Code Code Available 0Enhancing Omics Cohort Discovery for Research on Neurodegeneration through Ontology-Augmented Embedding Models Jun 16, 2025 Question Answering
Code Code Available 0CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making Jun 15, 2025 Answer Generation Decision Making
— Unverified 0MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval Jun 14, 2025 Instruction Following Multimodal Reasoning
Code Code Available 0AntiGrounding: Lifting Robotic Actions into VLM Representation Space for Decision Making Jun 14, 2025 Decision Making Question Answering
— Unverified 0Training-free LLM Merging for Multi-task Learning Jun 14, 2025 Multiple-choice Multi-Task Learning
Code Code Available 0MTabVQA: Evaluating Multi-Tabular Reasoning of Language Models in Visual Space Jun 13, 2025 Question Answering Visual Question Answering
— Unverified 0Instruction Tuning and CoT Prompting for Contextual Medical QA with LLMs Jun 13, 2025 Medical Question Answering MedQA
— Unverified 0A Fast, Reliable, and Secure Programming Language for LLM Agents with Code Actions Jun 13, 2025 Conformal Prediction Question Answering
— Unverified 0Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables Jun 13, 2025 Benchmarking Descriptive
— Unverified 0EQA-RM: A Generative Embodied Reward Model with Test-time Scaling Jun 12, 2025 Embodied Question Answering Question Answering
Code Code Available 0Can We Infer Confidential Properties of Training Data from LLMs? Jun 12, 2025 image-classification Image Classification
— Unverified 0