MDIT: A Model-free Data Interpolation Method for Diverse Instruction Tuning Apr 9, 2025 Code Generation Diversity
— Unverified 0Towards an AI-Driven Video-Based American Sign Language Dictionary: Exploring Design and Usage Experience with Learners Apr 8, 2025 Question Answering
Code Code Available 0Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases Apr 8, 2025 Data Integration Language Modeling
— Unverified 0Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness Apr 7, 2025 Knowledge Graphs Language Modeling
— Unverified 0Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models Apr 7, 2025 Question Answering Scheduling
— Unverified 0Enhancing Compositional Reasoning in Vision-Language Models with Synthetic Preference Data Apr 7, 2025 Question Answering Visual Question Answering
Code Code Available 0Towards Visual Text Grounding of Multimodal Large Language Model Apr 7, 2025 Benchmarking Language Modeling
— Unverified 0Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use Apr 7, 2025 GSM8K Math
— Unverified 0Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration Apr 7, 2025 Language Modeling Language Modelling
Code Code Available 1ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering Apr 7, 2025 Chart Question Answering Chart Understanding
Code Code Available 1RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model Apr 7, 2025 Image Captioning image-classification
— Unverified 0MedM-VL: What Makes a Good Medical LVLM? Apr 6, 2025 Medical Image Analysis Question Answering
Code Code Available 2ArxivBench: Can LLMs Assist Researchers in Conducting Research? Apr 6, 2025 Articles Question Answering
Code Code Available 0Advancing Egocentric Video Question Answering with Multimodal Large Language Models Apr 6, 2025 Object Recognition Question Answering
— Unverified 0UniRVQA: A Unified Framework for Retrieval-Augmented Vision Question Answering via Self-Reflective Joint Training Apr 5, 2025 Articles Question Answering
— Unverified 0Sigma: A dataset for text-to-code semantic parsing with statistical analysis Apr 5, 2025 Question Answering Semantic Parsing
Code Code Available 0YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization Apr 4, 2025 Community Question Answering Question Answering
Code Code Available 0Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion Apr 4, 2025 Diagnostic Medical Visual Question Answering
— Unverified 0SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding Apr 4, 2025 Language Modeling Language Modelling
Code Code Available 1Generative AI Enhanced Financial Risk Management Information Retrieval Apr 4, 2025 Information Retrieval Management
Code Code Available 0QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning Apr 4, 2025 Data Augmentation Image Generation
— Unverified 0Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task Apr 4, 2025 Open-Domain Question Answering Question Answering
— Unverified 0Bonsai: Interpretable Tree-Adaptive Grounded Reasoning Apr 4, 2025 Question Answering Specificity
— Unverified 0Single-Pass Document Scanning for Question Answering Apr 4, 2025 Question Answering
Code Code Available 1Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation Apr 3, 2025 Domain Generalization Question Answering
— Unverified 0LexPam: Legal Procedure Awareness-Guided Mathematical Reasoning Apr 3, 2025 Mathematical Reasoning Question Answering
— Unverified 0SocialGesture: Delving into Multi-person Gesture Understanding Apr 3, 2025 Gesture Recognition Question Answering
— Unverified 0STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection Apr 3, 2025 Instruction Following Language Modeling
Code Code Available 1Leveraging Static Relationships for Intra-Type and Inter-Type Message Passing in Video Question Answering Apr 3, 2025 Question Answering Video Question Answering
— Unverified 0Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph Apr 2, 2025 Language Modeling Language Modelling
— Unverified 0Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding Apr 2, 2025 Question Answering RAG
— Unverified 0CoRAG: Collaborative Retrieval-Augmented Generation Apr 2, 2025 Few-Shot Learning Open-Domain Question Answering
— Unverified 0GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning Apr 2, 2025 Decision Making Diagnostic
Code Code Available 1GTR: Graph-Table-RAG for Cross-Table Question Answering Apr 2, 2025 Question Answering RAG
— Unverified 0GeoRAG: A Question-Answering Approach from a Geographical Perspective Apr 2, 2025 Attribute Geographic Question Answering
— Unverified 0CyberBOT: Towards Reliable Cybersecurity Education via Ontology-Grounded Retrieval Augmented Generation Apr 1, 2025 Chatbot Question Answering
— Unverified 0MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving Apr 1, 2025 Autonomous Driving Prompt Learning
— Unverified 0Visual Environment-Interactive Planning for Embodied Complex-Question Answering Apr 1, 2025 Question Answering Task Planning
— Unverified 0Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models Apr 1, 2025 Benchmarking Conversational Question Answering
— Unverified 0SViQA: A Unified Speech-Vision Multimodal Model for Textless Visual Question Answering Apr 1, 2025 cross-modal alignment Question Answering
— Unverified 0FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning Apr 1, 2025 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 2Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation Mar 31, 2025 Knowledge Graphs Question Answering
— Unverified 0KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language Mar 31, 2025 Form Question Answering
Code Code Available 0An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering Mar 30, 2025 Hallucination Multi-hop Question Answering
— Unverified 0Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models Mar 30, 2025 Knowledge Graphs Multiple-choice
Code Code Available 0OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model Mar 30, 2025 Autonomous Driving Decision Making
Code Code Available 4A Retrieval-Augmented Knowledge Mining Method with Deep Thinking LLMs for Biomedical Research and Clinical Support Mar 29, 2025 Answer Generation Articles
— Unverified 0Memory-Aware and Uncertainty-Guided Retrieval for Multi-Hop Question Answering Mar 29, 2025 Multi-hop Question Answering Question Answering
— Unverified 0A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks Mar 29, 2025 Decision Making Multi-hop Question Answering
— Unverified 0FReM: A Flexible Reasoning Mechanism for Balancing Quick and Slow Thinking in Long-Context Question Answering Mar 29, 2025 Question Answering
— Unverified 0