Enhancing Compositional Reasoning in Vision-Language Models with Synthetic Preference Data Apr 7, 2025 Question Answering Visual Question Answering
Code Code Available 0RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model Apr 7, 2025 Image Captioning image-classification
— Unverified 0Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models Apr 7, 2025 Question Answering Scheduling
— Unverified 0Towards Visual Text Grounding of Multimodal Large Language Model Apr 7, 2025 Benchmarking Language Modeling
— Unverified 0ArxivBench: Can LLMs Assist Researchers in Conducting Research? Apr 6, 2025 Articles Question Answering
Code Code Available 0Advancing Egocentric Video Question Answering with Multimodal Large Language Models Apr 6, 2025 Object Recognition Question Answering
— Unverified 0UniRVQA: A Unified Framework for Retrieval-Augmented Vision Question Answering via Self-Reflective Joint Training Apr 5, 2025 Articles Question Answering
— Unverified 0Sigma: A dataset for text-to-code semantic parsing with statistical analysis Apr 5, 2025 Question Answering Semantic Parsing
Code Code Available 0QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning Apr 4, 2025 Data Augmentation Image Generation
— Unverified 0Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task Apr 4, 2025 Open-Domain Question Answering Question Answering
— Unverified 0Bonsai: Interpretable Tree-Adaptive Grounded Reasoning Apr 4, 2025 Question Answering Specificity
— Unverified 0YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization Apr 4, 2025 Community Question Answering Question Answering
Code Code Available 0Generative AI Enhanced Financial Risk Management Information Retrieval Apr 4, 2025 Information Retrieval Management
Code Code Available 0Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion Apr 4, 2025 Diagnostic Medical Visual Question Answering
— Unverified 0Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation Apr 3, 2025 Domain Generalization Question Answering
— Unverified 0Leveraging Static Relationships for Intra-Type and Inter-Type Message Passing in Video Question Answering Apr 3, 2025 Question Answering Video Question Answering
— Unverified 0SocialGesture: Delving into Multi-person Gesture Understanding Apr 3, 2025 Gesture Recognition Question Answering
— Unverified 0LexPam: Legal Procedure Awareness-Guided Mathematical Reasoning Apr 3, 2025 Mathematical Reasoning Question Answering
— Unverified 0Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph Apr 2, 2025 Language Modeling Language Modelling
— Unverified 0GeoRAG: A Question-Answering Approach from a Geographical Perspective Apr 2, 2025 Attribute Geographic Question Answering
— Unverified 0CoRAG: Collaborative Retrieval-Augmented Generation Apr 2, 2025 Few-Shot Learning Open-Domain Question Answering
— Unverified 0GTR: Graph-Table-RAG for Cross-Table Question Answering Apr 2, 2025 Question Answering RAG
— Unverified 0Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding Apr 2, 2025 Question Answering RAG
— Unverified 0Visual Environment-Interactive Planning for Embodied Complex-Question Answering Apr 1, 2025 Question Answering Task Planning
— Unverified 0MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving Apr 1, 2025 Autonomous Driving Prompt Learning
— Unverified 0CyberBOT: Towards Reliable Cybersecurity Education via Ontology-Grounded Retrieval Augmented Generation Apr 1, 2025 Chatbot Question Answering
— Unverified 0Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models Apr 1, 2025 Benchmarking Conversational Question Answering
— Unverified 0SViQA: A Unified Speech-Vision Multimodal Model for Textless Visual Question Answering Apr 1, 2025 cross-modal alignment Question Answering
— Unverified 0KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language Mar 31, 2025 Form Question Answering
Code Code Available 0Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation Mar 31, 2025 Knowledge Graphs Question Answering
— Unverified 0Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models Mar 30, 2025 Knowledge Graphs Multiple-choice
Code Code Available 0An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering Mar 30, 2025 Hallucination Multi-hop Question Answering
— Unverified 0A Retrieval-Augmented Knowledge Mining Method with Deep Thinking LLMs for Biomedical Research and Clinical Support Mar 29, 2025 Answer Generation Articles
— Unverified 0Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery Mar 29, 2025 Action Understanding Instrument Recognition
— Unverified 0FReM: A Flexible Reasoning Mechanism for Balancing Quick and Slow Thinking in Long-Context Question Answering Mar 29, 2025 Question Answering
— Unverified 0Memory-Aware and Uncertainty-Guided Retrieval for Multi-Hop Question Answering Mar 29, 2025 Multi-hop Question Answering Question Answering
— Unverified 0A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks Mar 29, 2025 Decision Making Multi-hop Question Answering
— Unverified 0How Well Can Vison-Language Models Understand Humans' Intention? An Open-ended Theory of Mind Question Evaluation Benchmark Mar 28, 2025 Question Answering Visual Question Answering
— Unverified 0Patience is all you need! An agentic system for performing scientific literature review Mar 28, 2025 All Articles
— Unverified 0Preference-based Learning with Retrieval Augmented Generation for Conversational Question Answering Mar 28, 2025 Conversational Question Answering Question Answering
Code Code Available 0MemInsight: Autonomous Memory Augmentation for LLM Agents Mar 27, 2025 Conversational Recommendation Language Modeling
— Unverified 0Leveraging LLMs with Iterative Loop Structure for Enhanced Social Intelligence in Video Question Answering Mar 27, 2025 Emotion Recognition Question Answering
— Unverified 0SWI: Speaking with Intent in Large Language Models Mar 27, 2025 Mathematical Reasoning Question Answering
Code Code Available 0CTRL-O: Language-Controllable Object-Centric Visual Representation Learning Mar 27, 2025 Image Generation Object
— Unverified 0AssistPDA: An Online Video Surveillance Assistant for Video Anomaly Prediction, Detection, and Analysis Mar 27, 2025 Anomaly Detection Anomaly Forecasting
— Unverified 0JEEM: Vision-Language Understanding in Four Arabic Dialects Mar 27, 2025 Image Captioning Question Answering
— Unverified 0AskSport: Web Application for Sports Question-Answering Mar 27, 2025 Question Answering
— Unverified 0Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering Mar 26, 2025 Diagnostic Hallucination
— Unverified 0Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding Mar 26, 2025 GPU Question Answering
— Unverified 0Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs Mar 26, 2025 Hallucination Hallucination Evaluation
— Unverified 0