MapQA: Open-domain Geospatial Question Answering on Map Data Mar 10, 2025 Diversity Language Modeling
— Unverified 0Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru Mar 10, 2025 Autonomous Driving Question Answering
— Unverified 0From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics Mar 10, 2025 Math Question Answering
— Unverified 0Talking to GDELT Through Knowledge Graphs Mar 10, 2025 Articles Knowledge Graphs
— Unverified 0Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning Mar 10, 2025 Question Answering
— Unverified 0TI-JEPA: An Innovative Energy-based Joint Embedding Strategy for Text-Image Multimodal Systems Mar 9, 2025 Multimodal Sentiment Analysis Question Answering
— Unverified 0Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving Mar 9, 2025 Graph Question Answering Question Answering
— Unverified 0Green Prompting Mar 9, 2025 Code Generation Question Answering
— Unverified 0VisualSimpleQA: A Benchmark for Decoupled Evaluation of Large Vision-Language Models in Fact-Seeking Question Answering Mar 9, 2025 Question Answering
— Unverified 0Delusions of Large Language Models Mar 9, 2025 Question Answering Retrieval-augmented Generation
— Unverified 0Vector Quantized Feature Fields for Fast 3D Semantic Lifting Mar 9, 2025 Embodied Question Answering Question Answering
— Unverified 0SplatTalk: 3D VQA with Gaussian Splatting Mar 8, 2025 3DGS Question Answering
— Unverified 0Integrating Frequency-Domain Representations with Low-Rank Adaptation in Vision-Language Models Mar 8, 2025 Caption Generation Question Answering
— Unverified 0MoEMoE: Question Guided Dense and Scalable Sparse Mixture-of-Expert for Multi-source Multi-modal Answering Mar 8, 2025 Answer Generation Mixture-of-Experts
— Unverified 0Treble Counterfactual VLMs: A Causal Approach to Hallucination Mar 8, 2025 Autonomous Driving counterfactual
Code Code Available 0Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework Mar 7, 2025 Conformal Prediction Medical Question Answering
— Unverified 0Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts Mar 6, 2025 counterfactual Counterfactual Reasoning
— Unverified 0Architecture for a Trustworthy Quantum Chatbot Mar 6, 2025 Chatbot Large Language Model
— Unverified 0Evaluating Answer Reranking Strategies in Time-sensitive Question Answering Mar 6, 2025 Answer Selection Information Retrieval
— Unverified 0Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets Mar 6, 2025 Benchmarking Dataset Generation
— Unverified 0Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Mar 6, 2025 Audio captioning Language Modeling
— Unverified 0LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression Mar 6, 2025 Benchmarking Common Sense Reasoning
Code Code Available 0Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge Mar 6, 2025 Continual Pretraining Memorization
Code Code Available 0Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation Mar 6, 2025 Active Learning Image Segmentation
— Unverified 0Enhancing Vietnamese VQA through Curriculum Learning on Raw and Augmented Text Representations Mar 5, 2025 Question Answering Visual Question Answering
Code Code Available 0FANS -- Formal Answer Selection for Natural Language Math Reasoning Using Lean4 Mar 5, 2025 Answer Selection Math
— Unverified 0Vision-Language Models Struggle to Align Entities across Modalities Mar 5, 2025 Attribute Code Generation
— Unverified 0Task-Agnostic Attacks Against Vision Foundation Models Mar 5, 2025 Depth Estimation Question Answering
Code Code Available 0Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability Mar 5, 2025 Language Modeling Language Modelling
— Unverified 0Structured Outputs Enable General-Purpose LLMs to be Medical Experts Mar 5, 2025 Clinical Knowledge Medical Question Answering
— Unverified 0AttackSeqBench: Benchmarking Large Language Models' Understanding of Sequential Patterns in Cyber Attacks Mar 5, 2025 Benchmarking graph construction
Code Code Available 0Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks Mar 5, 2025 Medical Question Answering parameter-efficient fine-tuning
Code Code Available 0Zero-Shot Complex Question-Answering on Long Scientific Documents Mar 4, 2025 Answer Generation document understanding
Code Code Available 0Towards Robust Expert Finding in Community Question Answering Platforms Mar 4, 2025 Community Question Answering Question Answering
Code Code Available 0OWLViz: An Open-World Benchmark for Visual Question Answering Mar 4, 2025 Question Answering Visual Question Answering
— Unverified 0BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA Mar 4, 2025 Medical Diagnosis Question Answering
Code Code Available 0EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports Mar 4, 2025 Fairness Question Answering
— Unverified 0Optimizing open-domain question answering with graph-based retrieval augmented generation Mar 4, 2025 Benchmarking Language Modeling
— Unverified 0Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models Mar 3, 2025 Memorization Question Answering
Code Code Available 0SAGE: A Framework of Precise Retrieval for RAG Mar 3, 2025 Question Answering RAG
— Unverified 0Beyond Prompting: An Efficient Embedding Framework for Open-Domain Question Answering Mar 3, 2025 Contrastive Learning Open-Domain Question Answering
— Unverified 0Causal Tree Extraction from Medical Case Reports: A Novel Task for Experts-like Text Comprehension Mar 3, 2025 Diagnostic Medical Relation Extraction
— Unverified 0When an LLM is apprehensive about its answers -- and when its uncertainty is justified Mar 3, 2025 Math MMLU
Code Code Available 0SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph Mar 3, 2025 Question Answering RAG
— Unverified 0Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution Mar 3, 2025 counterfactual Domain Adaptation
— Unverified 0Q-NL Verifier: Leveraging Synthetic Data for Robust Knowledge Graph Question Answering Mar 3, 2025 Graph Question Answering Question Answering
Code Code Available 0Parameter-free Video Segmentation for Vision and Language Understanding Mar 3, 2025 Question Answering Video Question Answering
— Unverified 0Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks Mar 2, 2025 Benchmarking Chatbot
— Unverified 0Optimizing Multi-Hop Document Retrieval Through Intermediate Representations Mar 2, 2025 Multi-hop Question Answering Question Answering
— Unverified 0ER-RAG: Enhance RAG with ER-Based Unified Modeling of Heterogeneous Data Sources Mar 2, 2025 Entity Retrieval Knowledge Graphs
— Unverified 0