ALLM4ADD: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection May 16, 2025 Audio Deepfake Detection Audio Question Answering
— Unverified 0A Dataset for Spatiotemporal-Sensitive POI Question Answering May 16, 2025 Question Answering RAG
Code Code Available 0Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence May 15, 2025 Computational Efficiency Continual Learning
Code Code Available 0CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability May 15, 2025 Question Answering RAG
— Unverified 0What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs May 15, 2025 All Benchmarking
— Unverified 0End-to-End Vision Tokenizer Tuning May 15, 2025 Image Generation Question Answering
— Unverified 0Leveraging Graph Retrieval-Augmented Generation to Support Learners' Understanding of Knowledge Concepts in MOOCs May 15, 2025 Knowledge Graphs Question Answering
— Unverified 0Enhancing Multi-Image Question Answering via Submodular Subset Selection May 15, 2025 Question Answering Retrieval
— Unverified 0DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs May 15, 2025 Benchmarking Fairness
— Unverified 0DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation May 15, 2025 graph construction Hallucination
Code Code Available 0SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation May 14, 2025 Autonomous Driving Autonomous Navigation
— Unverified 0PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning May 14, 2025 Math Mathematical Problem-Solving
Code Code Available 0The Impact of Large Language Models on Task Automation in Manufacturing Services May 14, 2025 Hallucination Question Answering
— Unverified 0Focus, Merge, Rank: Improved Question Answering Based on Semi-structured Knowledge Bases May 14, 2025 Knowledge Graphs Multi-hop Question Answering
Code Code Available 0Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM? May 14, 2025 Audio Question Answering Question Answering
— Unverified 0Atomic Consistency Preference Optimization for Long-Form Question Answering May 14, 2025 Form Long Form Question Answering
Code Code Available 0Variational Visual Question Answering May 14, 2025 Question Answering Visual Question Answering
— Unverified 0Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging May 14, 2025 Question Answering Retrieval
Code Code Available 0Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage May 13, 2025 Knowledge Distillation Large Language Model
— Unverified 0Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reasoning? May 13, 2025 Chart Question Answering Fact Checking
Code Code Available 0WixQA: A Multi-Dataset Benchmark for Enterprise Retrieval-Augmented Generation May 13, 2025 Question Answering RAG
— Unverified 0Visually Interpretable Subtask Reasoning for Visual Question Answering May 12, 2025 Attribute Object Recognition
Code Code Available 0Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge May 12, 2025 Audio Question Answering Question Answering
— Unverified 0Relative Overfitting and Accept-Reject Framework May 12, 2025 Language Modeling Language Modelling
— Unverified 0Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption May 12, 2025 GPU Knowledge Base Question Answering
— Unverified 0Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration May 11, 2025 Benchmarking Descriptive
— Unverified 0Building a Human-Verified Clinical Reasoning Dataset via a Human LLM Hybrid Pipeline for Trustworthy Medical AI May 11, 2025 Medical Question Answering Question Answering
— Unverified 0Overview of the NLPCC 2025 Shared Task 4: Multi-modal, Multilingual, and Multi-hop Medical Instructional Video Question Answering Challenge May 11, 2025 Multimodal Reasoning Question Answering
— Unverified 0PLHF: Prompt Optimization with Few-Shot Human Feedback May 11, 2025 Question Answering
— Unverified 0OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval May 10, 2025 Cross-Modal Retrieval Question Answering
— Unverified 0NeoQA: Evidence-based Question Answering with Generated News Events May 9, 2025 Articles Question Answering
Code Code Available 0CellVerse: Do Large Language Models Really Understand Cell Biology? May 9, 2025 Drug Response Prediction Question Answering
— Unverified 0Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information May 9, 2025 Benchmarking Form
— Unverified 0A Grounded Memory System For Smart Personal Assistants May 9, 2025 Entity Disambiguation Image Captioning
— Unverified 0Document Attribution: Examining Citation Relationships using Large Language Models May 9, 2025 Document Summarization Natural Language Inference
— Unverified 0Towards Developmentally Plausible Rewards: Communicative Success as a Learning Signal for Interactive Language Models May 9, 2025 Language Acquisition Question Answering
— Unverified 0Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving May 9, 2025 Autonomous Driving Backdoor Attack
— Unverified 0Assessing Robustness to Spurious Correlations in Post-Training Language Models May 9, 2025 Instruction Following Mathematical Reasoning
— Unverified 0Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval May 8, 2025 Computational Efficiency Optical Character Recognition
— Unverified 0An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education May 8, 2025 Large Language Model Question Answering
— Unverified 0LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering May 8, 2025 Machine Translation Question Answering
Code Code Available 0Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models May 8, 2025 Active Learning cross-modal alignment
Code Code Available 0SITE: towards Spatial Intelligence Thorough Evaluation May 8, 2025 Question Answering Spatial Reasoning
— Unverified 0Fine-Tuning Large Language Models and Evaluating Retrieval Methods for Improved Question Answering on Building Codes May 7, 2025 Language Modeling Language Modelling
— Unverified 0HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights May 7, 2025 Articles Contrastive Learning
— Unverified 0Q-Heart: ECG Question Answering via Knowledge-Informed Multimodal LLMs May 7, 2025 Electrocardiography (ECG) Language Modeling
— Unverified 0Characterising Topic Familiarity and Query Specificity Using Eye-Tracking Data May 6, 2025 Pupil Dilation Question Answering
Code Code Available 0VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making May 6, 2025 Decision Making General Knowledge
— Unverified 0A Reasoning-Focused Legal Retrieval Benchmark May 6, 2025 Question Answering RAG
— Unverified 0DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes May 6, 2025 Question Answering
Code Code Available 0