Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images Feb 23, 2025 Adversarial Attack Question Answering
— Unverified 0MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering Feb 23, 2025 Object object-detection
— Unverified 0Uncertainty-Aware Fusion: An Ensemble Framework for Mitigating Hallucinations in Large Language Models Feb 22, 2025 Hallucination Question Answering
— Unverified 0Echo: A Large Language Model with Temporal Episodic Memory Feb 22, 2025 Language Modeling Language Modelling
— Unverified 0EPERM: An Evidence Path Enhanced Reasoning Model for Knowledge Graph Question and Answering Feb 22, 2025 Graph Question Answering Knowledge Graphs
— Unverified 0Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores Feb 22, 2025 Distractor Generation Information Retrieval
Code Code Available 0MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models Feb 21, 2025 Benchmarking Diagnostic
— Unverified 0TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba Feb 21, 2025 image-classification Image Classification
— Unverified 0Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid Feb 21, 2025 Large Language Model Prompt Engineering
— Unverified 0Empowering LLMs with Logical Reasoning: A Comprehensive Survey Feb 21, 2025 Logical Reasoning Negation
— Unverified 0Improving Consistency in Large Language Models through Chain of Guidance Feb 21, 2025 Question Answering
Code Code Available 0KVLink: Accelerating Large Language Models via Efficient KV Cache Reuse Feb 21, 2025 Question Answering
Code Code Available 1Mind the Gap! Static and Interactive Evaluations of Large Audio Models Feb 21, 2025 Question Answering
— Unverified 0Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? Feb 21, 2025 Question Answering
— Unverified 0Directional Gradient Projection for Robust Fine-Tuning of Foundation Models Feb 21, 2025 image-classification Image Classification
— Unverified 0Is Relevance Propagated from Retriever to Generator in RAG? Feb 20, 2025 Large Language Model Question Answering
— Unverified 0On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems Feb 20, 2025 Long Form Question Answering Question Answering
Code Code Available 0How to Get Your LLM to Generate Challenging Problems for Evaluation Feb 20, 2025 Code Completion Math
Code Code Available 1Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models Feb 20, 2025 Question Answering Visual Question Answering
Code Code Available 2Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework Feb 20, 2025 Benchmarking Question Answering
Code Code Available 0Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps Feb 20, 2025 Question Answering
Code Code Available 1Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information Feb 20, 2025 Question Answering
Code Code Available 1ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model Feb 20, 2025 Mixture-of-Experts Question Answering
Code Code Available 1Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering Feb 20, 2025 Answer Generation Multi-hop Question Answering
— Unverified 0NLP-AKG: Few-Shot Construction of NLP Academic Knowledge Graph Based on LLM Feb 20, 2025 graph construction Question Answering
— Unverified 0Effects of Prompt Length on Domain-specific Tasks for Large Language Models Feb 20, 2025 Machine Translation Prompt Engineering
— Unverified 0EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts Feb 20, 2025 16k Decoder
— Unverified 0Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests Feb 20, 2025 Logical Reasoning MMLU
— Unverified 0Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison Feb 20, 2025 Diversity Language Modeling
— Unverified 0Argument-Based Comparative Question Answering Evaluation Benchmark Feb 20, 2025 Question Answering
— Unverified 0How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Feb 20, 2025 Question Answering
Code Code Available 0MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models Feb 20, 2025 Decision Making Hallucination
— Unverified 0Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation Feb 19, 2025 Question Answering RAG
— Unverified 0Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning Feb 19, 2025 Autonomous Driving Bench2Drive
— Unverified 0PitVQA++: Vector Matrix-Low-Rank Adaptation for Open-Ended Visual Question Answering in Pituitary Surgery Feb 19, 2025 Question Answering Visual Question Answering
Code Code Available 0Navigating Semantic Relations: Challenges for Language Models in Abstract Common-Sense Reasoning Feb 19, 2025 Common Sense Reasoning Mathematical Problem-Solving
— Unverified 0Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above Feb 19, 2025 All Multiple-choice
— Unverified 0Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering Feb 19, 2025 Question Answering
Code Code Available 0PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models Feb 19, 2025 Open-Ended Question Answering Privacy Preserving
Code Code Available 0MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads Feb 19, 2025 Contrastive Learning Question Answering
Code Code Available 0Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models Feb 19, 2025 Memorization Question Answering
— Unverified 0RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering Feb 19, 2025 Decision Making Language Modeling
— Unverified 0MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering Feb 19, 2025 Decision Making Knowledge Base Question Answering
— Unverified 0PeerQA: A Scientific Question Answering Dataset from Peer Reviews Feb 19, 2025 answerability prediction Answer Generation
Code Code Available 1REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models Feb 19, 2025 Hallucination Language Modeling
— Unverified 0TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition Feb 19, 2025 Answer Generation Form
— Unverified 0DH-RAG: A Dynamic Historical Context-Powered Retrieval-Augmented Generation Method for Multi-Turn Dialogue Feb 19, 2025 Question Answering RAG
— Unverified 0TrustRAG: An Information Assistant with Retrieval Augmented Generation Feb 19, 2025 Answer Generation Chunking
Code Code Available 5Multilingual European Language Models: Benchmarking Approaches and Challenges Feb 18, 2025 Benchmarking Question Answering
— Unverified 0Savaal: Scalable Concept-Driven Question Generation to Enhance Human Learning Feb 18, 2025 Question Answering Question Generation
— Unverified 0