Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework | Nov 14, 2024 | Question Answering, RAG | Code Available | 1
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering | Nov 14, 2024 | Medical Question Answering, Misinformation | Unverified | 0
The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models | Nov 13, 2024 | Medical Question Answering, Question Answering | Code Available | 0
Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach | Nov 12, 2024 | Abstractive Text Summarization, Machine Translation | Code Available | 0
SparrowVQE: Visual Question Explanation for Course Content Understanding | Nov 12, 2024 | Question Answering, Visual Question Answering | Code Available | 0
Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models | Nov 12, 2024 | Knowledge Distillation, Question Answering | Unverified | 0
Likelihood as a Performance Gauge for Retrieval-Augmented Generation | Nov 12, 2024 | Language Modeling, Language Modelling | Code Available | 0
Greenback Bears and Fiscal Hawks: Finance is a Jungle and Text Embeddings Must Adapt | Nov 11, 2024 | Question Answering | Unverified | 0
Controllable Context Sensitivity and the Knob Behind It | Nov 11, 2024 | Question Answering, Retrieval-augmented Generation | Code Available | 1
Toward Optimal Search and Retrieval for RAG | Nov 11, 2024 | Question Answering, RAG | Code Available | 0
Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation | Nov 11, 2024 | Hallucination, Information Retrieval | Unverified | 0
EVQAScore: Efficient Video Question Answering Data Evaluation | Nov 11, 2024 | Keyword Extraction, Question Answering | Unverified | 0
Subgraph Retrieval Enhanced by Graph-Text Alignment for Commonsense Question Answering | Nov 11, 2024 | Contrastive Learning, Question Answering | Unverified | 0
Self-Training Meets Consistency: Improving LLMs' Reasoning With Consistency-Driven Rationale Evaluation | Nov 10, 2024 | Question Answering | Code Available | 0
Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment | Nov 9, 2024 | Question Answering, RAG | Unverified | 0
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework | Nov 9, 2024 | document understanding, Question Answering | Unverified | 0
Leveraging Retrieval-Augmented Generation for Persian University Knowledge Retrieval | Nov 9, 2024 | Information Retrieval, Prompt Engineering | Unverified | 0
The Empirical Impact of Data Sanitization on Language Models | Nov 8, 2024 | Language Modeling, Language Modelling | Unverified | 0
Multi-Document Financial Question Answering using LLMs | Nov 8, 2024 | Knowledge Distillation, Knowledge Graphs | Unverified | 0
GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification | Nov 8, 2024 | Question Answering, text-classification | Code Available | 0
Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models | Nov 8, 2024 | Quantization, Question Answering | Unverified | 0
Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent | Nov 8, 2024 | Autonomous Driving, Language Modeling | Unverified | 0
SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers | Nov 8, 2024 | Articles, Question Answering | Code Available | 0
Unmasking the Limits of Large Language Models: A Systematic Evaluation of Masked Text Processing Ability through MskQA and MskCal | Nov 8, 2024 | Question Answering | Code Available | 0
Poze: Sports Technique Feedback under Data Constraints | Nov 8, 2024 | Pose Estimation, Question Answering | Unverified | 0
End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering | Nov 8, 2024 | Language Modeling, Language Modelling | Code Available | 2
Seeing Through the Fog: A Cost-Effectiveness Analysis of Hallucination Detection Systems | Nov 8, 2024 | Diagnostic, Hallucination | Unverified | 0
WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning | Nov 8, 2024 | In-Context Learning, Question Answering | Unverified | 0
Survey on Semantic Interpretation of Tabular Data: Challenges and Directions | Nov 7, 2024 | Knowledge Graphs, Question Answering | Unverified | 0
A Brief History of Named Entity Recognition | Nov 7, 2024 | named-entity-recognition, Named Entity Recognition | Unverified | 0
M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding | Nov 7, 2024 | document understanding, Optical Character Recognition | Unverified | 0
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning | Nov 7, 2024 | Offline RL, Policy Gradient Methods | Unverified | 0
SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering | Nov 7, 2024 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Unverified | 0
DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 1
Seeing is Deceiving: Exploitation of Visual Pathways in Multi-Modal Language Models | Nov 7, 2024 | Adversarial Attack, Image Captioning | Unverified | 0
Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System | Nov 6, 2024 | All, Question Answering | Code Available | 0
M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models | Nov 6, 2024 | Information Retrieval, Question Answering | Unverified | 0
VQA^2: Visual Question Answering for Video Quality Assessment | Nov 6, 2024 | Question Answering, Video Quality Assessment | Code Available | 2
Select2Plan: Training-Free ICL-Based Planning through VQA and Memory Retrieval | Nov 6, 2024 | Autonomous Navigation, In-Context Learning | Unverified | 0
NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA | Nov 6, 2024 | Federated Learning, Language Modelling | Unverified | 0
MEG: Medical Knowledge-Augmented Large Language Models for Question Answering | Nov 6, 2024 | Knowledge Graph Embeddings, Multiple-choice | Code Available | 1
Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Nov 6, 2024 | Medical Question Answering, Question Answering | Code Available | 0
From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing | Nov 5, 2024 | Change Detection, Contrastive Learning | Unverified | 0
Leveraging Large Language Models in Code Question Answering: Baselines and Issues | Nov 5, 2024 | Large Language Model, Question Answering | Code Available | 0
PersianRAG: A Retrieval-Augmented Generation System for Persian Language | Nov 5, 2024 | Language Modeling, Language Modelling | Unverified | 0
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent | Nov 5, 2024 | Benchmarking, Hallucination | Code Available | 3
VERITAS: A Unified Approach to Reliability Evaluation | Nov 5, 2024 | Fact Checking, Hallucination | Unverified | 0
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning | Nov 5, 2024 | MME, Question Answering | Unverified | 0
Multimodal Commonsense Knowledge Distillation for Visual Question Answering | Nov 5, 2024 | Knowledge Distillation, Question Answering | Unverified | 0
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness | Nov 4, 2024 | Question Answering, Text Generation | Code Available | 3