FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering | Jun 19, 2024 | Answer Generation, Form | Unverified | 0
Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens | Jun 19, 2024 | Caption Generation, image-classification | Code Available | 0
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation | Jun 19, 2024 | Question Answering, RAG | Code Available | 1
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models | Jun 19, 2024 | Benchmarking, Open-Domain Question Answering | Unverified | 0
Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation | Jun 19, 2024 | Decision Making, Question Answering | Unverified | 0
Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators | Jun 19, 2024 | Fact Verification, Question Answering | Code Available | 1
Nash CoT: Multi-Path Inference with Preference Equilibrium | Jun 18, 2024 | Diversity, Question Answering | Code Available | 0
Diversify, Rationalize, and Combine: Ensembling Multiple QA Strategies for Zero-shot Knowledge-based VQA | Jun 18, 2024 | Question Answering, Visual Question Answering | Code Available | 0
LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization | Jun 18, 2024 | Document Summarization, Language Modelling | Unverified | 0
Towards Understanding Domain Adapted Sentence Embeddings for Document Retrieval | Jun 18, 2024 | Domain Adaptation, Question Answering | Unverified | 0
Intermediate Distillation: Data-Efficient Distillation from Black-Box LLMs for Information Retrieval | Jun 18, 2024 | Information Retrieval, Knowledge Distillation | Unverified | 0
VoCo-LLaMA: Towards Vision Compression with Large Language Models | Jun 18, 2024 | Computational Efficiency, Question Answering | Code Available | 3
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles | Jun 18, 2024 | Arithmetic Reasoning, Code Generation | Code Available | 1
GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory | Jun 18, 2024 | Code Generation, Mathematical Problem-Solving | Code Available | 0
Problem-Solving in Language Model Networks | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 0
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling | Jun 18, 2024 | Arithmetic Reasoning, Language Modeling | Code Available | 2
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries | Jun 18, 2024 | Question Answering, RAG | Unverified | 0
VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding | Jun 18, 2024 | Image Captioning, Question Answering | Code Available | 2
Exploring the Robustness of Language Models for Tabular Question Answering via Attention Analysis | Jun 18, 2024 | In-Context Learning, Question Answering | Unverified | 0
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems | Jun 18, 2024 | Language Modeling, Language Modelling | Unverified | 0
InternalInspector I^2: Robust Confidence Estimation in LLMs through Internal States | Jun 17, 2024 | Benchmarking, Contrastive Learning | Unverified | 0
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning | Jun 17, 2024 | Data Augmentation, Mathematical Reasoning | Code Available | 2
Mitigating Large Language Model Hallucination with Faithful Finetuning | Jun 17, 2024 | Hallucination, Language Modeling | Unverified | 0
Extrinsic Evaluation of Cultural Competence in Large Language Models | Jun 17, 2024 | Open-Ended Question Answering, Question Answering | Code Available | 0
MedCalc-Bench: Evaluating Large Language Models for Medical Calculations | Jun 17, 2024 | Descriptive, Medical Diagnosis | Code Available | 2
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs | Jun 17, 2024 | Question Answering | Unverified | 0
Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Jun 17, 2024 | AI and Safety, Question Answering | Code Available | 1
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 1
Soft Prompting for Unlearning in Large Language Models | Jun 17, 2024 | In-Context Learning, Machine Unlearning | Code Available | 1
SeRTS: Self-Rewarding Tree Search for Biomedical Retrieval-Augmented Generation | Jun 17, 2024 | Question Answering, RAG | Unverified | 0
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Jun 17, 2024 | Benchmarking, General Knowledge | Code Available | 0
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO | Jun 17, 2024 | Language Modelling, Question Answering | Code Available | 2
TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation | Jun 17, 2024 | Question Answering, RAG | Code Available | 1
Context Graph | Jun 17, 2024 | Knowledge Graphs, Question Answering | Unverified | 0
Task Me Anything | Jun 17, 2024 | 2k, Attribute | Code Available | 2
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question Answering, Instruction Following | Code Available | 2
Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models? | Jun 17, 2024 | Question Answering, Self-Learning | Code Available | 0
LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning | Jun 17, 2024 | Image Captioning, Question Answering | Unverified | 0
Iterative Utility Judgment Framework via LLMs Inspired by Relevance in Philosophy | Jun 17, 2024 | Answer Generation, Information Retrieval | Unverified | 0
Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment | Jun 17, 2024 | Logical Reasoning, Math | Unverified | 0
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 3
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models | Jun 17, 2024 | Benchmarking, Fact Checking | Code Available | 1
Hallucination Mitigation Prompts Long-term Video Understanding | Jun 17, 2024 | Answer Generation, Hallucination | Code Available | 0
Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities | Jun 17, 2024 | Question Answering, RAG | Code Available | 0
Mixture-of-Subspaces in Low-Rank Adaptation | Jun 16, 2024 | Common Sense Reasoning, Image Generation | Code Available | 0
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers | Jun 16, 2024 | Conversational Question Answering, Passage Retrieval | Unverified | 0
Towards Lifelong Dialogue Agents via Timeline-based Memory Management | Jun 16, 2024 | counterfactual, Management | Unverified | 0
Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts | Jun 16, 2024 | Decoder, Form | Code Available | 0
SCAR: Efficient Instruction-Tuning for Large Language Models via Style Consistency-Aware Response Ranking | Jun 16, 2024 | Open-Ended Question Answering, Question Answering | Code Available | 1
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture | Jun 16, 2024 | Diversity, Multiple-choice | Code Available | 1