VBART: The Turkish LLM Mar 2, 2024 Abstractive Text Summarization Question Answering
— Unverified 0API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access Mar 2, 2024 Conformal Prediction Open-Ended Question Answering
— Unverified 0MediSwift: Efficient Sparse Pre-trained Biomedical Language Models Mar 1, 2024 Question Answering
— Unverified 0Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models Mar 1, 2024 Benchmarking Mathematical Reasoning
— Unverified 0LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems Mar 1, 2024 Question Answering Retrieval
Code Code Available 0TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning Feb 29, 2024 Question Answering Video Understanding
— Unverified 0Prompting Explicit and Implicit Knowledge for Multi-hop Question Answering Based on Human Reading Process Feb 29, 2024 Multi-hop Question Answering Question Answering
— Unverified 0Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark Feb 29, 2024 Question Answering
Code Code Available 1Survey in Characterization of Semantic Change Feb 29, 2024 Information Retrieval Question Answering
— Unverified 0OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models Feb 29, 2024 Medical Question Answering MedQA
— Unverified 0Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension Feb 28, 2024 Language Modeling Language Modelling
Code Code Available 1Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation Feb 28, 2024 Attribute Extractive Question-Answering
Code Code Available 4Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions Feb 28, 2024 Benchmarking Multiple-choice
Code Code Available 1Can GPT Improve the State of Prior Authorization via Guideline Based Automated Question Answering? Feb 28, 2024 Question Answering Text Generation
— Unverified 0Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation Feb 28, 2024 Code Generation In-Context Learning
Code Code Available 2The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA Feb 28, 2024 Natural Language Understanding Question Answering
Code Code Available 2A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision-Language Models Feb 28, 2024 Image Description Question Answering
— Unverified 0Self-Refinement of Language Models from External Proxy Metrics Feedback Feb 27, 2024 Question Answering Response Generation
— Unverified 0Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey Feb 27, 2024 Language Modeling Language Modelling
Code Code Available 2BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra Feb 27, 2024 Question Answering
Code Code Available 2JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability Feb 27, 2024 GPU Information Retrieval
Code Code Available 0ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks Feb 27, 2024 Domain Generalization Image Captioning
— Unverified 0Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese Feb 27, 2024 General Knowledge Question Answering
Code Code Available 1Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models Feb 27, 2024 Common Sense Reasoning Question Answering
Code Code Available 0Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents Feb 27, 2024 Known Unknowns Question Answering
— Unverified 0NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents Feb 27, 2024 Document Classification Language Modeling
Code Code Available 1Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models Feb 27, 2024 Dark Humor Detection Dialogue Generation
— Unverified 0REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering Feb 27, 2024 Open-Domain Question Answering Question Answering
Code Code Available 1Unsupervised multiple choices question answering via universal corpus Feb 27, 2024 Form Knowledge Graphs
— Unverified 0TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space Feb 27, 2024 Contrastive Learning Hallucination
Code Code Available 2MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning Feb 27, 2024 8k Language Modeling
Code Code Available 0VCD: Knowledge Base Guided Visual Commonsense Discovery in Images Feb 27, 2024 Decision Making Language Modelling
— Unverified 0Evaluating Very Long-Term Conversational Memory of LLM Agents Feb 27, 2024 Avg Dialogue Generation
Code Code Available 1Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning Feb 26, 2024 Data Augmentation document understanding
— Unverified 0LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery Feb 26, 2024 Continual Learning Exemplar-Free
Code Code Available 0Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models Feb 26, 2024 Answer Generation Generative Question Answering
— Unverified 0PAQA: Toward ProActive Open-Retrieval Question Answering Feb 26, 2024 Conversational Search Passage Retrieval
— Unverified 0RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering Feb 26, 2024 Form Open-Domain Question Answering
Code Code Available 2Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts Feb 26, 2024 Diversity Question Answering
— Unverified 0GigaPevt: Multimodal Medical Assistant Feb 26, 2024 Question Answering
— Unverified 0Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision Feb 26, 2024 Answer Generation Cross-Lingual Question Answering
Code Code Available 0Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering Feb 26, 2024 Evidence Selection Open-Ended Question Answering
Code Code Available 4PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering Feb 26, 2024 Question Answering Retrieval
— Unverified 0MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property Feb 26, 2024 Language Modeling Language Modelling
Code Code Available 1EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries Feb 25, 2024 Decision Making Question Answering
Code Code Available 1Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge Feb 25, 2024 Computational Efficiency Language Modelling
Code Code Available 1Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research Feb 25, 2024 Question Answering
— Unverified 0Prompt Perturbation Consistency Learning for Robust Language Models Feb 24, 2024 Data Augmentation intent-classification
— Unverified 0Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA Feb 24, 2024 3D Question Answering (3D-QA) Question Answering
Code Code Available 1DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures Feb 23, 2024 Question Answering Text Generation
— Unverified 0