Title | Date | Tasks | Code status
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ | Mar 6, 2024 | Open-Ended Question Answering Question Answering | Code Available
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem | Mar 6, 2024 | Benchmarking Hallucination | Code Available
CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments | Mar 5, 2024 | Language Modelling Large Language Model | Unverified
Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering | Mar 5, 2024 | Form Knowledge Graphs | Code Available
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use | Mar 5, 2024 | image-classification Image Classification | Unverified
Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation | Mar 5, 2024 | Data Augmentation Medical Visual Question Answering | Unverified
A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching | Mar 5, 2024 | Chatbot Community Question Answering | Unverified
Reliable, Adaptable, and Attributable Language Models with Retrieval | Mar 5, 2024 | Question Answering Retrieval | Unverified
MOKA: Open-World Robotic Manipulation through Mark-Based Visual Prompting | Mar 5, 2024 | In-Context Learning Object Rearrangement | Unverified
The Claude 3 Model Family: Opus, Sonnet, Haiku | Mar 4, 2024 | 1 Image, 2*2 Stitching Arithmetic Reasoning | Unverified
EEE-QA: Exploring Effective and Efficient Question-Answer Representations | Mar 4, 2024 | Knowledge Graphs Question Answering | Code Available
An Improved Traditional Chinese Evaluation Suite for Foundation Model | Mar 4, 2024 | Multiple-choice Question Answering | Unverified
Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge | Mar 3, 2024 | Data Augmentation Question Answering | Code Available
Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering | Mar 3, 2024 | Claim Verification Graph Question Answering | Unverified
Answerability in Retrieval-Augmented Open-Domain Question Answering | Mar 3, 2024 | Open-Domain Question Answering Question Answering | Unverified
KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations | Mar 3, 2024 | MedQA MMLU | Unverified
Automatic Question-Answer Generation for Long-Tail Knowledge | Mar 3, 2024 | Answer Generation Knowledge Graphs | Unverified
SyllabusQA: A Course Logistics Question Answering Dataset | Mar 3, 2024 | Language Modeling Language Modelling | Code Available
Improving Cross-lingual Representation for Semantic Retrieval with Code-switching | Mar 3, 2024 | Question Answering Retrieval | Unverified
API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access | Mar 2, 2024 | Conformal Prediction Open-Ended Question Answering | Unverified
VBART: The Turkish LLM | Mar 2, 2024 | Abstractive Text Summarization Question Answering | Unverified
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models | Mar 1, 2024 | Question Answering | Unverified
LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems | Mar 1, 2024 | Question Answering Retrieval | Code Available
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models | Mar 1, 2024 | Benchmarking Mathematical Reasoning | Unverified
TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Feb 29, 2024 | Question Answering Video Understanding | Unverified
OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models | Feb 29, 2024 | Medical Question Answering MedQA | Unverified
Prompting Explicit and Implicit Knowledge for Multi-hop Question Answering Based on Human Reading Process | Feb 29, 2024 | Multi-hop Question Answering Question Answering | Unverified
Survey in Characterization of Semantic Change | Feb 29, 2024 | Information Retrieval Question Answering | Unverified
Can GPT Improve the State of Prior Authorization via Guideline Based Automated Question Answering? | Feb 28, 2024 | Question Answering Text Generation | Unverified
A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision-Language Models | Feb 28, 2024 | Image Description Question Answering | Unverified
Unsupervised multiple choices question answering via universal corpus | Feb 27, 2024 | Form Knowledge Graphs | Unverified
Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models | Feb 27, 2024 | Common Sense Reasoning Question Answering | Code Available
ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks | Feb 27, 2024 | Domain Generalization Image Captioning | Unverified
JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability | Feb 27, 2024 | GPU Information Retrieval | Code Available
Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models | Feb 27, 2024 | Dark Humor Detection Dialogue Generation | Unverified
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8k Language Modeling | Code Available
VCD: Knowledge Base Guided Visual Commonsense Discovery in Images | Feb 27, 2024 | Decision Making Language Modelling | Unverified
Self-Refinement of Language Models from External Proxy Metrics Feedback | Feb 27, 2024 | Question Answering Response Generation | Unverified
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents | Feb 27, 2024 | Known Unknowns Question Answering | Unverified
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering | Feb 26, 2024 | Question Answering Retrieval | Unverified
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | Feb 26, 2024 | Data Augmentation document understanding | Unverified
Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models | Feb 26, 2024 | Answer Generation Generative Question Answering | Unverified
Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Feb 26, 2024 | Answer Generation Cross-Lingual Question Answering | Code Available
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts | Feb 26, 2024 | Diversity Question Answering | Unverified
PAQA: Toward ProActive Open-Retrieval Question Answering | Feb 26, 2024 | Conversational Search Passage Retrieval | Unverified
LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery | Feb 26, 2024 | Continual Learning Exemplar-Free | Code Available
GigaPevt: Multimodal Medical Assistant | Feb 26, 2024 | Question Answering | Unverified
Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research | Feb 25, 2024 | Question Answering | Unverified
Prompt Perturbation Consistency Learning for Robust Language Models | Feb 24, 2024 | Data Augmentation intent-classification | Unverified
Multimodal Transformer With a Low-Computational-Cost Guarantee | Feb 23, 2024 | Action Recognition Question Answering | Unverified