MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting Apr 11, 2024 Question Answering
— Unverified 0RiskLabs: Predicting Financial Risk Using Large Language Model based on Multimodal and Multi-Sources Data Apr 11, 2024 Binary Classification Language Modeling
— Unverified 0Unraveling the Dilemma of AI Errors: Exploring the Effectiveness of Human and Machine Explanations for Large Language Models Apr 11, 2024 Explainable artificial intelligence Explainable Artificial Intelligence (XAI)
— Unverified 0On Unified Prompt Tuning for Request Quality Assurance in Public Code Review Apr 11, 2024 Language Modeling Language Modelling
— Unverified 0Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs Apr 11, 2024 Descriptive Hallucination
Code Code Available 0Audio Dialogues: Dialogues dataset for audio and music understanding Apr 11, 2024 Audio captioning Audio Question Answering
— Unverified 0Language Models Meet Anomaly Detection for Better Interpretability and Generalizability Apr 11, 2024 Anomaly Detection Language Modelling
Code Code Available 0Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking Apr 10, 2024 Question Answering
Code Code Available 0Enhancing Question Answering for Enterprise Knowledge Bases using Large Language Models Apr 10, 2024 Management Question Answering
— Unverified 0Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study Apr 10, 2024 Form Long Form Question Answering
— Unverified 0LLMs' Reading Comprehension Is Affected by Parametric Knowledge and Struggles with Hypothetical Statements Apr 9, 2024 Natural Language Understanding Question Answering
— Unverified 0SurveyAgent: A Conversational System for Personalized and Efficient Research Survey Apr 9, 2024 Management Question Answering
— Unverified 0MoReVQA: Exploring Modular Reasoning Models for Video Question Answering Apr 9, 2024 EgoSchema Multiple-choice
— Unverified 0Identifying Shopping Intent in Product QA for Proactive Recommendations Apr 9, 2024 Friction Mixture-of-Experts
— Unverified 0Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models Apr 8, 2024 Descriptive In-Context Learning
— Unverified 0Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods Apr 8, 2024 Adversarial Text Machine Translation
— Unverified 0PerkwE_COQA: Enhanced Persian Conversational Question Answering by combining contextual keyword extraction with Large Language Models Apr 8, 2024 Conversational Question Answering Keyword Extraction
— Unverified 0HAMMR: HierArchical MultiModal React agents for generic VQA Apr 8, 2024 Optical Character Recognition (OCR) Question Answering
— Unverified 0MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering Apr 8, 2024 Benchmarking Medical Question Answering
— Unverified 0Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding Apr 8, 2024 Domain Adaptation Extractive Question-Answering
— Unverified 0The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models Apr 8, 2024 Question Answering Reading Comprehension
— Unverified 0FRACTAL: Fine-Grained Scoring from Aggregate Text Labels Apr 7, 2024 Math Multiple Instance Learning
— Unverified 0LLM-aided explanations of EDA synthesis errors Apr 7, 2024 Question Answering Reading Comprehension
— Unverified 0Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector Apr 7, 2024 Language Modeling Language Modelling
— Unverified 0X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model Apr 7, 2024 Action Recognition Decision Making
— Unverified 0Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement Apr 6, 2024 Image-text Retrieval object-detection
— Unverified 0Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models Apr 6, 2024 MME Object
Code Code Available 0Multicalibration for Confidence Scoring in LLMs Apr 6, 2024 Benchmarking Question Answering
— Unverified 0Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning Apr 6, 2024 Domain Generalization Image Retrieval
Code Code Available 0KazQAD: Kazakh Open-Domain Question Answering Dataset Apr 6, 2024 Information Retrieval Machine Translation
Code Code Available 0Koala: Key frame-conditioned long video-LLM Apr 5, 2024 Action Recognition Question Answering
— Unverified 0BuDDIE: A Business Document Dataset for Multi-task Information Extraction Apr 5, 2024 Document Classification document understanding
— Unverified 0Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text? Apr 5, 2024 Question Answering Recommendation Systems
— Unverified 0Best Response Shaping Apr 5, 2024 Deep Reinforcement Learning Question Answering
— Unverified 0Neural-Symbolic VideoQA: Learning Compositional Spatio-Temporal Reasoning for Real-world Video Question Answering Apr 5, 2024 Question Answering Video Question Answering
— Unverified 0PRobELM: Plausibility Ranking Evaluation for Language Models Apr 4, 2024 Question Answering TruthfulQA
— Unverified 0Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Apr 4, 2024 Extractive Question-Answering Knowledge Distillation
— Unverified 0Learning to Plan and Generate Text with Citations Apr 4, 2024 Long Form Question Answering Question Answering
— Unverified 0Mitigating LLM Hallucinations via Conformal Abstention Apr 4, 2024 Conformal Prediction Generative Question Answering
— Unverified 0Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models Apr 4, 2024 Question Answering
Code Code Available 0TinyVQA: Compact Multimodal Deep Neural Network for Visual Question Answering on Resource-Constrained Devices Apr 4, 2024 Quantization Question Answering
— Unverified 0The Death of Feature Engineering? BERT with Linguistic Features on SQuAD 2.0 Apr 4, 2024 Feature Engineering Machine Reading Comprehension
— Unverified 0Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0Automatic Prompt Selection for Large Language Models Apr 3, 2024 GSM8K Question Answering
— Unverified 0Rematch: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity Apr 2, 2024 Abstract Meaning Representation Fact Checking
Code Code Available 0Using Large Language Models to Understand Telecom Standards Apr 2, 2024 Question Answering
— Unverified 0Self-Improvement Programming for Temporal Knowledge Graph Question Answering Apr 2, 2024 Graph Question Answering In-Context Learning
— Unverified 0Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization Apr 2, 2024 Memorization Open-Domain Question Answering
— Unverified 0mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning Apr 2, 2024 Chart Question Answering Language Modeling
— Unverified 0Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts Apr 2, 2024 In-Context Learning Language Modeling
Code Code Available 0