Pretraining and Updates of Domain-Specific LLM: A Case Study in the Japanese Business Domain | Apr 12, 2024 | Continual Pretraining General Knowledge | Unverified | 0
Synthetic Dataset Creation and Fine-Tuning of Transformer Models for Question Answering in Serbian | Apr 12, 2024 | Question Answering | Code Available | 0
Small Models Are (Still) Effective Cross-Domain Argument Extractors | Apr 12, 2024 | Event Argument Extraction Question Answering | Code Available | 0
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts | Apr 12, 2024 | Image Captioning Question Answering | Code Available | 1
Improving Health Question Answering with Reliable and Time-Aware Evidence Retrieval | Apr 12, 2024 | Articles Question Answering | Code Available | 0
MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting | Apr 11, 2024 | Question Answering | Unverified | 0
View Selection for 3D Captioning via Diffusion Ranking | Apr 11, 2024 | 3D Object Captioning Hallucination | Code Available | 3
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | Descriptive Hallucination | Code Available | 0
Audio Dialogues: Dialogues dataset for audio and music understanding | Apr 11, 2024 | Audio captioning Audio Question Answering | Unverified | 0
Language Models Meet Anomaly Detection for Better Interpretability and Generalizability | Apr 11, 2024 | Anomaly Detection Language Modelling | Code Available | 0
RiskLabs: Predicting Financial Risk Using Large Language Model based on Multimodal and Multi-Sources Data | Apr 11, 2024 | Binary Classification Language Modeling | Unverified | 0
Unraveling the Dilemma of AI Errors: Exploring the Effectiveness of Human and Machine Explanations for Large Language Models | Apr 11, 2024 | Explainable artificial intelligence Explainable Artificial Intelligence (XAI) | Unverified | 0
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models | Apr 11, 2024 | Bias Detection Fairness | Code Available | 1
On Unified Prompt Tuning for Request Quality Assurance in Public Code Review | Apr 11, 2024 | Language Modeling Language Modelling | Unverified | 0
LLoCO: Learning Long Contexts Offline | Apr 11, 2024 | 4k In-Context Learning | Code Available | 2
Enhancing Question Answering for Enterprise Knowledge Bases using Large Language Models | Apr 10, 2024 | Management Question Answering | Unverified | 0
Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking | Apr 10, 2024 | Question Answering | Code Available | 0
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study | Apr 10, 2024 | Form Long Form Question Answering | Unverified | 0
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation | Apr 10, 2024 | Question Answering RAG | Code Available | 2
LLMs' Reading Comprehension Is Affected by Parametric Knowledge and Struggles with Hypothetical Statements | Apr 9, 2024 | Natural Language Understanding Question Answering | Unverified | 0
Identifying Shopping Intent in Product QA for Proactive Recommendations | Apr 9, 2024 | Friction Mixture-of-Experts | Unverified | 0
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks | Apr 9, 2024 | Answer Selection Long-Context Understanding | Code Available | 2
SurveyAgent: A Conversational System for Personalized and Efficient Research Survey | Apr 9, 2024 | Management Question Answering | Unverified | 0
Visually Descriptive Language Model for Vector Graphics Reasoning | Apr 9, 2024 | Descriptive Language Modeling | Code Available | 9
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering | Apr 9, 2024 | EgoSchema Multiple-choice | Unverified | 0
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models | Apr 8, 2024 | Question Answering Reading Comprehension | Unverified | 0
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | Apr 8, 2024 | Adversarial Text Machine Translation | Unverified | 0
Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models | Apr 8, 2024 | Descriptive In-Context Learning | Unverified | 0
MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering | Apr 8, 2024 | Benchmarking Medical Question Answering | Unverified | 0
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Apr 8, 2024 | GPU Multiple-choice | Code Available | 3
HAMMR: HierArchical MultiModal React agents for generic VQA | Apr 8, 2024 | Optical Character Recognition (OCR) Question Answering | Unverified | 0
PerkwE_COQA: Enhanced Persian Conversational Question Answering by combining contextual keyword extraction with Large Language Models | Apr 8, 2024 | Conversational Question Answering Keyword Extraction | Unverified | 0
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding | Apr 8, 2024 | Domain Adaptation Extractive Question-Answering | Unverified | 0
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector | Apr 7, 2024 | Language Modeling Language Modelling | Unverified | 0
LLM-aided explanations of EDA synthesis errors | Apr 7, 2024 | Question Answering Reading Comprehension | Unverified | 0
X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model | Apr 7, 2024 | Action Recognition Decision Making | Unverified | 0
FRACTAL: Fine-Grained Scoring from Aggregate Text Labels | Apr 7, 2024 | Math Multiple Instance Learning | Unverified | 0
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement | Apr 6, 2024 | Image-text Retrieval object-detection | Unverified | 0
KazQAD: Kazakh Open-Domain Question Answering Dataset | Apr 6, 2024 | Information Retrieval Machine Translation | Code Available | 0
Multicalibration for Confidence Scoring in LLMs | Apr 6, 2024 | Benchmarking Question Answering | Unverified | 0
Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models | Apr 6, 2024 | MME Object | Code Available | 0
Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning | Apr 6, 2024 | Domain Generalization Image Retrieval | Code Available | 0
Best Response Shaping | Apr 5, 2024 | Deep Reinforcement Learning Question Answering | Unverified | 0
Koala: Key frame-conditioned long video-LLM | Apr 5, 2024 | Action Recognition Question Answering | Unverified | 0
BuDDIE: A Business Document Dataset for Multi-task Information Extraction | Apr 5, 2024 | Document Classification document understanding | Unverified | 0
Neural-Symbolic VideoQA: Learning Compositional Spatio-Temporal Reasoning for Real-world Video Question Answering | Apr 5, 2024 | Question Answering Video Question Answering | Unverified | 0
Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text? | Apr 5, 2024 | Question Answering Recommendation Systems | Unverified | 0
Mitigating LLM Hallucinations via Conformal Abstention | Apr 4, 2024 | Conformal Prediction Generative Question Answering | Unverified | 0
CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering | Apr 4, 2024 | Language Modeling Language Modelling | Code Available | 1
PRobELM: Plausibility Ranking Evaluation for Language Models | Apr 4, 2024 | Question Answering TruthfulQA | Unverified | 0