Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases | Oct 21, 2024 | Object, Question Answering | Unverified 0
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | Oct 21, 2024 | Instruction Following, object-detection | Unverified 0
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant | Oct 20, 2024 | Question Answering, speech-recognition | Code Available 7
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression | Oct 20, 2024 | In-Context Learning, Long-Context Understanding | Code Available 1
MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures | Oct 20, 2024 | Answer Generation, Informativeness | Code Available 0
Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer? | Oct 20, 2024 | Question Answering, valid | Code Available 0
CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts | Oct 20, 2024 | Question Answering, Visual Question Answering | Code Available 0
Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering | Oct 20, 2024 | Language Modelling, Large Language Model | Unverified 0
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla | Oct 19, 2024 | Question Answering, Visual Question Answering | Unverified 0
Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models | Oct 19, 2024 | Hallucination, Language Modeling | Unverified 0
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound | Oct 19, 2024 | Instruction Following, Knowledge Distillation | Unverified 0
Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning | Oct 18, 2024 | Question Answering | Code Available 0
Electrocardiogram-Language Model for Few-Shot Question Answering with Meta Learning | Oct 18, 2024 | Diagnostic, Language Modeling | Unverified 0
SPFresh: Incremental In-Place Update for Billion-Scale Vector Search | Oct 18, 2024 | Information Retrieval, Question Answering | Unverified 0
Optimizing Retrieval-Augmented Generation with Elasticsearch for Enhanced Question-Answering Systems | Oct 18, 2024 | Question Answering, RAG | Unverified 0
Addressing Blind Guessing: Calibration of Selection Bias in Multiple-Choice Question Answering by Video Language Models | Oct 18, 2024 | Fairness, Multiple-choice | Unverified 0
SwaQuAD-24: QA Benchmark Dataset in Swahili | Oct 18, 2024 | Diversity, Information Retrieval | Unverified 0
MCSFF: Multi-modal Consistency and Specificity Fusion Framework for Entity Alignment | Oct 18, 2024 | Entity Alignment, Information Retrieval | Unverified 0
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens | Oct 18, 2024 | Math, Question Answering | Unverified 0
ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope Questions | Oct 18, 2024 | Hallucination, Natural Questions | Code Available 0
Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning | Oct 18, 2024 | Hallucination, Knowledge Base Question Answering | Code Available 1
DiscoGraMS: Enhancing Movie Screen-Play Summarization using Movie Character-Aware Discourse Graph | Oct 18, 2024 | Document Summarization, Question Answering | Unverified 0
MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems | Oct 18, 2024 | Benchmarking, Question Answering | Code Available 1
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples | Oct 18, 2024 | Attribute, Question Answering | Unverified 0
RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training | Oct 18, 2024 | Denoising, Question Answering | Unverified 0
E3D-GPT: Enhanced 3D Visual Foundation for Medical Vision-Language Model | Oct 18, 2024 | Language Modeling, Language Modelling | Unverified 0
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering | Oct 18, 2024 | Question Answering, Visual Question Answering | Code Available 0
Zero-shot Action Localization via the Confidence of Large Vision-Language Models | Oct 18, 2024 | Action Localization, Language Modelling | Unverified 0
Accounting for Sycophancy in Language Model Uncertainty Estimation | Oct 17, 2024 | Language Modeling, Language Modelling | Unverified 0
FinQAPT: Empowering Financial Decisions with End-to-End LLM-driven Question Answering Pipeline | Oct 17, 2024 | Decision Making, Question Answering | Unverified 0
From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs | Oct 17, 2024 | Dialogue Understanding, Management | Unverified 0
A Little Human Data Goes A Long Way | Oct 17, 2024 | Fact Verification, Question Answering | Code Available 0
RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models | Oct 17, 2024 | Image Captioning, Question Answering | Code Available 2
BQA: Body Language Question Answering Dataset for Video Large Language Models | Oct 17, 2024 | Question Answering | Unverified 0
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models | Oct 17, 2024 | Language Modelling, Large Language Model | Unverified 0
RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents | Oct 17, 2024 | Question Answering, Task Planning | Unverified 0
Advancing Large Language Model Attribution through Self-Improving | Oct 17, 2024 | Language Modeling, Language Modelling | Unverified 0
Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations | Oct 17, 2024 | Decision Making, Form | Code Available 0
Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | All, Language Modeling | Code Available 0
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning | Oct 17, 2024 | Mathematical Reasoning, Question Answering | Unverified 0
Developing Question-Answering Models in Low-Resource Languages: A Case Study on Turkish Medical Texts Using Transformer-Based Approaches | Oct 16, 2024 | Language Modeling, Language Modelling | Unverified 0
REFINE on Scarce Data: Retrieval Enhancement through Fine-Tuning via Model Fusion of Embedding Models | Oct 16, 2024 | Data Augmentation, Language Modeling | Unverified 0
Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception | Oct 16, 2024 | Binary Classification, Chunking | Code Available 3
Large Language Models as a Tool for Mining Object Knowledge | Oct 16, 2024 | General Knowledge, Knowledge Base Construction | Unverified 0
LEGAL-UQA: A Low-Resource Urdu-English Dataset for Legal Question Answering | Oct 16, 2024 | Optical Character Recognition (OCR), Question Answering | Code Available 0
An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation | Oct 16, 2024 | Dialogue Generation, Question Answering | Unverified 0
Open Domain Question Answering with Conflicting Contexts | Oct 16, 2024 | Open-Domain Question Answering, Question Answering | Unverified 0
Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs | Oct 16, 2024 | Knowledge Graphs, Question Answering | Unverified 0
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines | Oct 16, 2024 | Question Answering, Visual Question Answering | Code Available 1
A Claim Decomposition Benchmark for Long-form Answer Verification | Oct 16, 2024 | Form, Hallucination | Code Available 0