Multi-object event graph representation learning for Video Question Answering Sep 12, 2024 Contrastive Learning Graph Representation Learning
— Unverified 0Top-down Activity Representation Learning for Video Question Answering Sep 12, 2024 Question Answering Representation Learning
— Unverified 0Experimenting with Legal AI Solutions: The Case of Question-Answering for Access to Justice Sep 12, 2024 Question Answering Retrieval
— Unverified 0OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering Sep 12, 2024 Language Modeling Language Modelling
— Unverified 0An Evaluation Framework for Attributed Information Retrieval using Large Language Models Sep 12, 2024 Diversity Information Retrieval
Code Code Available 0Integrating SPARQL and LLMs for Question Answering over Scholarly Data Sources Sep 11, 2024 Extractive Question-Answering Question Answering
— Unverified 0AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge Sep 11, 2024 Language Modelling Large Language Model
Code Code Available 1Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks Sep 11, 2024 Image Captioning Question Answering
Code Code Available 0MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Sep 11, 2024 Ethics Hallucination
— Unverified 0Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering Sep 11, 2024 Question Answering Visual Question Answering
— Unverified 0KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation Sep 10, 2024 Knowledge Graphs Question Answering
Code Code Available 9VisScience: An Extensive Benchmark for Evaluating K12 Educational Multi-modal Scientific Reasoning Sep 10, 2024 Question Answering Visual Question Answering
— Unverified 0Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding Sep 10, 2024 Hallucination Image Captioning
— Unverified 0GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering Sep 10, 2024 Question Answering RAG
Code Code Available 1EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis Sep 10, 2024 Contrastive Learning Cross-Modal Retrieval
Code Code Available 2Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models Sep 10, 2024 Audio captioning Audio Question Answering
— Unverified 0LIME: Less Is More for MLLM Evaluation Sep 10, 2024 Image Captioning Question Answering
Code Code Available 1Accelerating Large Language Model Pretraining via LFR Pedagogy: Learn, Focus, and Review Sep 10, 2024 Language Modeling Language Modelling
— Unverified 0MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning Sep 9, 2024 Federated Learning Image Captioning
— Unverified 0M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework Sep 9, 2024 Computational Efficiency Cross-Modal Retrieval
Code Code Available 1Breaking Neural Network Scaling Laws with Modularity Sep 9, 2024 Question Answering Visual Question Answering
— Unverified 0Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling Sep 9, 2024 Language Modeling Language Modelling
Code Code Available 0Seek and Solve Reasoning for Table Question Answering Sep 9, 2024 In-Context Learning Question Answering
— Unverified 0RIRAG: Regulatory Information Retrieval and Answer Generation Sep 9, 2024 Answer Generation Information Retrieval
Code Code Available 1Towards Building a Robust Knowledge Intensive Question Answering Model with Large Language Models Sep 9, 2024 Contrastive Learning Data Augmentation
— Unverified 0MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery Sep 9, 2024 Memorization Question Answering
Code Code Available 7Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue Sep 7, 2024 Question Answering Speaker Identification
Code Code Available 0WebQuest: A Benchmark for Multimodal QA on Web Page Sequences Sep 6, 2024 Question Answering
— Unverified 0COLUMBUS: Evaluating COgnitive Lateral Understanding through Multiple-choice reBUSes Sep 6, 2024 Multiple-choice Question Answering
Code Code Available 0Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering Sep 6, 2024 Hallucination Knowledge Graphs
— Unverified 0Question-Answering Dense Video Events Sep 6, 2024 Benchmarking Question Answering
Code Code Available 0Vietnamese Legal Information Retrieval in Question-Answering System Sep 5, 2024 Hallucination Information Retrieval
— Unverified 0Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language Models Sep 5, 2024 Answer Generation Graph Question Answering
Code Code Available 1MARAGS: A Multi-Adapter System for Multi-Task Retrieval Augmented Generation Question Answering Sep 5, 2024 Question Answering RAG
— Unverified 0Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding Sep 5, 2024 Question Answering Scene Understanding
Code Code Available 2mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding Sep 5, 2024 document understanding GPU
— Unverified 0Enhancing Healthcare LLM Trust with Atypical Presentations Recalibration Sep 5, 2024 Decision Making Medical Question Answering
Code Code Available 0RAG based Question-Answering for Contextual Response Prediction System Sep 5, 2024 Prediction Question Answering
— Unverified 0The representation landscape of few-shot learning and fine-tuning in large language models Sep 5, 2024 Few-Shot Learning In-Context Learning
Code Code Available 0OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving Sep 5, 2024 Autonomous Driving Motion Planning
— Unverified 0Word and Phrase Features in Graph Convolutional Network for Automatic Question Classification Sep 4, 2024 Classification Graph Neural Network
— Unverified 0Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering Sep 4, 2024 Question Answering RAG
— Unverified 0LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA Sep 4, 2024 Question Answering Sentence
Code Code Available 4How Privacy-Savvy Are Large Language Models? A Case Study on Compliance and Privacy Technical Review Sep 4, 2024 Question Answering Text Generation
— Unverified 0GoT-CQA: Graph-of-Thought Guided Compositional Reasoning for Chart Question Answering Sep 4, 2024 Chart Question Answering Question Answering
— Unverified 0MOSMOS: Multi-organ segmentation facilitated by medical report supervision Sep 4, 2024 Contrastive Learning Organ Segmentation
— Unverified 0R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education Sep 4, 2024 Articles Information Retrieval
— Unverified 0Unforgettable Generalization in Language Models Sep 3, 2024 Physical Commonsense Reasoning Question Answering
— Unverified 0How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model? Sep 3, 2024 In-Context Learning Language Modeling
Code Code Available 0VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning Sep 3, 2024 Chart Question Answering Data Visualization
Code Code Available 1