FFA Sora, video generation as fundus fluorescein angiography simulator | Dec 23, 2024 | Privacy Preserving, Question Answering | Unverified 0
VidCtx: Context-aware Video Question Answering with Image Models | Dec 23, 2024 | Large Language Model, Question Answering | Code Available 0
Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2k, Caption Generation | Code Available 0
Evaluating LLM Reasoning in the Operations Research Domain with ORQA | Dec 22, 2024 | Question Answering | Code Available 2
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge | Dec 22, 2024 | Multi-hop Question Answering, Question Answering | Code Available 0
FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos | Dec 22, 2024 | Language Modelling, Large Language Model | Code Available 0
Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering | Dec 22, 2024 | Question Answering, Visual Question Answering | Unverified 0
Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding | Dec 21, 2024 | Attribute, Question Answering | Code Available 1
Application of Multimodal Large Language Models in Autonomous Driving | Dec 21, 2024 | Autonomous Driving, Decision Making | Unverified 0
DragonVerseQA: Open-Domain Long-Form Context-Aware Question-Answering | Dec 21, 2024 | Articles, Form | Code Available 0
Automated CVE Analysis: Harnessing Machine Learning In Designing Question-Answering Models For Cybersecurity Information Extraction | Dec 21, 2024 | Question Answering | Unverified 0
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization | Dec 21, 2024 | Image Captioning, Multimodal Reasoning | Code Available 0
STAMPsy: Towards SpatioTemporal-Aware Mixed-Type Dialogues for Psychological Counseling | Dec 21, 2024 | Conversational Recommendation, Dialogue Generation | Unverified 0
Speech Retrieval-Augmented Generation without Automatic Speech Recognition | Dec 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Contrastive Learning for Task-Independent SpeechLLM-Pretraining | Dec 20, 2024 | Contrastive Learning, Question Answering | Code Available 0
MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering | Dec 20, 2024 | Question Answering, Retrieval | Unverified 0
PolySmart @ TRECVid 2024 Medical Video Question Answering | Dec 20, 2024 | Question Answering, Retrieval | Unverified 0
HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases | Dec 20, 2024 | Question Answering, RAG | Unverified 0
Logical Consistency of Large Language Models in Fact-checking | Dec 20, 2024 | Fact Checking, Hallucination | Unverified 0
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning | Dec 20, 2024 | Graph Question Answering, Nutrition | Unverified 0
Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimization | Dec 19, 2024 | Contrastive Learning, Decision Making | Code Available 1
Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability | Dec 19, 2024 | Multi-hop Question Answering, Question Answering | Unverified 0
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering | Dec 19, 2024 | Contrastive Learning, Language Modeling | Code Available 0
Query pipeline optimization for cancer patient question answering systems | Dec 19, 2024 | Hallucination, Passage Retrieval | Unverified 0
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | Dec 19, 2024 | Efficient Exploration, Embodied Question Answering | Unverified 0
Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs | Dec 19, 2024 | Arithmetic Reasoning, Code Generation | Unverified 0
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Dec 19, 2024 | Autonomous Driving, Benchmarking | Code Available 2
CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering | Dec 19, 2024 | Question Answering | Code Available 0
FedPIA -- Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning | Dec 19, 2024 | Federated Learning, parameter-efficient fine-tuning | Unverified 0
Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models | Dec 19, 2024 | Autonomous Driving, Image Captioning | Code Available 0
FiVL: A Framework for Improved Vision-Language Alignment | Dec 19, 2024 | Answer Generation, Multimodal Reasoning | Code Available 0
EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues | Dec 19, 2024 | Change Detection, Disaster Response | Unverified 0
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces | Dec 18, 2024 | Question Answering, Spatial Reasoning | Code Available 4
RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs | Dec 18, 2024 | Question Answering | Code Available 0
CAD-Recode: Reverse Engineering CAD Code from Point Clouds | Dec 18, 2024 | CAD Reconstruction, Decoder | Code Available 3
A Cognitive Ideation Support Framework using IBM Watson Services | Dec 18, 2024 | Question Answering | Unverified 0
MedCoT: Medical Chain of Thought via Hierarchical Expert | Dec 18, 2024 | Diagnostic, Medical Visual Question Answering | Code Available 1
Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs | Dec 18, 2024 | Question Answering, RAG | Unverified 0
Consistency of Compositional Generalization across Multiple Levels | Dec 18, 2024 | Meta-Learning, Question Answering | Code Available 0
Knowledge Editing with Dynamic Knowledge Graphs for Multi-Hop Question Answering | Dec 18, 2024 | graph construction, knowledge editing | Code Available 1
A Concept-Centric Approach to Multi-Modality Learning | Dec 18, 2024 | Image-text matching, Question Answering | Unverified 0
ARTEMIS-DA: An Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics | Dec 18, 2024 | Code Generation, Information Retrieval | Unverified 0
Knowledge Graphs: The Future of Data Integration and Insightful Discovery | Dec 17, 2024 | Chatbot, Data Integration | Unverified 0
SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation | Dec 17, 2024 | Fact Verification, Knowledge Graphs | Code Available 2
On the Structural Memory of LLM Agents | Dec 17, 2024 | Language Modeling, Language Modelling | Code Available 0
Question: How do Large Language Models perform on the Question Answering tasks? Answer: | Dec 17, 2024 | Articles, Instruction Following | Unverified 0
LLM-based Discriminative Reasoning for Knowledge Graph Question Answering | Dec 17, 2024 | Graph Question Answering, Question Answering | Unverified 0
Modality-Inconsistent Continual Learning of Multimodal Large Language Models | Dec 17, 2024 | Continual Learning, Knowledge Distillation | Unverified 0
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues | Dec 17, 2024 | Language Modeling, Language Modelling | Code Available 0
When to Speak, When to Abstain: Contrastive Decoding with Abstention | Dec 17, 2024 | Hallucination, Question Answering | Unverified 0