LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement Nov 20, 2024 Autonomous Driving Computational Efficiency
— Unverified 0Retrieval-Augmented Generation for Domain-Specific Question Answering: A Case Study on Pittsburgh and CMU Nov 20, 2024 Question Answering RAG
— Unverified 0Evaluating LLMs Capabilities Towards Understanding Social Dynamics Nov 20, 2024 Prompt Engineering Question Answering
— Unverified 0Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training Nov 20, 2024 Contrastive Learning image-classification
— Unverified 0Neon: News Entity-Interaction Extraction for Enhanced Question Answering Nov 19, 2024 Articles Open Information Extraction
— Unverified 0AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Nov 19, 2024 GPU Question Answering
— Unverified 0CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs Nov 19, 2024 Hallucination Language Modeling
— Unverified 0DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding Nov 19, 2024 Question Answering Video Understanding
— Unverified 0Do LLMs Understand Ambiguity in Text? A Case Study in Open-world Question Answering Nov 19, 2024 Fact Checking Open-Domain Question Answering
— Unverified 0Med-2E3: A 2D-Enhanced 3D Medical Multimodal Large Language Model Nov 19, 2024 Language Modeling Language Modelling
— Unverified 0Value-Spectrum: Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts Nov 18, 2024 Benchmarking Multimodal Large Language Model
Code Code Available 0Mitigating Knowledge Conflicts in Language Model-Driven Question Answering Nov 18, 2024 Document Summarization Hallucination
— Unverified 0Memory-Augmented Multimodal LLMs for Surgical VQA via Self-Contained Inquiry Nov 17, 2024 Question Answering Scene Understanding
— Unverified 0Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering Nov 17, 2024 Hallucination In-Context Learning
Code Code Available 0Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning Nov 17, 2024 Image Captioning Language Modeling
Code Code Available 0A Comprehensive Survey on Visual Question Answering Datasets and Algorithms Nov 17, 2024 Diagnostic Miscellaneous
— Unverified 0ForPKG: A Framework for Constructing Forestry Policy Knowledge Graph and Application Analysis Nov 17, 2024 graph construction Knowledge Graphs
Code Code Available 0Large Vision-Language Models for Remote Sensing Visual Question Answering Nov 16, 2024 Language Modeling Language Modelling
— Unverified 0LLaSA: Large Language and Structured Data Assistant Nov 16, 2024 Hypergraph representations Question Answering
— Unverified 0Visual question answering based evaluation metrics for text-to-image generation Nov 15, 2024 Image Generation Image Manipulation
— Unverified 0SlimLM: An Efficient Small Language Model for On-Device Document Assistance Nov 15, 2024 Language Modeling Language Modelling
— Unverified 0Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity Nov 15, 2024 Contrastive Learning Hallucination
— Unverified 0AMXFP4: Taming Activation Outliers with Asymmetric Microscaling Floating-Point for 4-bit LLM Inference Nov 15, 2024 Quantization Question Answering
— Unverified 0Everything is a Video: Unifying Modalities through Next-Frame Prediction Nov 15, 2024 Caption Generation Cross-Modal Retrieval
— Unverified 0A Benchmark for Long-Form Medical Question Answering Nov 14, 2024 Answer Generation Form
Code Code Available 0Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering Nov 14, 2024 Medical Question Answering Misinformation
— Unverified 0The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models Nov 13, 2024 Medical Question Answering Question Answering
Code Code Available 0SparrowVQE: Visual Question Explanation for Course Content Understanding Nov 12, 2024 Question Answering Visual Question Answering
Code Code Available 0Likelihood as a Performance Gauge for Retrieval-Augmented Generation Nov 12, 2024 Language Modeling Language Modelling
Code Code Available 0Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach Nov 12, 2024 Abstractive Text Summarization Machine Translation
Code Code Available 0Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models Nov 12, 2024 Knowledge Distillation Question Answering
— Unverified 0Toward Optimal Search and Retrieval for RAG Nov 11, 2024 Question Answering RAG
Code Code Available 0Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation Nov 11, 2024 Hallucination Information Retrieval
— Unverified 0EVQAScore: Efficient Video Question Answering Data Evaluation Nov 11, 2024 Keyword Extraction Question Answering
— Unverified 0Greenback Bears and Fiscal Hawks: Finance is a Jungle and Text Embeddings Must Adapt Nov 11, 2024 Question Answering
— Unverified 0Subgraph Retrieval Enhanced by Graph-Text Alignment for Commonsense Question Answering Nov 11, 2024 Contrastive Learning Question Answering
— Unverified 0Self-Training Meets Consistency: Improving LLMs' Reasoning With Consistency-Driven Rationale Evaluation Nov 10, 2024 Question Answering
Code Code Available 0Leveraging Retrieval-Augmented Generation for Persian University Knowledge Retrieval Nov 9, 2024 Information Retrieval Prompt Engineering
— Unverified 0M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Nov 9, 2024 document understanding Question Answering
— Unverified 0Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment Nov 9, 2024 Question Answering RAG
— Unverified 0Poze: Sports Technique Feedback under Data Constraints Nov 8, 2024 Pose Estimation Question Answering
— Unverified 0GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification Nov 8, 2024 Question Answering text-classification
Code Code Available 0SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers Nov 8, 2024 Articles Question Answering
Code Code Available 0Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models Nov 8, 2024 Quantization Question Answering
— Unverified 0The Empirical Impact of Data Sanitization on Language Models Nov 8, 2024 Language Modeling Language Modelling
— Unverified 0WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning Nov 8, 2024 In-Context Learning Question Answering
— Unverified 0Multi-Document Financial Question Answering using LLMs Nov 8, 2024 Knowledge Distillation Knowledge Graphs
— Unverified 0Seeing Through the Fog: A Cost-Effectiveness Analysis of Hallucination Detection Systems Nov 8, 2024 Diagnostic Hallucination
— Unverified 0Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent Nov 8, 2024 Autonomous Driving Language Modeling
— Unverified 0Unmasking the Limits of Large Language Models: A Systematic Evaluation of Masked Text Processing Ability through MskQA and MskCal Nov 8, 2024 Question Answering
Code Code Available 0