Hadamard product in deep learning: Introduction, Advances and Challenges Apr 17, 2025 Computational Efficiency Deep Learning
— Unverified 0Bridging the Semantic Gaps: Improving Medical VQA Consistency with LLM-Augmented Question Sets Apr 16, 2025 Diversity Medical Visual Question Answering
— Unverified 0Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization Apr 16, 2025 Hallucination Question Answering
— Unverified 0Instruction-augmented Multimodal Alignment for Image-Text and Element Matching Apr 16, 2025 Image Augmentation Image Generation
— Unverified 0LLM-as-a-Judge: Reassessing the Performance of LLMs in Extractive QA Apr 16, 2025 Question Answering Reading Comprehension
Code Code Available 0Mitigating LLM Hallucinations with Knowledge Graphs: A Case Study Apr 16, 2025 Knowledge Graphs Question Answering
— Unverified 0AskQE: Question Answering as Automatic Evaluation for Machine Translation Apr 15, 2025 Machine Translation Question Answering
— Unverified 0Streamlining Biomedical Research with Specialized LLMs Apr 15, 2025 Decision Making Dialogue Generation
— Unverified 0Benchmarking Biopharmaceuticals Retrieval-Augmented Generation Evaluation Apr 15, 2025 Benchmarking Question Answering
— Unverified 0QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models Apr 15, 2025 Question Answering Visual Question Answering
Code Code Available 0LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation Apr 15, 2025 Image Captioning Question Answering
— Unverified 0RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models Apr 15, 2025 Question Answering
Code Code Available 0Ai2 Scholar QA: Organized Literature Synthesis with Attribution Apr 15, 2025 Question Answering Retrieval
Code Code Available 3Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs Apr 15, 2025 Medical Question Answering Question Answering
— Unverified 0From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs Apr 15, 2025 Hallucination Question Answering
— Unverified 0Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning Apr 14, 2025 Fact Verification Question Answering
— Unverified 0Constructing Micro Knowledge Graphs from Technical Support Documents Apr 14, 2025 Knowledge Graphs Question Answering
— Unverified 0VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents Apr 14, 2025 Question Answering RAG
— Unverified 0Hallucination Detection in LLMs via Topological Divergence on Attention Graphs Apr 14, 2025 Hallucination Question Answering
— Unverified 0Building Trustworthy Multimodal AI: A Review of Fairness, Transparency, and Ethics in Vision-Language Tasks Apr 14, 2025 Ethics Fairness
— Unverified 0MMKB-RAG: A Multi-Modal Knowledge-Based Retrieval-Augmented Generation Framework Apr 14, 2025 Question Answering RAG
— Unverified 0Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding Apr 14, 2025 Question Answering
Code Code Available 5ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models Apr 14, 2025 Autonomous Driving Autonomous Vehicles
Code Code Available 1See or Recall: A Sanity Check for the Role of Vision in Solving Visualization Question Answer Tasks with Multimodal LLMs Apr 14, 2025 Data Visualization Question Answering
— Unverified 0HD-RAG: Retrieval-Augmented Generation for Hybrid Documents Containing Text and Hierarchical Tables Apr 13, 2025 Question Answering RAG
— Unverified 0A Survey on Efficient Vision-Language Models Apr 13, 2025 Image Captioning Question Answering
Code Code Available 1Kongzi: A Historical Large Language Model with Fact Enhancement Apr 13, 2025 Language Modeling Language Modelling
— Unverified 0TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning Apr 13, 2025 Question Answering reinforcement-learning
Code Code Available 2PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks Apr 12, 2025 Computed Tomography (CT) Question Answering
— Unverified 0NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding Apr 12, 2025 Benchmarking Document AI
— Unverified 0Knowledge Graph-extended Retrieval Augmented Generation for Question Answering Apr 11, 2025 In-Context Learning Information Retrieval
— Unverified 0MedHal: An Evaluation Dataset for Medical Hallucination Detection Apr 11, 2025 Hallucination Natural Language Inference
— Unverified 0LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs Apr 11, 2025 Benchmarking Image Generation
Code Code Available 1AstroLLaVA: towards the unification of astronomical data and natural language Apr 11, 2025 Astronomy Image Captioning
— Unverified 0VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering Apr 11, 2025 cross-modal alignment Information Retrieval
— Unverified 0Out of Style: RAG's Fragility to Linguistic Variation Apr 11, 2025 Question Answering RAG
Code Code Available 0RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments Apr 11, 2025 Answer Generation Question Answering
Code Code Available 0Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking Apr 11, 2025 Moment Retrieval Question Answering
— Unverified 0Data Metabolism: An Efficient Data Design Schema For Vision Language Model Apr 10, 2025 Language Modeling Language Modelling
— Unverified 0Enhanced Question-Answering for Skill-based learning using Knowledge-based AI and Generative AI Apr 10, 2025 Question Answering
— Unverified 0Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation Apr 10, 2025 Question Answering Retrieval
Code Code Available 0TokenFocus-VQA: Enhancing Text-to-Image Alignment with Position-Aware Focus and Multi-Perspective Aggregations on LVLMs Apr 10, 2025 Ensemble Learning Position
— Unverified 0On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data Apr 10, 2025 Question Answering
— Unverified 0Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos Apr 10, 2025 Question Answering Video Generation
— Unverified 0MRD-RAG: Enhancing Medical Diagnosis with Multi-Round Retrieval-Augmented Generation Apr 10, 2025 Diagnostic Medical Diagnosis
Code Code Available 1TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models Apr 10, 2025 Question Answering
— Unverified 0How Can Objects Help Video-Language Understanding? Apr 10, 2025 Image Captioning Object
— Unverified 0Do LLMs Understand Your Translations? Evaluating Paragraph-level MT with Question Answering Apr 10, 2025 Machine Translation Question Answering
Code Code Available 0PR-Attack: Coordinated Prompt-RAG Attacks on Retrieval-Augmented Generation in Large Language Models via Bilevel Optimization Apr 10, 2025 Anomaly Detection Bilevel Optimization
— Unverified 0MDIT: A Model-free Data Interpolation Method for Diverse Instruction Tuning Apr 9, 2025 Code Generation Diversity
— Unverified 0