SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering Nov 7, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
— Unverified 0A Brief History of Named Entity Recognition Nov 7, 2024 named-entity-recognition Named Entity Recognition
— Unverified 0Survey on Semantic Interpretation of Tabular Data: Challenges and Directions Nov 7, 2024 Knowledge Graphs Question Answering
— Unverified 0M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding Nov 7, 2024 document understanding Optical Character Recognition
— Unverified 0Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning Nov 7, 2024 Offline RL Policy Gradient Methods
— Unverified 0Seeing is Deceiving: Exploitation of Visual Pathways in Multi-Modal Language Models Nov 7, 2024 Adversarial Attack Image Captioning
— Unverified 0M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models Nov 6, 2024 Information Retrieval Question Answering
— Unverified 0NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA Nov 6, 2024 Federated Learning Language Modelling
— Unverified 0Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? Nov 6, 2024 Medical Question Answering Question Answering
Code Code Available 0Select2Plan: Training-Free ICL-Based Planning through VQA and Memory Retrieval Nov 6, 2024 Autonomous Navigation In-Context Learning
— Unverified 0Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System Nov 6, 2024 All Question Answering
Code Code Available 0Multimodal Commonsense Knowledge Distillation for Visual Question Answering Nov 5, 2024 Knowledge Distillation Question Answering
— Unverified 0MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning Nov 5, 2024 MME Question Answering
— Unverified 0Leveraging Large Language Models in Code Question Answering: Baselines and Issues Nov 5, 2024 Large Language Model Question Answering
Code Code Available 0From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing Nov 5, 2024 Change Detection Contrastive Learning
— Unverified 0VERITAS: A Unified Approach to Reliability Evaluation Nov 5, 2024 Fact Checking Hallucination
— Unverified 0PersianRAG: A Retrieval-Augmented Generation System for Persian Language Nov 5, 2024 Language Modeling Language Modelling
— Unverified 0FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees Nov 4, 2024 Multiple-choice Question Answering
— Unverified 0Can Language Models Enable In-Context Database? Nov 4, 2024 Question Answering RAG
— Unverified 0One VLM to Keep it Learning: Generation and Balancing for Data-free Continual Visual Question Answering Nov 4, 2024 Continual Learning Question Answering
— Unverified 0AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis Nov 4, 2024 Language Modeling Language Modelling
— Unverified 0Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI Nov 4, 2024 Conformal Prediction Prediction
— Unverified 0A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles Nov 4, 2024 Question Answering Story Generation
— Unverified 0Goal-Oriented Semantic Communication for Wireless Visual Question Answering Nov 3, 2024 Edge-computing Question Answering
— Unverified 0RS-MoE: Mixture of Experts for Remote Sensing Image Captioning and Visual Question Answering Nov 3, 2024 Descriptive Image Captioning
— Unverified 0Diagnosing Medical Datasets with Training Dynamics Nov 3, 2024 Medical Question Answering Question Answering
Code Code Available 0A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning Nov 3, 2024 object-detection Object Detection
— Unverified 0LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding Nov 2, 2024 document understanding Question Answering
— Unverified 0Designing a Robust Radiology Report Generation System Nov 2, 2024 Decision Making Diagnostic
— Unverified 0Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions Nov 1, 2024 Document Embedding Information Retrieval
— Unverified 0Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models Nov 1, 2024 Diversity Paraphrase Generation
Code Code Available 0Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior Nov 1, 2024 Natural Language Understanding Question Answering
— Unverified 0Right this way: Can VLMs Guide Us to See More to Answer Questions? Nov 1, 2024 Question Answering Visual Question Answering
Code Code Available 0Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output Nov 1, 2024 Fact Checking Natural Language Inference
— Unverified 0AttackQA: Development and Adoption of a Dataset for Assisting Cybersecurity Operations using Fine-tuned and Open-Source LLMs Nov 1, 2024 Question Answering RAG
— Unverified 0GRS-QA -- Graph Reasoning-Structured Question Answering Dataset Nov 1, 2024 Multi-hop Question Answering Question Answering
— Unverified 0LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models Oct 31, 2024 Fact Checking Medical Question Answering
— Unverified 0Dynamic Uncertainty Ranking: Enhancing In-Context Learning for Long-Tail Knowledge in LLMs Oct 31, 2024 In-Context Learning Memorization
— Unverified 0JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking Oct 31, 2024 Code Completion Open-Domain Question Answering
— Unverified 0SimpsonsVQA: Enhancing Inquiry-Based Learning with a Tailored Dataset Oct 30, 2024 Question Answering Visual Question Answering
— Unverified 0BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference Oct 30, 2024 Computational Efficiency Question Answering
Code Code Available 0Dynamic Strategy Planning for Efficient Question Answering with Large Language Models Oct 30, 2024 Multi-hop Question Answering Question Answering
— Unverified 0MDCure: A Scalable Pipeline for Multi-Document Instruction-Following Oct 30, 2024 Articles Instruction Following
Code Code Available 0Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings Oct 30, 2024 Question Answering Uncertainty Quantification
— Unverified 0Danoliteracy of Generative, Large Language Models Oct 30, 2024 Question Answering
— Unverified 0Symbolic Graph Inference for Compound Scene Understanding Oct 30, 2024 Question Answering Scene Understanding
— Unverified 0Are VLMs Really Blind Oct 29, 2024 Language Modeling Language Modelling
Code Code Available 0ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding Oct 29, 2024 Action Recognition Action Segmentation
Code Code Available 0Enhancing Financial Question Answering with a Multi-Agent Reflection Framework Oct 29, 2024 Question Answering
— Unverified 0AAAR-1.0: Assessing AI's Potential to Assist Research Oct 29, 2024 Question Answering
— Unverified 0