CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models May 22, 2024 Benchmarking Hallucination
— Unverified 0Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space May 22, 2024 Misinformation Question Answering
Code Code Available 1FiDeLiS: Faithful Reasoning in Large Language Model for Knowledge Graph Question Answering May 22, 2024 Common Sense Reasoning Graph Question Answering
— Unverified 0Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration May 21, 2024 Question Answering Video Question Answering
— Unverified 0Efficient and Interpretable Information Retrieval for Product Question Answering with Heterogeneous Data May 21, 2024 Contrastive Learning Information Retrieval
Code Code Available 0MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare May 21, 2024 Anatomy Epidemiology
— Unverified 0Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs May 21, 2024 Arithmetic Reasoning Decision Making
— Unverified 0ProtT3: Protein-to-Text Generation for Text-based Protein Understanding May 21, 2024 Property Prediction Question Answering
Code Code Available 2Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering May 21, 2024 Diversity Information Retrieval
Code Code Available 0OLAPH: Improving Factuality in Biomedical Long-form Question Answering May 21, 2024 Form Long Form Question Answering
Code Code Available 1MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering May 20, 2024 Benchmarking Question Answering
Code Code Available 2Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue! May 20, 2024 Knowledge Graphs Question Answering
— Unverified 0KG-RAG: Bridging the Gap Between Knowledge and Creativity May 20, 2024 Graph Question Answering Information Retrieval
— Unverified 0Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning May 19, 2024 Multimodal Reasoning Question Answering
— Unverified 0Case-Based Reasoning Approach for Solving Financial Question Answering May 18, 2024 Question Answering
— Unverified 0MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing May 18, 2024 Question Answering Text Generation
— Unverified 0EyeFound: A Multimodal Generalist Foundation Model for Ophthalmic Imaging May 18, 2024 Question Answering Visual Question Answering
— Unverified 0VideoQA-SC: Adaptive Semantic Communication for Video Question Answering May 17, 2024 Question Answering Semantic Communication
— Unverified 0Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark May 17, 2024 Document Classification Language Modeling
— Unverified 0Efficient Multimodal Large Language Models: A Survey May 17, 2024 Edge-computing Question Answering
Code Code Available 3LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions May 17, 2024 Multi-agent Reinforcement Learning Question Answering
— Unverified 0StackOverflowVQA: Stack Overflow Visual Question Answering Dataset May 17, 2024 Question Answering Sentence
— Unverified 0Towards Better Question Generation in QA-based Event Extraction May 17, 2024 Event Extraction Question Answering
Code Code Available 1KnowledgeHub: An end-to-end Tool for Assisted Scientific Discovery May 16, 2024 named-entity-recognition Named Entity Recognition
— Unverified 0Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality May 16, 2024 Mixed Reality Question Answering
— Unverified 0Grounded 3D-LLM with Referent Tokens May 16, 2024 Dense Captioning Diversity
Code Code Available 2AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning May 16, 2024 Multiple-choice Question Answering
— Unverified 0Chameleon: Mixed-Modal Early-Fusion Foundation Models May 16, 2024 Image Captioning Image Generation
Code Code Available 7UniRAG: Universal Retrieval Augmentation for Large Vision Language Models May 16, 2024 Image Captioning Image Generation
Code Code Available 1SciQAG: A Framework for Auto-Generated Science Question Answering Dataset with Fine-grained Evaluation May 16, 2024 Open-Ended Question Answering Question Answering
Code Code Available 1When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models May 16, 2024 In-Context Learning Question Answering
Code Code Available 7FinTextQA: A Dataset for Long-form Financial Question Answering May 16, 2024 Diversity Form
— Unverified 0Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees May 16, 2024 Decision Making Informativeness
Code Code Available 1IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues May 15, 2024 Information Retrieval Question Answering
— Unverified 0Prompting-based Synthetic Data Generation for Few-Shot Question Answering May 15, 2024 Question Answering Synthetic Data Generation
Code Code Available 0STAR: A Benchmark for Situated Reasoning in Real-World Videos May 15, 2024 Diagnostic Logical Reasoning
— Unverified 0CinePile: A Long Video Question Answering Dataset and Benchmark May 14, 2024 Form Human-Object Interaction Detection
— Unverified 0SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models May 14, 2024 Adversarial Robustness Instruction Following
— Unverified 0UCCIX: Irish-eXcellence Large Language Model May 13, 2024 Benchmarking Language Modeling
— Unverified 0MetaReflection: Learning Instructions for Language Agents using Past Reflections May 13, 2024 Logical Reasoning Question Answering
— Unverified 0AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments May 13, 2024 Decision Making Diagnostic
— Unverified 0TANQ: An open domain dataset of table answered questions May 13, 2024 Math Open-Domain Question Answering
Code Code Available 1Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness May 13, 2024 Benchmarking counterfactual
— Unverified 0EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning May 13, 2024 Articles
Code Code Available 0From Questions to Insightful Answers: Building an Informed Chatbot for University Resources May 13, 2024 Chatbot Language Modeling
— Unverified 0KET-QA: A Dataset for Knowledge Enhanced Table Question Answering May 13, 2024 Question Answering
— Unverified 0CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering May 13, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
— Unverified 0FreeVA: Offline MLLM as Training-Free Video Assistant May 13, 2024 Fairness Question Answering
Code Code Available 2MedConceptsQA: Open Source Medical Concepts QA Benchmark May 12, 2024 Few-Shot Learning Question Answering
Code Code Available 1Realizing Visual Question Answering for Education: GPT-4V as a Multimodal AI May 12, 2024 Question Answering Visual Question Answering
— Unverified 0