LOVA3: Learning to Visual Question Answering, Asking and Assessment May 23, 2024 Question Answering Visual Question Answering
Code Code Available 2AGILE: A Novel Reinforcement Learning Framework of LLM Agents May 23, 2024 Question Answering reinforcement-learning
Code Code Available 2Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation May 22, 2024 Informativeness Language Modeling
Code Code Available 2ProtT3: Protein-to-Text Generation for Text-based Protein Understanding May 21, 2024 Property Prediction Question Answering
Code Code Available 2MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering May 20, 2024 Benchmarking Question Answering
Code Code Available 2Grounded 3D-LLM with Referent Tokens May 16, 2024 Dense Captioning Diversity
Code Code Available 2FreeVA: Offline MLLM as Training-Free Video Assistant May 13, 2024 Fairness Question Answering
Code Code Available 2HMT: Hierarchical Memory Transformer for Long Context Language Processing May 9, 2024 Language Modeling Language Modelling
Code Code Available 2DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature May 8, 2024 Question Answering
Code Code Available 2Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records May 4, 2024 Information Retrieval Question Answering
Code Code Available 2IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Apr 25, 2024 Cross-Lingual Question Answering Diversity
Code Code Available 2Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering Apr 23, 2024 Graph Question Answering Hallucination
Code Code Available 2GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist Collaboration Apr 23, 2024 Collaborative Inference In-Context Learning
Code Code Available 2FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models Apr 20, 2024 Binary Classification Fake Image Detection
Code Code Available 2Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models Apr 16, 2024 image-classification Image Classification
Code Code Available 2LLoCO: Learning Long Contexts Offline Apr 11, 2024 4k In-Context Learning
Code Code Available 2Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation Apr 10, 2024 Question Answering RAG
Code Code Available 2Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks Apr 9, 2024 Answer Selection Long-Context Understanding
Code Code Available 2LongVLM: Efficient Long Video Understanding via Large Language Models Apr 4, 2024 Question Answering Video Question Answering
Code Code Available 2Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward Apr 1, 2024 Instruction Following Language Modeling
Code Code Available 2How Much are Large Language Models Contaminated? A Comprehensive Survey and the LLMSanitize Library Mar 31, 2024 Question Answering
Code Code Available 2Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Mar 29, 2024 Instruction Following Language Modelling
Code Code Available 2VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis Mar 29, 2024 Hallucination Image Captioning
Code Code Available 2Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models Mar 29, 2024 Question Answering Visual Question Answering
Code Code Available 2Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving Mar 28, 2024 Autonomous Driving Language Modeling
Code Code Available 2Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction Mar 27, 2024 Image Captioning Language Modeling
Code Code Available 2An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLM Mar 27, 2024 Language Modeling Language Modelling
Code Code Available 2OmniVid: A Generative Framework for Universal Video Understanding Mar 26, 2024 Action Recognition Decoder
Code Code Available 2Visually Guided Generative Text-Layout Pre-training for Document Intelligence Mar 25, 2024 Document Classification document understanding
Code Code Available 2LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Mar 22, 2024 Language Modelling Large Language Model
Code Code Available 2Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers Mar 22, 2024 Information Retrieval
Code Code Available 2VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning Mar 19, 2024 Benchmarking Image Captioning
Code Code Available 2RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems Mar 14, 2024 Decoder Question Answering
Code Code Available 2Beyond Text: Frozen Large Language Models in Visual Signal Comprehension Mar 12, 2024 Deblurring Decoder
Code Code Available 2ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis Mar 11, 2024 Question Answering
Code Code Available 2KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques Mar 9, 2024 Knowledge Graphs Long Form Question Answering
Code Code Available 2Debiasing Multimodal Large Language Models Mar 8, 2024 Fairness Question Answering
Code Code Available 2CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios Mar 7, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 2QAQ: Quality Adaptive Quantization for LLM KV Cache Mar 7, 2024 Quantization Question Answering
Code Code Available 2Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning Mar 6, 2024 Multimodal Reasoning Question Answering
Code Code Available 2Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation Feb 28, 2024 Code Generation In-Context Learning
Code Code Available 2The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA Feb 28, 2024 Natural Language Understanding Question Answering
Code Code Available 2Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey Feb 27, 2024 Language Modeling Language Modelling
Code Code Available 2BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra Feb 27, 2024 Question Answering
Code Code Available 2TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space Feb 27, 2024 Contrastive Learning Hallucination
Code Code Available 2RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering Feb 26, 2024 Form Open-Domain Question Answering
Code Code Available 2Data Science with LLMs and Interpretable Models Feb 22, 2024 Additive models Question Answering
Code Code Available 2ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents Feb 21, 2024 Active Learning Position
Code Code Available 2FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models Feb 21, 2024 Question Answering
Code Code Available 2Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs Feb 19, 2024 Question Answering
Code Code Available 2