Learning Trimodal Relation for AVQA with Missing Modality | Jul 23, 2024 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Code available
Enhancing LLM's Cognition via Structurization | Jul 23, 2024 | Hallucination, Hallucination Evaluation | Code available
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Jul 22, 2024 | Benchmarking, Hallucination | Code available
Evaluating language models as risk scores | Jul 19, 2024 | Multiple-choice, Question Answering | Code available
Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark | Jul 18, 2024 | GPU, Image Retrieval | Code available
TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish | Jul 17, 2024 | Math, Multiple-choice | Code available
Video-Language Alignment via Spatio-Temporal Graph Transformer | Jul 16, 2024 | Contrastive Learning, Question Answering | Code available
MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity | Jul 15, 2024 | Question Answering, RAG | Code available
Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education | Jul 15, 2024 | graph construction, Knowledge Graphs | Code available
Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking | Jul 14, 2024 | Multi-Object Tracking, Object Tracking | Code available
IoT-LM: Large Multisensory Language Models for the Internet of Things | Jul 13, 2024 | Language Modeling, Language Modelling | Code available
CompAct: Compressing Retrieved Documents Actively for Question Answering | Jul 12, 2024 | Multi-hop Question Answering, Question Answering | Code available
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing | Jul 11, 2024 | Common Sense Reasoning, Question Answering | Code available
AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models | Jul 11, 2024 | Language Modelling, Math | Code available
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model | Jul 10, 2024 | Language Modeling, Language Modelling | Code available
3D Vision and Language Pretraining with Large-Scale Synthetic Data | Jul 8, 2024 | Dense Captioning, Diversity | Code available
Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Jul 5, 2024 | General Knowledge, Instruction Following | Code available
Referring Atomic Video Action Recognition | Jul 2, 2024 | Action Localization, Action Recognition | Code available
LogEval: A Comprehensive Benchmark Suite for Large Language Models In Log Analysis | Jul 2, 2024 | Anomaly Detection, Fault Diagnosis | Code available
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning | Jul 1, 2024 | image-classification, Image Classification | Code available
Eliminating Position Bias of Language Models: A Mechanistic Approach | Jul 1, 2024 | Math, object-detection | Code available
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation | Jul 1, 2024 | Image-text Retrieval, Question Answering | Code available
PolygonGNN: Representation Learning for Polygonal Geometries with Heterogeneous Visibility Graph | Jun 30, 2024 | Computational Efficiency, Geographic Question Answering | Code available
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables | Jun 29, 2024 | Fact Verification, Mathematical Reasoning | Code available
STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical Question-Answering | Jun 28, 2024 | Medical Diagnosis, Medical Question Answering | Code available
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models | Jun 28, 2024 | Instruction Following, Question Answering | Code available
MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment | Jun 28, 2024 | Answer Generation, Image Captioning | Code available
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation | Jun 27, 2024 | Question Answering, RAG | Code available
Knowledge graph enhanced retrieval-augmented generation for failure mode and effects analysis | Jun 26, 2024 | Language Modeling, Language Modelling | Code available
CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph | Jun 25, 2024 | Knowledge Graph Completion, Knowledge Graphs | Code available
DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs | Jun 24, 2024 | Question Answering, Retrieval | Code available
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing | Jun 24, 2024 | Question Answering | Code available
HCQA @ Ego4D EgoSchema Challenge 2024 | Jun 22, 2024 | Caption Generation | Code available
UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis | Jun 21, 2024 | Question Answering, RAG | Code available
Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs | Jun 20, 2024 | Knowledge Distillation, Knowledge Graphs | Code available
Timo: Towards Better Temporal Reasoning for Language Models | Jun 20, 2024 | Question Answering | Code available
SuperGLEBer: German Language Understanding Evaluation Benchmark | Jun 20, 2024 | Document Classification, Natural Language Understanding | Code available
LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors | Jun 20, 2024 | 16k, Instruction Following | Code available
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding | Jun 19, 2024 | Question Answering, Spatial Reasoning | Code available
MoreHopQA: More Than Multi-hop Reasoning | Jun 19, 2024 | Question Answering | Code available
DialSim: A Real-Time Simulator for Evaluating Long-Term Multi-Party Dialogue Understanding of Conversational Agents | Jun 19, 2024 | Dialogue Understanding, Question Answering | Code available
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation | Jun 19, 2024 | Question Answering, RAG | Code available
LIVE: Learnable In-Context Vector for Visual Question Answering | Jun 19, 2024 | In-Context Learning, Question Answering | Code available
Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators | Jun 19, 2024 | Fact Verification, Question Answering | Code available
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles | Jun 18, 2024 | Arithmetic Reasoning, Code Generation | Code available
TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation | Jun 17, 2024 | Question Answering, RAG | Code available
Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Jun 17, 2024 | AI and Safety, Question Answering | Code available
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models | Jun 17, 2024 | Benchmarking, Fact Checking | Code available
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jun 17, 2024 | Language Modeling, Language Modelling | Code available
Soft Prompting for Unlearning in Large Language Models | Jun 17, 2024 | In-Context Learning, Machine Unlearning | Code available