SCAR: Efficient Instruction-Tuning for Large Language Models via Style Consistency-Aware Response Ranking Jun 16, 2024 Open-Ended Question Answering Question Answering
Code Code Available 1FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture Jun 16, 2024 Diversity Multiple-choice
Code Code Available 1CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training Jun 15, 2024 Domain Adaptation Language Modeling
Code Code Available 1IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce Jun 14, 2024 Multiple-choice Question Answering
Code Code Available 1VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs Jun 14, 2024 Anomaly Detection Benchmarking
Code Code Available 1Large language model validity via enhanced conformal prediction methods Jun 14, 2024 Conformal Prediction Language Modeling
Code Code Available 1Vision-Language Models Meet Meteorology: Developing Models for Extreme Weather Events Detection with Heatmaps Jun 14, 2024 Question Answering Visual Question Answering
Code Code Available 1Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning? Jun 13, 2024 Mathematical Reasoning Question Answering
Code Code Available 1Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA Jun 13, 2024 All EgoSchema
Code Code Available 1Towards Reliable Detection of LLM-Generated Texts: A Comprehensive Evaluation Framework with CUDRT Jun 13, 2024 Benchmarking LLM-generated Text Detection
Code Code Available 1Advancing High Resolution Vision-Language Models in Biomedicine Jun 12, 2024 Language Modeling Language Modelling
Code Code Available 1Situational Awareness Matters in 3D Vision Language Reasoning Jun 11, 2024 Question Answering
Code Code Available 1SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature Jun 10, 2024 Claim Verification Instruction Following
Code Code Available 1VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text Jun 10, 2024 Language Modeling Language Modelling
Code Code Available 1ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering Jun 7, 2024 Information Retrieval Question Answering
Code Code Available 1LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering Jun 7, 2024 Graph Question Answering Language Modeling
Code Code Available 1Semantically Diverse Language Generation for Uncertainty Estimation in Language Models Jun 6, 2024 Question Answering Text Generation
Code Code Available 1Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs Jun 4, 2024 Question Answering TriviaQA
Code Code Available 1An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation Jun 3, 2024 Answer Generation Question Answering
Code Code Available 1MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning Jun 3, 2024 Diagnostic MedQA
Code Code Available 1Re-ReST: Reflection-Reinforced Self-Training for Language Agents Jun 3, 2024 Code Generation Image Generation
Code Code Available 1Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering Jun 2, 2024 counterfactual Counterfactual Reasoning
Code Code Available 1Encoding and Controlling Global Semantics for Long-form Video Question Answering May 30, 2024 Form Question Answering
Code Code Available 1One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models May 30, 2024 Question Answering RAG
Code Code Available 1Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA May 30, 2024 Diagnostic Medical Diagnosis
Code Code Available 1MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions May 29, 2024 Benchmarking Dialogue Understanding
Code Code Available 1Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs May 29, 2024 Image Retrieval Question Answering
Code Code Available 1THREAD: Thinking Deeper with Recursive Spawning May 27, 2024 Few-Shot Learning Question Answering
Code Code Available 1Map-based Modular Approach for Zero-shot Embodied Question Answering May 26, 2024 Embodied Question Answering Navigate
Code Code Available 1Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space May 22, 2024 Misinformation Question Answering
Code Code Available 1PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery May 22, 2024 Question Answering Visual Question Answering
Code Code Available 1OLAPH: Improving Factuality in Biomedical Long-form Question Answering May 21, 2024 Form Long Form Question Answering
Code Code Available 1Towards Better Question Generation in QA-based Event Extraction May 17, 2024 Event Extraction Question Answering
Code Code Available 1Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees May 16, 2024 Decision Making Informativeness
Code Code Available 1SciQAG: A Framework for Auto-Generated Science Question Answering Dataset with Fine-grained Evaluation May 16, 2024 Open-Ended Question Answering Question Answering
Code Code Available 1UniRAG: Universal Retrieval Augmentation for Large Vision Language Models May 16, 2024 Image Captioning Image Generation
Code Code Available 1TANQ: An open domain dataset of table answered questions May 13, 2024 Math Open-Domain Question Answering
Code Code Available 1MedConceptsQA: Open Source Medical Concepts QA Benchmark May 12, 2024 Few-Shot Learning Question Answering
Code Code Available 1ERAGent: Enhancing Retrieval-Augmented Language Models with Improved Accuracy, Efficiency, and Personalization May 6, 2024 Question Answering RAG
Code Code Available 1Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding May 4, 2024 Open-Domain Question Answering Question Answering
Code Code Available 1Understanding Figurative Meaning through Explainable Visual Entailment May 2, 2024 Question Answering Visual Entailment
Code Code Available 1BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine May 1, 2024 Language Modeling Language Modelling
Code Code Available 1TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domains Apr 30, 2024 Language Modelling Large Language Model
Code Code Available 1ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images Apr 29, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 1Large Language Models in the Clinic: A Comprehensive Benchmark Apr 25, 2024 Decision Making Document Summarization
Code Code Available 1Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks Apr 23, 2024 Mathematical Problem-Solving Question Answering
Code Code Available 1LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models Apr 23, 2024 Logical Reasoning Question Answering
Code Code Available 1Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language Models Apr 23, 2024 Conversational Question Answering Dialogue State Tracking
Code Code Available 1LaPA: Latent Prompt Assist Model For Medical Visual Question Answering Apr 19, 2024 Medical Visual Question Answering Question Answering
Code Code Available 1Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering Apr 18, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 1