HiddenTables & PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies | Jun 16, 2024 | Question Answering | Unverified | 0
Multi-LLM QA with Embodied Exploration | Jun 16, 2024 | Embodied Question Answering Feature Importance | Unverified | 0
VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It | Jun 15, 2024 | Language Modeling Language Modelling | Unverified | 0
MMLU-SR: A Benchmark for Stress-Testing Reasoning Capability of Large Language Models | Jun 15, 2024 | Mathematical Reasoning MMLU | Unverified | 0
On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models | Jun 15, 2024 | In-Context Learning Question Answering | Unverified | 0
Large Language Models as Interpolated and Extrapolated Event Predictors | Jun 15, 2024 | Knowledge Graphs Question Answering | Code Available | 0
Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model | Jun 15, 2024 | Question Answering Video Understanding | Code Available | 0
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training | Jun 15, 2024 | Domain Adaptation Language Modeling | Code Available | 1
CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language Models | Jun 14, 2024 | Multiple-choice Question Answering | Code Available | 2
Integrating Large Language Models with Graph-based Reasoning for Conversational Question Answering | Jun 14, 2024 | Conversational Question Answering Knowledge Graphs | Unverified | 0
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks | Jun 14, 2024 | named-entity-recognition Named Entity Recognition | Unverified | 0
EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems | Jun 14, 2024 | Question Answering Retrieval | Unverified | 0
VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs | Jun 14, 2024 | Anomaly Detection Benchmarking | Code Available | 1
Efficient Prompting for LLM-based Generative Internet of Things | Jun 14, 2024 | Prompt Engineering Question Answering | Unverified | 0
CHIRON: Rich Character Representations in Long-Form Narratives | Jun 14, 2024 | Form Question Answering | Code Available | 0
Enhancing Question Answering on Charts Through Effective Pre-training Tasks | Jun 14, 2024 | document understanding Optical Character Recognition (OCR) | Unverified | 0
Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal Prediction Language Modeling | Code Available | 1
Datasets for Multilingual Answer Sentence Selection | Jun 14, 2024 | Language Modeling Language Modelling | Unverified | 0
SHMamba: Structured Hyperbolic State Space Model for Audio-Visual Question Answering | Jun 14, 2024 | Audio-visual Question Answering Audio-Visual Question Answering (AVQA) | Unverified | 0
Vision-Language Models Meet Meteorology: Developing Models for Extreme Weather Events Detection with Heatmaps | Jun 14, 2024 | Question Answering Visual Question Answering | Code Available | 1
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack | Jun 14, 2024 | Question Answering Retrieval-augmented Generation | Code Available | 9
Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models | Jun 14, 2024 | Decoder Knowledge Graphs | Unverified | 0
A Training-free Sub-quadratic Cost Transformer Model Serving Framework With Hierarchically Pruned Attention | Jun 14, 2024 | GPU Question Answering | Unverified | 0
IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce | Jun 14, 2024 | Multiple-choice Question Answering | Code Available | 1
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models | Jun 14, 2024 | Hallucination Medical Visual Question Answering | Unverified | 0
A Survey of Video Datasets for Grounded Event Understanding | Jun 14, 2024 | Common Sense Reasoning Event Extraction | Code Available | 0
Multi-Modal Retrieval For Large Language Model Based Speech Recognition | Jun 13, 2024 | Automatic Speech Recognition Language Modeling | Unverified | 0
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs | Jun 13, 2024 | Arithmetic Reasoning Fact Verification | Code Available | 2
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning? | Jun 13, 2024 | Mathematical Reasoning Question Answering | Code Available | 1
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding | Jun 13, 2024 | Instruction Following Language Modeling | Unverified | 0
No perspective, no perception!! Perspective-aware Healthcare Answer Summarization | Jun 13, 2024 | Community Question Answering Question Answering | Code Available | 0
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | Jun 13, 2024 | Dense Video Captioning MVBench | Code Available | 3
Explore the Limits of Omni-modal Pretraining at Scale | Jun 13, 2024 | Language Modeling Language Modelling | Code Available | 2
Yo'LLaVA: Your Personalized Language and Vision Assistant | Jun 13, 2024 | Image Captioning Question Answering | Code Available | 2
Towards Vision-Language Geo-Foundation Model: A Survey | Jun 13, 2024 | Earth Observation Image Captioning | Code Available | 2
Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns | Jun 13, 2024 | Autonomous Driving Question Answering | Unverified | 0
Towards Multilingual Audio-Visual Question Answering | Jun 13, 2024 | Audio-visual Question Answering Audio-Visual Question Answering (AVQA) | Code Available | 0
Towards Reliable Detection of LLM-Generated Texts: A Comprehensive Evaluation Framework with CUDRT | Jun 13, 2024 | Benchmarking LLM-generated Text Detection | Code Available | 1
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs | Jun 13, 2024 | Benchmarking Question Answering | Code Available | 2
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations | Jun 13, 2024 | 3D visual grounding Attribute | Code Available | 4
Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA | Jun 13, 2024 | All EgoSchema | Code Available | 1
Advancing High Resolution Vision-Language Models in Biomedicine | Jun 12, 2024 | Language Modeling Language Modelling | Code Available | 1
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Jun 12, 2024 | Image Generation Language Modeling | Code Available | 5
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams | Jun 12, 2024 | cross-modal alignment Language Modelling | Code Available | 3
Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation | Jun 12, 2024 | Dialogue Generation Diversity | Unverified | 0
Research Trends for the Interplay between Large Language Models and Knowledge Graphs | Jun 12, 2024 | Descriptive Knowledge Graphs | Unverified | 0
Prediction of the Realisation of an Information Need: An EEG Study | Jun 12, 2024 | EEG Information Retrieval | Unverified | 0
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | Jun 12, 2024 | document-image-classification Document Image Classification | Unverified | 0
Question-Answering (QA) Model for a Personalized Learning Assistant for Arabic Language | Jun 11, 2024 | Question Answering | Unverified | 0
Efficient Parallel Multi-Hop Reasoning: A Scalable Approach for Knowledge Graph Analysis | Jun 11, 2024 | Knowledge Base Completion Knowledge Graphs | Unverified | 0