WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines | Oct 16, 2024 | Question Answering, Visual Question Answering | Code Available | 1
RuleRAG: Rule-guided retrieval-augmented generation with language models for question answering | Oct 15, 2024 | In-Context Learning, Instruction Following | Code Available | 1
Telco-DPR: A Hybrid Dataset for Evaluating Retrieval Models of 3GPP Technical Specifications | Oct 15, 2024 | Question Answering, RAG | Unverified | 0
LargePiG: Your Large Language Model is Secretly a Pointer Generator | Oct 15, 2024 | Hallucination, Language Modeling | Unverified | 0
OMCAT: Omni Context Aware Transformer | Oct 15, 2024 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Unverified | 0
Unleashing the Power of LLMs as Multi-Modal Encoders for Text and Graph-Structured Data | Oct 15, 2024 | Contrastive Learning, Data Ablation | Unverified | 0
Causal Reasoning in Large Language Models: A Knowledge Graph Approach | Oct 15, 2024 | Question Answering | Unverified | 0
Empowering Users in Digital Privacy Management through Interactive LLM-Based Agents | Oct 15, 2024 | Management, Question Answering | Unverified | 0
AGENTiGraph: An Interactive Knowledge Graph Platform for LLM-based Chatbots Utilizing Private Data | Oct 15, 2024 | Hallucination, Knowledge Graphs | Unverified | 0
SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments | Oct 15, 2024 | Language Modeling, Language Modelling | Unverified | 0
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs | Oct 15, 2024 | Image Description, Multiple-choice | Code Available | 0
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI | Oct 15, 2024 | Question Answering, Video Question Answering | Code Available | 2
FLARE: Faithful Logic-Aided Reasoning and Exploration | Oct 14, 2024 | Code Generation, Question Answering | Unverified | 0
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Oct 14, 2024 | Benchmarking, Large Language Model | Code Available | 3
QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios | Oct 14, 2024 | Question Answering | Code Available | 0
SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity Recognition | Oct 14, 2024 | Activity Recognition, Descriptive | Code Available | 2
KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8k, GPU | Code Available | 5
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs | Oct 14, 2024 | Computational Efficiency, Question Answering | Code Available | 2
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations | Oct 14, 2024 | Answer Generation, Question Answering | Code Available | 4
Eliminating the Language Bias for Visual Question Answering with fine-grained Causal Intervention | Oct 14, 2024 | Contrastive Learning, counterfactual | Unverified | 0
Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? | Oct 14, 2024 | In-Context Learning, Question Answering | Code Available | 0
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Oct 14, 2024 | 2k, Benchmarking | Code Available | 1
Towards Foundation Models for 3D Vision: How Close Are We? | Oct 14, 2024 | Question Answering, Visual Question Answering | Code Available | 1
BanglaQuAD: A Bengali Open-domain Question Answering Dataset | Oct 14, 2024 | Articles, Open-Domain Question Answering | Unverified | 0
ChartKG: A Knowledge-Graph-Based Representation for Chart Images | Oct 13, 2024 | Chart Question Answering, Knowledge Graph Completion | Unverified | 0
Surgical-LLaVA: Toward Surgical Scenario Understanding via Large Language and Vision Models | Oct 13, 2024 | Instruction Following, Question Answering | Unverified | 0
A Step Towards Mixture of Grader: Statistical Analysis of Existing Automatic Evaluation Metrics | Oct 13, 2024 | Question Answering | Unverified | 0
MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models | Oct 13, 2024 | Cross-Modal Retrieval, Question Answering | Unverified | 0
LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering | Oct 13, 2024 | Answer Generation, Language Modeling | Unverified | 0
Enhanced Electronic Health Records Text Summarization Using Large Language Models | Oct 12, 2024 | Question Answering, Reading Comprehension | Unverified | 0
Quebec Automobile Insurance Question-Answering With Retrieval-Augmented Generation | Oct 12, 2024 | Question Answering, RAG | Code Available | 0
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models | Oct 12, 2024 | Question Answering, RAG | Code Available | 0
Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets | Oct 12, 2024 | Knowledge Distillation, Question Answering | Code Available | 0
Multi-granularity Contrastive Cross-modal Collaborative Generation for End-to-End Long-term Video Question Answering | Oct 12, 2024 | Answer Generation, Blocking | Code Available | 1
Zero-shot Commonsense Reasoning over Machine Imagination | Oct 12, 2024 | Question Answering, Visual Question Answering | Code Available | 0
Skipping Computations in Multimodal LLMs | Oct 12, 2024 | Question Answering, Visual Question Answering | Code Available | 1
Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering | Oct 12, 2024 | Question Answering, Video Question Answering | Unverified | 0
Optimized Biomedical Question-Answering Services with LLM and Multi-BERT Integration | Oct 11, 2024 | Decision Making, Question Answering | Unverified | 0
Generation with Dynamic Vocabulary | Oct 11, 2024 | Language Modeling, Language Modelling | Code Available | 0
Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation | Oct 11, 2024 | Open-Domain Question Answering, Question Answering | Code Available | 2
SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models | Oct 11, 2024 | Few-Shot Learning, Multiple-choice | Code Available | 1
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping | Oct 11, 2024 | MME, Question Answering | Code Available | 1
Retrieving Contextual Information for Long-Form Question Answering using Weak Supervision | Oct 11, 2024 | Form, Long Form Question Answering | Unverified | 0
Measuring the Groundedness of Legal Question-Answering Systems | Oct 11, 2024 | Natural Language Inference, Question Answering | Unverified | 0
ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation | Oct 11, 2024 | Diagnostic, Language Modeling | Unverified | 0
MedMobile: A mobile-sized language model with expert-level clinical capabilities | Oct 11, 2024 | Language Modeling, Language Modelling | Code Available | 0
Accurate and Regret-aware Numerical Problem Solver for Tabular Question Answering | Oct 10, 2024 | Question Answering, Semantic Parsing | Code Available | 0
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning | Oct 10, 2024 | Natural Language Understanding, parameter-efficient fine-tuning | Code Available | 0
StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models | Oct 10, 2024 | Question Answering, Reinforcement Learning (RL) | Code Available | 1
Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation | Oct 10, 2024 | Misinformation, Question Answering | Unverified | 0