StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models Oct 10, 2024 Question Answering Reinforcement Learning (RL)
Code Code Available 1Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation Oct 10, 2024 Misinformation Question Answering
— Unverified 0Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision Oct 10, 2024 Question Answering Visual Question Answering
— Unverified 0Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering Oct 10, 2024 Hallucination Knowledge Graphs
— Unverified 0Rewriting Conversational Utterances with Instructed Large Language Models Oct 10, 2024 Conversational Search Question Answering
— Unverified 0SAKA: An Intelligent Platform for Semi-automated Knowledge Graph Construction and Application Oct 10, 2024 graph construction Knowledge Base Question Answering
— Unverified 0Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models Oct 10, 2024 Conformal Prediction Language Modeling
— Unverified 0MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models Oct 10, 2024 Multiple-choice Question Answering
— Unverified 0Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System Oct 10, 2024 Large Language Model Question Answering
— Unverified 0TVBench: Redesigning Video-Language Evaluation Oct 10, 2024 Multiple-choice Open-Ended Question Answering
— Unverified 0AuditWen:An Open-Source Large Language Model for Audit Oct 9, 2024 Answer Generation Language Modeling
Code Code Available 1PAR: Prompt-Aware Token Reduction Method for Efficient Large Multimodal Models Oct 9, 2024 Question Answering Retrieval
— Unverified 0SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers Oct 9, 2024 parameter-efficient fine-tuning Question Answering
Code Code Available 0SEGMENT+: Long Text Processing with Short-Context Language Models Oct 9, 2024 Question Answering
— Unverified 0Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA Oct 9, 2024 Information Retrieval Question Answering
— Unverified 0Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy Oct 9, 2024 Few-Shot Learning Question Answering
Code Code Available 0Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning Oct 9, 2024 Hallucination Multiple-choice
Code Code Available 0Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization Oct 9, 2024 Audio captioning Large Language Model
— Unverified 0FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding Oct 9, 2024 Language Modeling Language Modelling
— Unverified 0Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles Oct 9, 2024 Question Answering
Code Code Available 0Uncovering Factor Level Preferences to Improve Human-Model Alignment Oct 9, 2024 Language Modelling Large Language Model
— Unverified 0Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara Oct 9, 2024 Language Modeling Language Modelling
— Unverified 0PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches Oct 8, 2024 GPU GSM8K
— Unverified 0Large Continual Instruction Assistant Oct 8, 2024 Question Answering Semantic Similarity
Code Code Available 2Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training Oct 8, 2024 Graph Question Answering Language Modeling
Code Code Available 0TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data Oct 8, 2024 Change Detection Earth Observation
Code Code Available 2ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition Oct 8, 2024 Action Recognition Multiple-choice
— Unverified 0PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling Oct 8, 2024 document understanding Language Modeling
Code Code Available 2Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning Oct 8, 2024 Image Retrieval Math
— Unverified 0Temporal Reasoning Transfer from Text to Video Oct 8, 2024 Diagnostic MME
— Unverified 0Enhancing Temporal Modeling of Video LLMs via Time Gating Oct 8, 2024 MVBench Question Answering
Code Code Available 0ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments Oct 8, 2024 Decoder Question Answering
Code Code Available 0Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond Oct 8, 2024 Question Answering Visual Question Answering
Code Code Available 0Information Discovery in e-Commerce Oct 8, 2024 Information Retrieval Knowledge Graphs
— Unverified 0Core Tokensets for Data-efficient Sequential Training of Transformers Oct 8, 2024 Image Captioning image-classification
Code Code Available 0Document-level Causal Relation Extraction with Knowledge-guided Binary Question Answering Oct 7, 2024 Question Answering Relation
— Unverified 0Differential Transformer Oct 7, 2024 Hallucination In-Context Learning
Code Code Available 2Mitigating the Risk of Health Inequity Exacerbated by Large Language Models Oct 7, 2024 Bias Detection Medical Question Answering
— Unverified 0ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering Oct 7, 2024 Question Answering Retrieval
Code Code Available 1MM-R^3: On (In-)Consistency of Multi-modal Large Language Models (MLLMs) Oct 7, 2024 Question Answering Visual Question Answering
— Unverified 0ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models Oct 7, 2024 Question Answering Visual Question Answering
Code Code Available 1CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures Oct 7, 2024 Argument Mining Medical Question Answering
Code Code Available 0Precise Model Benchmarking with Only a Few Observations Oct 7, 2024 Benchmarking model
— Unverified 0VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks Oct 7, 2024 Information Retrieval Language Modeling
— Unverified 0FAMMA: A Benchmark for Financial Domain Multilingual Multimodal Question Answering Oct 6, 2024 Asset Management Question Answering
Code Code Available 0MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration Oct 6, 2024 Medical Visual Question Answering Question Answering
Code Code Available 1Optimizing AI Reasoning: A Hamiltonian Dynamics Approach to Multi-Hop Question Answering Oct 6, 2024 Multi-hop Question Answering Question Answering
Code Code Available 0Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations Oct 5, 2024 Language Modeling Language Modelling
— Unverified 0Overview of Factify5WQA: Fact Verification through 5W Question-Answering Oct 5, 2024 Fact Verification Fake News Detection
— Unverified 0Beyond Forecasting: Compositional Time Series Reasoning for End-to-End Task Execution Oct 5, 2024 Anomaly Detection Decision Making
— Unverified 0