From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning May 21, 2025 Question Answering Reinforcement Learning (RL)
Code Code Available 1Exploring The Visual Feature Space for Multimodal Neural Decoding May 21, 2025 Brain Decoding Question Answering
Code Code Available 0LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval May 21, 2025 Autonomous Driving Question Answering
— Unverified 0TinyDrive: Multiscale Visual Question Answering with Selective Token Routing for Autonomous Driving May 21, 2025 Autonomous Driving Question Answering
— Unverified 0ChartCards: A Chart-Metadata Generation Framework for Multi-Task Chart Understanding May 21, 2025 Chart Question Answering Chart Understanding
Code Code Available 0SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks May 21, 2025 image-classification Image Classification
Code Code Available 0Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization May 21, 2025 Open-Domain Question Answering Question Answering
— Unverified 0Set-LLM: A Permutation-Invariant LLM May 21, 2025 Multiple-choice Question Answering
— Unverified 0Discovering Pathology Rationale and Token Allocation for Efficient Multimodal Pathology Reasoning May 21, 2025 Computational Efficiency Diagnostic
— Unverified 0Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs May 21, 2025 Benchmarking Question Answering
Code Code Available 0Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization May 20, 2025 Hallucination In-Context Learning
— Unverified 0Visual Instruction Bottleneck Tuning May 20, 2025 Hallucination Object Hallucination
— Unverified 0RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding May 20, 2025 Image Captioning Question Answering
Code Code Available 0Beyond Chains: Bridging Large Language Models and Knowledge Bases in Complex Question Answering May 20, 2025 Knowledge Base Question Answering Question Answering
— Unverified 0AutoRev: Automatic Peer Review System for Academic Research Papers May 20, 2025 Question Answering Review Generation
— Unverified 0Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models May 20, 2025 Medical Visual Question Answering Question Answering
— Unverified 0Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks May 20, 2025 Dataset Generation Question Answering
— Unverified 0QA-prompting: Improving Summarization with Large Language Models using Question-Answering May 20, 2025 In-Context Learning Question Answering
Code Code Available 0Texts or Images? A Fine-grained Analysis on the Effectiveness of Input Representations and Models for Table Question Answering May 20, 2025 Question Answering
Code Code Available 0Domain Adaptation of VLM for Soccer Video Understanding May 20, 2025 Action Classification Domain Adaptation
— Unverified 0Memory-Centric Embodied Question Answer May 20, 2025 Embodied Question Answering Large Language Model
— Unverified 0Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method May 20, 2025 Hallucination Object Localization
— Unverified 0Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency May 20, 2025 Language Modeling Language Modelling
— Unverified 0VoQA: Visual-only Question Answering May 20, 2025 Question Answering
Code Code Available 0Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation May 20, 2025 Information Retrieval Knowledge Distillation
— Unverified 0HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing May 20, 2025 Language Modeling Language Modelling
— Unverified 0Debating for Better Reasoning: An Unsupervised Multimodal Approach May 20, 2025 Question Answering Visual Question Answering
— Unverified 0The Hallucination Tax of Reinforcement Finetuning May 20, 2025 Hallucination Math
— Unverified 0YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering May 20, 2025 Question Answering
— Unverified 0Abacus: A Cost-Based Optimizer for Semantic Operator Systems May 20, 2025 Question Answering
— Unverified 0Exploring Jailbreak Attacks on LLMs through Intent Concealment and Diversion May 20, 2025 Question Answering Text Generation
— Unverified 0AMAQA: A Metadata-based QA Dataset for RAG Systems May 19, 2025 Question Answering RAG
— Unverified 0Q^2Forge: Minting Competency Questions and SPARQL Queries for Question-Answering Over Knowledge Graphs May 19, 2025 Knowledge Graphs Question Answering
— Unverified 0Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification May 19, 2025 Code Completion Question Answering
— Unverified 0A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs May 19, 2025 Machine Translation named-entity-recognition
Code Code Available 0Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers May 19, 2025 Instruction Following Question Answering
— Unverified 0AGI-Elo: How Far Are We From Mastering A Task? May 19, 2025 Code Generation Image Classification
Code Code Available 1SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models May 19, 2025 Causal Inference Decision Making
— Unverified 0ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models May 19, 2025 Chart Question Answering Chart Understanding
— Unverified 0ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling May 19, 2025 Graph Generation Knowledge Distillation
— Unverified 0Understanding Complexity in VideoQA via Visual Program Generation May 19, 2025 Code Generation Question Answering
— Unverified 0Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues? May 19, 2025 Logical Reasoning Optical Character Recognition
Code Code Available 1The Hidden Structure -- Improving Legal Document Understanding Through Explicit Text Formatting May 19, 2025 document understanding Optical Character Recognition (OCR)
— Unverified 0KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025 May 19, 2025 Automatic Speech Recognition Instruction Following
— Unverified 0Learnware of Language Models: Specialized Small Language Models Can Do Big May 19, 2025 Privacy Preserving Question Answering
Code Code Available 2Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice May 19, 2025 All Hallucination
— Unverified 0RAGXplain: From Explainable Evaluation to Actionable Guidance of RAG Pipelines May 18, 2025 Decision Making Question Answering
— Unverified 0Disambiguation in Conversational Question Answering in the Era of LLM: A Survey May 18, 2025 Benchmarking Conversational Question Answering
— Unverified 0GMSA: Enhancing Context Compression via Group Merging and Layer Semantic Alignment May 18, 2025 Computational Efficiency Question Answering
— Unverified 0Enhancing Large Language Models with Reward-guided Tree Search for Knowledge Graph Question and Answering May 18, 2025 Graph Question Answering Knowledge Graphs
— Unverified 0