WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis Jun 10, 2025 Language Modeling Language Modelling
— Unverified 0mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks Jun 10, 2025 Language Identification Question Answering
— Unverified 0Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning Jun 9, 2025 Future prediction Question Answering
Code Code Available 0MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs Jun 9, 2025 Hallucination Model Editing
— Unverified 0Aligning Text, Images, and 3D Structure Token-by-Token Jun 9, 2025 3D Object Recognition Instruction Following
— Unverified 0HAIBU-ReMUD: Reasoning Multimodal Ultrasound Dataset and Model Bridging to General Specific Domains Jun 9, 2025 Diagnostic Question Answering
Code Code Available 0Federated In-Context Learning: Iterative Refinement for Improved Answer Quality Jun 9, 2025 In-Context Learning Question Answering
— Unverified 0Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph Jun 9, 2025 Large Language Model Question Answering
Code Code Available 0ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving Jun 9, 2025 Autonomous Driving Imitation Learning
— Unverified 0LEANN: A Low-Storage Vector Index Jun 9, 2025 Question Answering RAG
— Unverified 0Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning Jun 8, 2025 Attribute Hallucination
— Unverified 0Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning Jun 8, 2025 Offline RL Question Answering
— Unverified 0Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification Jun 8, 2025 Question Answering Visual Question Answering
Code Code Available 1Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Jun 8, 2025 Medical Report Generation Question Answering
— Unverified 0The State-of-the-Art in Lifelog Retrieval: A Review of Progress at the ACM Lifelog Search Challenge Workshop 2022-24 Jun 7, 2025 Question Answering Retrieval
— Unverified 0Meta-Adaptive Prompt Distillation for Few-Shot Visual Question Answering Jun 7, 2025 In-Context Learning Meta-Learning
— Unverified 0MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning Jun 6, 2025 Question Answering Table-based Question Answering
— Unverified 0BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions Jun 6, 2025 Information Retrieval Question Answering
— Unverified 0Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques Jun 6, 2025 Benchmarking Model Selection
— Unverified 0DynamicMind: A Tri-Mode Thinking System for Large Language Models Jun 6, 2025 Computational Efficiency Prompt Engineering
— Unverified 0TextVidBench: A Benchmark for Long Video Scene Text Understanding Jun 5, 2025 Prompt Engineering Question Answering
— Unverified 0Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs Jun 5, 2025 cross-modal alignment Dense Captioning
— Unverified 0Ontology-based knowledge representation for bone disease diagnosis: a foundation for safe and sustainable medical artificial intelligence systems Jun 5, 2025 Diagnostic Multimodal Deep Learning
— Unverified 0ECoRAG: Evidentiality-guided Compression for Long Context RAG Jun 5, 2025 Answer Generation Open-Domain Question Answering
Code Code Available 1Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning Jun 5, 2025 Question Answering RAG
Code Code Available 0Multiple-Choice Question Generation Using Large Language Models: Methodology and Educator Insights Jun 5, 2025 Multiple-choice Question Answering
— Unverified 0Plugging Schema Graph into Multi-Table QA: A Human-Guided Framework for Reducing LLM Reliance Jun 4, 2025 Question Answering Semantic Similarity
— Unverified 0Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model Jun 4, 2025 Language Modeling Language Modelling
— Unverified 0ReXVQA: A Large-scale Visual Question Answering Benchmark for Generalist Chest X-ray Understanding Jun 4, 2025 Negation Negation Detection
— Unverified 0A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems Jun 3, 2025 Question Answering
— Unverified 0OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation Jun 3, 2025 Question Answering
Code Code Available 1FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes Jun 3, 2025 Benchmarking Feature Engineering
Code Code Available 0EgoVLM: Policy Optimization for Egocentric Video Understanding Jun 3, 2025 EgoSchema Question Answering
Code Code Available 0Hanfu-Bench: A Multimodal Benchmark on Cross-Temporal Cultural Understanding and Transcreation Jun 2, 2025 Multiple-choice Question Answering
— Unverified 0iQUEST: An Iterative Question-Guided Framework for Knowledge Base Question Answering Jun 2, 2025 Graph Neural Network Knowledge Base Question Answering
— Unverified 0ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists Jun 2, 2025 Benchmarking Form
— Unverified 0Learning Sparsity for Effective and Efficient Music Performance Question Answering Jun 2, 2025 Audio-visual Question Answering Question Answering
— Unverified 0Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning Jun 2, 2025 Fact Verification Language Modeling
Code Code Available 2Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws Jun 2, 2025 Language Modeling Language Modelling
Code Code Available 0anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding Jun 1, 2025 Open-Ended Question Answering Question Answering
— Unverified 0Fast or Slow? Integrating Fast Intuition and Deliberate Thinking for Enhancing Visual Question Answering Jun 1, 2025 All MME
— Unverified 0Dynamic Chunking and Selection for Reading Comprehension of Ultra-Long Context in Large Language Models Jun 1, 2025 Chunking Multi-hop Question Answering
Code Code Available 0A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy Jun 1, 2025 Decision Making Multi-hop Question Answering
— Unverified 0Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering Tasks Jun 1, 2025 In-Context Learning Negation
Code Code Available 0PAKTON: A Multi-Agent Framework for Question Answering in Long Legal Agreements May 31, 2025 Privacy Preserving Question Answering
Code Code Available 1MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility May 30, 2025 Decision Making Medical Diagnosis
— Unverified 0LaMP-QA: A Benchmark for Personalized Long-form Question Answering May 30, 2025 Answer Generation Form
— Unverified 0ClinBench-HPB: A Clinical Benchmark for Evaluating LLMs in Hepato-Pancreato-Biliary Diseases May 30, 2025 Medical Question Answering Multiple-choice
— Unverified 0VUDG: A Dataset for Video Understanding Domain Generalization May 30, 2025 Domain Generalization Multiple-choice
— Unverified 0Pangu DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning May 30, 2025 Question Answering Reinforcement Learning (RL)
— Unverified 0