Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering Mar 23, 2025 Benchmarking Chart Question Answering
— Unverified 0SUNAR: Semantic Uncertainty based Neighborhood Aware Retrieval for Complex QA Mar 23, 2025 Question Answering Retrieval
— Unverified 0Relation Extraction with Instance-Adapted Predicate Descriptions Mar 22, 2025 Decoder Question Answering
Code Code Available 04D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding Mar 22, 2025 Benchmarking Object
Code Code Available 0Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models Mar 22, 2025 Question Answering Visual Question Answering
Code Code Available 0MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization Mar 21, 2025 Question Answering
— Unverified 0Does Chain-of-Thought Reasoning Help Mobile GUI Agent? An Empirical Study Mar 21, 2025 Attribute Mathematical Problem-Solving
Code Code Available 0Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models Mar 21, 2025 GSM8K Question Answering
Code Code Available 2A Study into Investigating Temporal Robustness of LLMs Mar 21, 2025 Question Answering World Knowledge
— Unverified 0MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering Mar 21, 2025 Question Answering Time Series
Code Code Available 1PVChat: Personalized Video Chat with One-Shot Learning Mar 21, 2025 One-Shot Learning Question Answering
— Unverified 0Dense Passage Retrieval in Conversational Search Mar 21, 2025 Conversational Search Information Retrieval
Code Code Available 0Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants Mar 20, 2025 Question Answering
— Unverified 0UMIT: Unifying Medical Imaging Tasks via Vision-Language Models Mar 20, 2025 Diagnostic Medical Image Analysis
Code Code Available 0DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering Mar 20, 2025 Contrastive Learning Question Answering
— Unverified 0MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering Mar 20, 2025 Knowledge Graphs Medical Question Answering
— Unverified 0Typed-RAG: Type-aware Multi-Aspect Decomposition for Non-Factoid Question Answering Mar 20, 2025 Question Answering RAG
Code Code Available 0Agentic Keyframe Search for Video Question Answering Mar 20, 2025 EgoSchema Question Answering
Code Code Available 1A Vision Centric Remote Sensing Benchmark Mar 20, 2025 Question Answering Representation Learning
— Unverified 0ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph Mar 20, 2025 Benchmarking Hallucination
— Unverified 0GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions Mar 20, 2025 Question Answering
— Unverified 0Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 Mar 20, 2025 Large Language Model Logical Reasoning
— Unverified 0AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models Mar 20, 2025 Autonomous Driving Multiple-choice
— Unverified 0Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems Mar 19, 2025 counterfactual Decision Making
— Unverified 0Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems Mar 19, 2025 Question Answering RAG
Code Code Available 1EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models Mar 19, 2025 MM-Vet Multimodal Reasoning
— Unverified 0MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration Mar 19, 2025 Long Form Question Answering Question Answering
— Unverified 0UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation Mar 19, 2025 Language Model Evaluation Language Modeling
— Unverified 0KoGNER: A Novel Framework for Knowledge Graph Distillation on Biomedical Named Entity Recognition Mar 19, 2025 Knowledge Distillation Knowledge Graphs
— Unverified 0GraspCorrect: Robotic Grasp Correction via Vision-Language Model-Guided Feedback Mar 19, 2025 Language Modeling Language Modelling
— Unverified 0Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context Mar 19, 2025 Audio captioning Audio Question Answering
Code Code Available 0TruthLens:A Training-Free Paradigm for DeepFake Detection Mar 19, 2025 Binary Classification DeepFake Detection
— Unverified 0Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence Mar 18, 2025 Question Answering Uncertainty Quantification
— Unverified 0EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models Mar 18, 2025 Anomaly Detection Defect Detection
— Unverified 0Synthetic Clarification and Correction Dialogues about Data-Centric Tasks -- A Teacher-Student Approach Mar 18, 2025 Question Answering Table-based Question Answering
— Unverified 0Synthetic Data Generation Using Large Language Models: Advances in Text and Code Mar 18, 2025 Code Translation Prompt Engineering
— Unverified 0MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding Mar 18, 2025 document understanding Question Answering
Code Code Available 3How much do LLMs learn from negative examples? Mar 18, 2025 Multiple-choice Question Answering
Code Code Available 0CARE: A QLoRA-Fine Tuned Multi-Domain Chatbot With Fast Learning On Minimal Hardware Mar 18, 2025 Chatbot Question Answering
— Unverified 0Identifying and Mitigating Position Bias of Multi-image Vision-Language Models Mar 18, 2025 Position Question Answering
— Unverified 0Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding Mar 18, 2025 document understanding Question Answering
Code Code Available 0Where do Large Vision-Language Models Look at when Answering Questions? Mar 18, 2025 Question Answering Visual Question Answering
Code Code Available 2RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning Mar 17, 2025 Answer Generation Multi-hop Question Answering
— Unverified 0HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models Mar 17, 2025 Hallucination Question Answering
Code Code Available 0VITED: Video Temporal Evidence Distillation Mar 17, 2025 Question Answering Video Question Answering
— Unverified 0HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding Mar 17, 2025 Question Answering Scene Understanding
— Unverified 0VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Mar 17, 2025 Grounded Video Question Answering Question Answering
Code Code Available 3Unified Autoregressive Visual Generation and Understanding with Continuous Tokens Mar 17, 2025 Image Captioning Image Generation
— Unverified 0Knowledge-Aware Iterative Retrieval for Multi-Agent Systems Mar 17, 2025 Evidence Selection Large Language Model
— Unverified 0MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAG Mar 17, 2025 Information Retrieval Question Answering
Code Code Available 0