FoQA: A Faroese Question-Answering Dataset Feb 11, 2025 Articles Extractive Question-Answering
— Unverified 0Who Taught You That? Tracing Teachers in Model Distillation Feb 10, 2025 Instruction Following POS
— Unverified 0RoToR: Towards More Reliable Responses for Order-Invariant Inputs Feb 10, 2025 Graph Question Answering MMLU
Code Code Available 0Learning Musical Representations for Music Performance Question Answering Feb 10, 2025 Question Answering
Code Code Available 0Generative Distribution Prediction: A Unified Approach to Multimodal Learning Feb 10, 2025 Domain Adaptation Image Captioning
— Unverified 0ClinKD: Cross-Modal Clinical Knowledge Distiller For Multi-Task Medical Images Feb 9, 2025 Clinical Knowledge Medical Visual Question Answering
Code Code Available 0Self-Training Large Language Models for Tool-Use Without Demonstrations Feb 9, 2025 GSM8K Mathematical Reasoning
— Unverified 0LM2: Large Memory Models Feb 9, 2025 Decoder MMLU
Code Code Available 1Performance Analysis of Traditional VQA Models Under Limited Computational Resources Feb 9, 2025 Question Answering Visual Question Answering
— Unverified 0Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding Feb 9, 2025 Image Captioning Image-text Retrieval
Code Code Available 3Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization Feb 8, 2025 GSM8K Math
— Unverified 0Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning Feb 8, 2025 Legal Reasoning Multiple-choice
Code Code Available 0Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment Feb 7, 2025 Diversity Human-Object Interaction Detection
— Unverified 0Uncertainty Quantification for LLMs through Minimum Bayes Risk: Bridging Confidence and Consistency Feb 7, 2025 Abstractive Text Summarization Machine Translation
— Unverified 0Efficient Knowledge Feeding to Language Models: A Novel Integrated Encoder-Decoder Architecture Feb 7, 2025 Decoder In-Context Learning
— Unverified 0Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs Feb 7, 2025 Federated Learning Medical Question Answering
Code Code Available 1ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning Feb 7, 2025 Multiple-choice Question Answering
Code Code Available 0SMI: An Information-Theoretic Metric for Predicting Model Knowledge Solely from Pre-Training Signals Feb 6, 2025 Question Answering
Code Code Available 0PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models? Feb 6, 2025 Question Answering Referring Expression
Code Code Available 1No Images, No Problem: Retaining Knowledge in Continual VQA with Questions-Only Memory Feb 6, 2025 Continual Learning Question Answering
Code Code Available 0TerraQ: Spatiotemporal Question-Answering on Satellite Image Archives Feb 6, 2025 Earth Observation Question Answering
— Unverified 0Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents Feb 6, 2025 Image Captioning Optical Character Recognition
— Unverified 0LLMs to Support a Domain Specific Knowledge Assistant Feb 6, 2025 Chatbot Multiple-choice
— Unverified 0ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization Feb 6, 2025 Language Modeling Language Modelling
Code Code Available 2Efficient Few-Shot Continual Learning in Vision-Language Models Feb 6, 2025 Continual Learning Image Captioning
— Unverified 0Ontology-Guided, Hybrid Prompt Learning for Generalization in Knowledge Graph Question Answering Feb 6, 2025 Graph Question Answering Knowledge Graphs
Code Code Available 0DocMIA: Document-Level Membership Inference Attacks against DocVQA Models Feb 6, 2025 document understanding Inference Attack
Code Code Available 0LLMs can be easily Confused by Instructional Distractions Feb 5, 2025 Bias Detection Code Generation
— Unverified 0SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions Feb 5, 2025 Quantization Question Answering
— Unverified 0Spatial-RAG: Spatial Retrieval Augmented Generation for Real-World Geospatial Reasoning Questions Feb 4, 2025 Question Answering RAG
— Unverified 0Exploring Spatial Language Grounding Through Referring Expressions Feb 4, 2025 Image Captioning Negation
— Unverified 0TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes Feb 4, 2025 Autonomous Driving Multiple-choice
Code Code Available 1Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation Feb 4, 2025 Benchmarking Information Retrieval
Code Code Available 4AmaSQuAD: A Benchmark for Amharic Extractive Question Answering Feb 4, 2025 Extractive Question-Answering Question Answering
— Unverified 0Memento No More: Coaching AI Agents to Master Multiple Tasks via Hints Internalization Feb 3, 2025 Information Retrieval Question Answering
— Unverified 0ChartCitor: Multi-Agent Framework for Fine-Grained Chart Visual Attribution Feb 3, 2025 Chart Question Answering Question Answering
— Unverified 0Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models Feb 3, 2025 Adversarial Robustness Image Captioning
Code Code Available 1Topic-FlipRAG: Topic-Orientated Adversarial Opinion Manipulation Attacks to Retrieval-Augmented Generation Models Feb 3, 2025 Question Answering RAG
— Unverified 0VLM-Assisted Continual learning for Visual Question Answering in Self-Driving Feb 2, 2025 Autonomous Driving Continual Learning
— Unverified 0Hypo3D: Exploring Hypothetical Reasoning in 3D Feb 2, 2025 Question Answering Visual Question Answering
— Unverified 0Multilingual State Space Models for Structured Question Answering in Indic Languages Feb 1, 2025 Answer Generation Diversity
Code Code Available 0Memory-Efficient Fine-Tuning of Transformers via Token Selection Jan 31, 2025 Few-Shot Learning Question Answering
Code Code Available 0KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search Jan 31, 2025 Heuristic Search Knowledge Base Question Answering
Code Code Available 1-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation Jan 31, 2025 Question Answering Video Question Answering
Code Code Available 1CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering Jan 30, 2025 General Knowledge Language Modeling
— Unverified 0o3-mini vs DeepSeek-R1: Which One is Safer? Jan 30, 2025 Code Generation Program Repair
Code Code Available 1General Embedding vs. Task-Specific Embedding: A Comparative Approach to Enhancing NLP Performance Jan 30, 2025 Multi-Task Learning
— Unverified 0InnerThoughts: Disentangling Representations and Predictions in Large Language Models Jan 29, 2025 Multiple-choice Position
— Unverified 0Anatomy Might Be All You Need: Forecasting What to Do During Surgery Jan 29, 2025 All Anatomy
— Unverified 0Cross-Language Approach for Quranic QA Jan 29, 2025 Machine Translation Question Answering
— Unverified 0