| NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples | Oct 18, 2024 | AttributeQuestion Answering | —Unverified | 0 | 0 |
| Detection-based Intermediate Supervision for Visual Question Answering | Dec 26, 2023 | cross-modal alignmentLogical Reasoning | —Unverified | 0 | 0 |
| Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey | Nov 26, 2024 | Natural Language UnderstandingQuestion Answering | —Unverified | 0 | 0 |
| Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving | May 9, 2025 | Autonomous DrivingBackdoor Attack | —Unverified | 0 | 0 |
| Detecting and Evaluating Medical Hallucinations in Large Vision Language Models | Jun 14, 2024 | HallucinationMedical Visual Question Answering | —Unverified | 0 | 0 |
| Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models | Oct 9, 2023 | HallucinationObject | —Unverified | 0 | 0 |
| Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability | Apr 20, 2025 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| NegVQA: Can Vision Language Models Understand Negation? | May 28, 2025 | NegationQuestion Answering | —Unverified | 0 | 0 |
| Aligning MAGMA by Few-Shot Learning and Finetuning | Oct 18, 2022 | Few-Shot LearningImage Captioning | —Unverified | 0 | 0 |
| Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection | Mar 31, 2016 | Caption GenerationClassification | —Unverified | 0 | 0 |
| Neural Memory Plasticity for Anomaly Detection | Oct 12, 2019 | Anomaly DetectionEEG | —Unverified | 0 | 0 |
| AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability | May 23, 2024 | cross-modal alignmentLanguage Modelling | —Unverified | 0 | 0 |
| Neural Self Talk: Image Understanding via Continuous Questioning and Answering | Dec 10, 2015 | Question AnsweringQuestion Generation | —Unverified | 0 | 0 |
| VISREAS: Complex Visual Reasoning with Unanswerable Questions | Feb 23, 2024 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA | Nov 6, 2024 | Federated LearningLanguage Modelling | —Unverified | 0 | 0 |
| Neuro-Symbolic Spatio-Temporal Reasoning | Nov 28, 2022 | AI AgentImage Segmentation | —Unverified | 0 | 0 |
| Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning" | Jun 20, 2020 | Graph GenerationQuestion Answering | —Unverified | 0 | 0 |
| Neuro-Symbolic VQA: A review from the perspective of AGI desiderata | Apr 13, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training | Sep 15, 2024 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| New Ideas and Trends in Deep Multimodal Content Understanding: A Review | Oct 16, 2020 | Cross-Modal RetrievalDeep Learning | —Unverified | 0 | 0 |
| NEWSKVQA: Knowledge-Aware News Video Question Answering | Feb 8, 2022 | Common Sense ReasoningManagement | —Unverified | 0 | 0 |
| NMT-Keras: a Very Flexible Toolkit with a Focus on Interactive NMT and Online Learning | Jul 9, 2018 | General ClassificationMachine Translation | —Unverified | 0 | 0 |
| VisScience: An Extensive Benchmark for Evaluating K12 Educational Multi-modal Scientific Reasoning | Sep 10, 2024 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| Visual7W: Grounded Question Answering in Images | Nov 11, 2015 | Multiple-choiceMultiple Choice Question Answering (MCQA) | —Unverified | 0 | 0 |
| Non-monotonic Logical Reasoning Guiding Deep Learning for Explainable Visual Question Answering | Sep 23, 2019 | Inductive LearningLogical Reasoning | —Unverified | 0 | 0 |