New Ideas and Trends in Deep Multimodal Content Understanding: A Review Oct 16, 2020 Cross-Modal Retrieval Deep Learning
— Unverified 0New Encoder Learning for Captioning Heavy Rain Images via Semantic Visual Feature Matching May 28, 2021 Decoder Image Captioning
— Unverified 0NICE: CVPR 2023 Challenge on Zero-shot Image Captioning Sep 5, 2023 Fairness Image Captioning
— Unverified 0NLIP: Noise-robust Language-Image Pre-training Dec 14, 2022 Image Captioning Image-text Retrieval
— Unverified 0NLPHut’s Participation at WAT2021 Aug 1, 2021 Caption Generation Image Captioning
— Unverified 0NNEval: Neural Network based Evaluation Metric for Image Captioning Sep 1, 2018 Image Captioning Sentence
— Unverified 0No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning Sep 4, 2024 Image Captioning Retrieval
— Unverified 0Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning May 10, 2020 Image Captioning Machine Translation
— Unverified 0Nonparametric Method for Data-driven Image Captioning Jun 1, 2014 Density Estimation Image Captioning
— Unverified 0Normalized and Geometry-Aware Self-Attention Network for Image Captioning Mar 19, 2020 Image Captioning Machine Translation
— Unverified 0NOVA: A Benchmark for Anomaly Localization and Clinical Reasoning in Brain MRI May 20, 2025 Anomaly Localization Benchmarking
— Unverified 0O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning Aug 5, 2021 Attribute Caption Generation
— Unverified 0OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts Jul 22, 2017 Caption Generation Descriptive
— Unverified 0Object Counts! Bringing Explicit Detections Back into Image Captioning Apr 23, 2018 Image Captioning Language Modeling
— Unverified 0Object-oriented backdoor attack against image captioning Jan 5, 2024 Backdoor Attack Image Captioning
— Unverified 0ODIANLP’s Participation in WAT2020 Dec 1, 2020 Hindi Image Captioning Image Captioning
— Unverified 0Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation Jun 21, 2020 Image Captioning Reinforcement Learning (RL)
— Unverified 0Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts Dec 1, 2023 Chart Question Answering Document AI
— Unverified 0OmniVL:One Foundation Model for Image-Language and Video-Language Tasks Sep 15, 2022 Action Classification Action Recognition
— Unverified 0On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization May 24, 2022 Descriptive Image Captioning
— Unverified 0On Distinctive Image Captioning via Comparing and Reweighting Apr 8, 2022 Image Captioning Retrieval
— Unverified 0One Model To Learn Them All Jun 16, 2017 All Image Captioning
— Unverified 0On Hallucination and Predictive Uncertainty in Conditional Language Generation Mar 28, 2021 Data-to-Text Generation Hallucination
— Unverified 0On Human Intellect and Machine Failures: Troubleshooting Integrative Machine Learning Systems Nov 24, 2016 BIG-bench Machine Learning Image Captioning
— Unverified 0On Randomized Classification Layers and Their Implications in Natural Language Generation Jun 1, 2021 Image Captioning Language Modeling
— Unverified 0On Speculative Decoding for Multimodal Large Language Models Apr 13, 2024 Image Captioning Language Modeling
— Unverified 0On the Effects of Video Grounding on Language Models Oct 1, 2022 Image Captioning Question Answering
— Unverified 0On the Performance of Multimodal Language Models Oct 4, 2023 Benchmarking Binary Classification
— Unverified 0On the Robustness of Large Multimodal Models Against Image Adversarial Attacks Dec 6, 2023 Image Captioning image-classification
— Unverified 0On the Role of Scene Graphs in Image Captioning Nov 1, 2019 Descriptive Image Captioning
— Unverified 0On Vision Features in Multimodal Machine Translation Nov 16, 2021 Image Captioning Machine Translation
— Unverified 0OPCap:Object-aware Prompting Captioning Nov 27, 2024 Attribute Decoder
— Unverified 0OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation May 30, 2024 3D Instance Segmentation 3D Open-Vocabulary Instance Segmentation
— Unverified 0Open-Vocabulary Object Detection using Pseudo Caption Labels Mar 23, 2023 Image Captioning Knowledge Distillation
— Unverified 0OptiBox: Breaking the Limits of Proposals for Visual Grounding Nov 29, 2019 Image Captioning Visual Grounding
— Unverified 0Optimizing Vision-Language Interactions Through Decoder-Only Models Dec 14, 2024 Decoder Image Captioning
— Unverified 0Order-Free RNN with Visual Attention for Multi-Label Classification Jul 18, 2017 Classification General Classification
— Unverified 0ORD: Object Relationship Discovery for Visual Dialogue Generation Jun 15, 2020 Dialogue Generation Graph Attention
— Unverified 0OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst Jun 14, 2024 Image Captioning Language Modeling
— Unverified 0OSU Multimodal Machine Translation System Report Oct 7, 2017 Image Captioning Machine Translation
— Unverified 0Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track Dec 15, 2024 Image Captioning Medical Question Answering
— Unverified 0OxfordTVG-HIC: Can Machine Make Humorous Captions from Images? Jul 21, 2023 Diversity Image Captioning
— Unverified 0PaLI: A Jointly-Scaled Multilingual Language-Image Model Sep 14, 2022 Decoder Few-Shot Image Classification
— Unverified 0Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation Apr 6, 2024 Image Captioning Instance Segmentation
— Unverified 0ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs Apr 21, 2020 Image Captioning Image Description
— Unverified 0ParsVQA-Caps: A Benchmark for Visual Question Answering and Image Captioning in Persian Dec 7, 2022 Image Captioning Question Answering
— Unverified 0Partially-Supervised Image Captioning Jun 15, 2018 Image Captioning Object
— Unverified 0Partially-Supervised Novel Object Captioning Leveraging Context from Paired Data Sep 10, 2021 Image Captioning Novel Object Detection
— Unverified 0Partial Off-Policy Learning: Balance Accuracy and Diversity for Human-Oriented Image Captioning Jan 1, 2021 Diversity Generative Adversarial Network
— Unverified 0Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception Jan 1, 2025 Image Captioning Image Generation
— Unverified 0