AstroLLaVA: towards the unification of astronomical data and natural language Apr 11, 2025 Astronomy Image Captioning
Generative Distribution Prediction: A Unified Approach to Multimodal Learning Feb 10, 2025 Domain Adaptation Image Captioning
3D Spatial Understanding in MLLMs: Disambiguation and Evaluation Dec 9, 2024 3D dense captioning 3D visual grounding
Exploring Affordance and Situated Meaning in Image Captions: A Multimodal Analysis May 24, 2023 Image Captioning Natural Language Understanding
Exploring the Functional and Geometric Bias of Spatial Relations Using Neural Language Models Jun 1, 2018 Image Captioning
CLAMP: Contrastive LAnguage Model Prompt-tuning Dec 4, 2023 Contrastive Learning Image Captioning
Consensus Graph Representation Learning for Better Grounded Image Captioning Dec 2, 2021 Graph Representation Learning Hallucination
Attr2Style: A Transfer Learning Approach for Inferring Fashion Styles via Apparel Attributes Aug 26, 2020 Attribute Image Captioning
Geometry-Entangled Visual Semantic Transformer for Image Captioning Sep 29, 2021 Caption Generation Image Captioning
GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing Jan 12, 2025 Image Captioning Language Modeling
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing Mar 16, 2025 Change Detection Image Captioning
GeoSeq2Seq: Information Geometric Sequence-to-Sequence Networks Oct 25, 2017 Image Captioning Translation
Image captioning in different languages May 31, 2024 Image Captioning Position
Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions Feb 20, 2024 Image Captioning Question Answering
Exploring Spatial Language Grounding Through Referring Expressions Feb 4, 2025 Image Captioning Negation
Exploring Semantic Relationships for Unpaired Image Captioning Jun 20, 2021 Image Captioning Sentence
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style Oct 15, 2019 Decoder Image Captioning
CLAIR: Evaluating Image Captions with Large Language Models Oct 19, 2023 Diversity Image Captioning
Context-Aware Group Captioning via Self-Attention and Contrastive Features Apr 7, 2020 Image Captioning
Improving mitosis detection on histopathology images using large vision-language models Oct 11, 2023 Domain Generalization Image Captioning
Good Representation, Better Explanation: Role of Convolutional Neural Networks in Transformer-Based Remote Sensing Image Captioning Feb 22, 2025 Decoder Image Captioning
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks Sep 29, 2021 Edge-computing Face Detection
Image Captioning in news report scenario Mar 24, 2024 Image Captioning Recommendation Systems
Context-Independent OCR with Multimodal LLMs: Effects of Image Resolution and Visual Complexity Mar 31, 2025 Image Captioning Optical Character Recognition
Contextual Emotion Estimation from Image Captions Sep 22, 2023 Image Captioning Language Modelling
A Unified Sequence Interface for Vision Tasks Jun 15, 2022 Image Captioning Instance Segmentation
Graph Neural Networks in Vision-Language Image Understanding: A Survey Mar 7, 2023 Image Captioning Image Retrieval
Image Captioning using Multiple Transformers for Self-Attention Mechanism Feb 14, 2021 Image Captioning
GraphSeq2Seq: Graph-Sequence-to-Sequence for Neural Machine Translation Sep 27, 2018 Decoder Image Captioning
Green Runner: A tool for efficient model selection from model repositories May 26, 2023 Deep Learning Image Captioning
Aligning Attention Distribution to Information Flow for Hallucination Mitigation in Large Vision-Language Models May 20, 2025 Hallucination Image Captioning
GroundCap: A Visually Grounded Image Captioning Dataset Feb 19, 2025 Image Captioning Object Detection
Astrea: A MOE-based Visual Understanding Model with Progressive Alignment Mar 12, 2025 Contrastive Learning Cross-Modal Retrieval
Image Captioning based on Deep Reinforcement Learning Sep 13, 2018 Deep Reinforcement Learning Image Captioning
Exploring External Knowledge for Accurate modeling of Visual and Language Problems Jan 27, 2023 Image Captioning Machine Translation
Group-based Distinctive Image Captioning with Memory Attention Aug 20, 2021 Contrastive Learning Image Captioning
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention Apr 3, 2025 Caption Generation Contrastive Learning
GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints Jun 1, 2018 Diversity Image Captioning
AutoCaption: Image Captioning with Neural Architecture Search Dec 16, 2020 Decoder Image Captioning
Grow and Prune Compact, Fast, and Accurate LSTMs May 30, 2018 Image Captioning speech-recognition
Exploring Explicit and Implicit Visual Relationships for Image Captioning May 6, 2021 Decoder Image Captioning
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language Jun 28, 2024 Image Captioning
Guide Me: Interacting with Deep Networks Mar 30, 2018 Image Captioning Image Generation
Guiding Attention using Partial-Order Relationships for Image Captioning Apr 15, 2022 Caption Generation Image Captioning
CIC: A Framework for Culturally-Aware Image Captioning Feb 8, 2024 Descriptive Image Captioning
Image Captioning based on Feature Refinement and Reflective Decoding Jun 16, 2022 Decoder Image Captioning
Chittron: An Automatic Bangla Image Captioning System Sep 2, 2018 Caption Generation Image Captioning
HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning May 25, 2023 Caption Generation Decoder
Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models Feb 24, 2025 Hallucination Image Captioning
Cheap-fake Detection with LLM using Prompt Engineering Jun 5, 2023 Image Captioning Image Generation