SOTAVerified

Caption Generation

Papers

Showing 251300 of 310 papers

TitleStatusHype
Local Information Assisted Attention-free Decoder for Audio CaptioningCode0
AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGSCode0
Tensor Product Generation Networks for Deep NLP ModelingCode0
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption GenerationCode0
SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure CaptioningCode0
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image CaptioningCode0
Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target TokensCode0
Efficient Urdu Caption Generation using Attention based LSTMCode0
Event and Entity Extraction from Generated Video CaptionsCode0
Journalistic Guidelines Aware News Image CaptioningCode0
ViPE: Visualise Pretty-much EverythingCode0
Sequence to Sequence -- Video to TextCode0
Memeify: A Large-Scale Meme Generation SystemCode0
3D CoCa: Contrastive Learners are 3D CaptionersCode0
Image Captioning with Deep Bidirectional LSTMsCode0
Using Artificial Tokens to Control Languages for Multilingual Image Caption GenerationCode0
Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal FusionCode0
Image Caption Generation for News ArticlesCode0
DSD: Dense-Sparse-Dense Training for Deep Neural NetworksCode0
Multi-LLM Collaborative Caption Generation in Scientific DocumentsCode0
Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERTCode0
Where to put the Image in an Image Caption GeneratorCode0
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMsCode0
Multimodal Preference Data Synthetic Alignment with Reward ModelCode0
Discriminability objective for training descriptive captionsCode0
An Empirical Study of Language CNN for Image CaptioningCode0
Multi-source weak supervision for saliency detectionCode0
CNN Fixations: An unraveling approach to visualize the discriminative image regionsCode0
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon CaptioningCode0
Guiding Long-Short Term Memory for Image Caption GenerationCode0
Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative RefinementCode0
DeepDiary: Automatic Caption Generation for Lifelogging Image StreamsCode0
Global Object Proposals for Improving Multi-Sentence Video DescriptionsCode0
CLIP Meets Video Captioning: Concept-Aware Representation Learning Does MatterCode0
NICGSlowDown: Evaluating the Efficiency Robustness of Neural Image Caption Generation ModelsCode0
Bivariate Beta-LSTMCode0
Dual-path Collaborative Generation Network for Emotional Video CaptioningCode0
Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder NetworkCode0
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image CaptioningCode0
Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text SummarizationCode0
From Simple to Professional: A Combinatorial Controllable Image Captioning AgentCode0
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human FeedbackCode0
Cortico-cerebellar networks as decoupling neural interfacesCode0
Pre-gen metrics: Predicting caption quality metrics without generating captionsCode0
R^3Net:Relation-embedded Representation Reconstruction Network for Change CaptioningCode0
Rˆ3Net:Relation-embedded Representation Reconstruction Network for Change CaptioningCode0
Exploring Models and Data for Remote Sensing Image Caption GenerationCode0
Recurrent Neural Network RegularizationCode0
Referring Expression Object Segmentation with Caption-Aware ConsistencyCode0
Regularizing RNNs for Caption Generation by Reconstructing The Past with The PresentCode0
Show:102550
← PrevPage 6 of 7Next →

No leaderboard results yet.