Image to text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–246 of 246 papers

Title	Date	Tasks	Status
Image Semantic Relation Generation	Oct 19, 2022	Image RetrievalImage Segmentation	—Unverified
Cross-modal Contrastive Attention Model for Medical Report Generation	Oct 1, 2022	Image to textMedical Report Generation	—Unverified
Every picture tells a story: Image-grounded controllable stylistic story generation	Sep 4, 2022	Image CaptioningImage to text	—Unverified
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning	Aug 18, 2022	Image GenerationImage to text	—Unverified
Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval	Jul 29, 2022	Cross-Modal RetrievalData Augmentation	—Unverified
SRCB at SemEval-2022 Task 5: Pretraining Based Image to Text Late Sequential Fusion System for Multimodal Misogynous Meme Identification	Jul 1, 2022	Image to text	—Unverified
Delving into the Openness of CLIP	Jun 4, 2022	image-classificationImage Classification	CodeCode Available
Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset	Jun 1, 2022	Caption Generationimage-classification	—Unverified
Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations	Apr 20, 2022	Cross-Modal RetrievalImage Retrieval	—Unverified
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval	Apr 15, 2022	Contrastive LearningCross-Modal Retrieval	—Unverified
Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment	Apr 8, 2022	Image to textLanguage Modeling	CodeCode Available
Two-stream Hierarchical Similarity Reasoning for Image-text Matching	Mar 10, 2022	Image-text matchingImage to text	—Unverified
A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering	Jan 14, 2022	Generative Question AnsweringImage to text	—Unverified
Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering	Jan 1, 2022	Generative Question AnsweringImage to text	—Unverified
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval	Jan 1, 2022	Causal InferenceContrastive Learning	—Unverified
Self-Supervised Image-to-Text and Text-to-Image Synthesis	Dec 9, 2021	Image GenerationImage to text	CodeCode Available
Exploration into Translation-Equivariant Image Quantization	Dec 1, 2021	Image GenerationImage to text	CodeCode Available
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages	Nov 24, 2021	DecoderImage to text	—Unverified
Contrastive Learning of Visual-Semantic Embeddings	Oct 17, 2021	Contrastive Learningimage-classification	—Unverified
Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval	May 16, 2021	Graph GenerationImage Captioning	—Unverified
Knowledge driven Description Synthesis for Floor Plan Interpretation	Mar 15, 2021	Caption GenerationDescriptive	—Unverified
Hierarchical Gumbel Attention Network for Text-based Person Search	Oct 10, 2020	Image RetrievalImage to text	—Unverified
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation	Sep 17, 2020	cross-modal alignmentImage to text	—Unverified
Development of a New Image-to-text Conversion System for Pashto, Farsi and Traditional Chinese	May 8, 2020	Image to textOptical Character Recognition (OCR)	—Unverified
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications	Nov 10, 2019	Caption GenerationImage Generation	—Unverified
Illegible Text to Readable Text: An Image-to-Image Transformation using Conditional Sliced Wasserstein Adversarial Networks	Oct 11, 2019	Generative Adversarial NetworkImage-to-Image Translation	—Unverified
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task	Oct 8, 2019	Cross-Modal RetrievalImage to text	CodeCode Available
From Image to Text in Sentiment Analysis via Regression and Deep Learning	Sep 1, 2019	Image to textregression	—Unverified
Knowledge Aware Semantic Concept Expansion for Image-Text Matching	Aug 10, 2019	Common Sense ReasoningContent-Based Image Retrieval	—Unverified
MirrorGAN: Learning Text-to-image Generation by Redescription	Mar 14, 2019	DiversityImage Generation	CodeCode Available
Canonical Correlation Analysis for Misaligned Satellite Image Change Detection	Dec 21, 2018	Action RecognitionChange Detection	—Unverified
Doc2Im: document to image conversion through self-attentive embedding	Nov 8, 2018	Document To Image Conversiondocument understanding	—Unverified
SpatialVOC2K: A Multilingual Dataset of Images with Annotations and Features for Spatial Relations between Objects	Nov 1, 2018	Image to textObject	CodeCode Available
EmojiGAN: learning emojis distributions with a generative model	Oct 1, 2018	Image CaptioningImage to text	—Unverified
Text-to-Image-to-Text Translation using Cycle Consistent Adversarial Networks	Aug 14, 2018	Image to textSentence	CodeCode Available
Deductron -- A Recurrent Neural Network	Jun 23, 2018	Image to textOptical Character Recognition (OCR)	—Unverified
Using Inter-Sentence Diverse Beam Search to Reduce Redundancy in Visual Storytelling	May 30, 2018	Image to textSentence	—Unverified
Turbo Learning for Captionbot and Drawingbot	May 21, 2018	Image CaptioningImage Generation	—Unverified
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions	Mar 10, 2018	Image DescriptionImage to text	CodeCode Available
Synthesizing Novel Pairs of Image and Text	Dec 18, 2017	Image to text	—Unverified
From Image to Text Classification: A Novel Approach based on Clustering Word Embeddings	Jul 25, 2017	ClusteringGeneral Classification	—Unverified
Better Text Understanding Through Image-To-Text Transfer	May 23, 2017	Image to text	—Unverified
I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation	Mar 20, 2017	Caption GenerationData Augmentation	—Unverified
A Gentle Tutorial of Recurrent Neural Network with Error Backpropagation	Oct 8, 2016	Handwriting RecognitionImage to text	CodeCode Available
Learning Deep Structure-Preserving Image-Text Embeddings	Nov 19, 2015	Image RetrievalImage to text	—Unverified
Effective Use of Word Order for Text Categorization with Convolutional Neural Networks	Dec 1, 2014	General ClassificationImage to text	CodeCode Available

Show:10 25 50

← PrevPage 5 of 5Next →

No leaderboard results yet.