| Effective Use of Word Order for Text Categorization with Convolutional Neural Networks | Dec 1, 2014 | General ClassificationImage to text | CodeCode Available | 0 | 5 |
| A Gentle Tutorial of Recurrent Neural Network with Error Backpropagation | Oct 8, 2016 | Handwriting RecognitionImage to text | CodeCode Available | 0 | 5 |
| Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs) | Oct 25, 2024 | AttributeImage to text | CodeCode Available | 0 | 5 |
| Text-to-Image-to-Text Translation using Cycle Consistent Adversarial Networks | Aug 14, 2018 | Image to textSentence | CodeCode Available | 0 | 5 |
| Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning | Aug 18, 2022 | Image GenerationImage to text | —Unverified | 0 | 0 |
| DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding | Dec 2, 2024 | Caption GenerationDomain Generalization | —Unverified | 0 | 0 |
| DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models | Dec 12, 2023 | DenoisingDiversity | —Unverified | 0 | 0 |
| Ask, Attend, Attack: A Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models | Aug 16, 2024 | Image to text | —Unverified | 0 | 0 |
| DiffusionSTR: Diffusion Model for Scene Text Recognition | Jun 29, 2023 | Image to textmodel | —Unverified | 0 | 0 |
| Development of a New Image-to-text Conversion System for Pashto, Farsi and Traditional Chinese | May 8, 2020 | Image to textOptical Character Recognition (OCR) | —Unverified | 0 | 0 |