| Neural Dependency Coding inspired Multimodal Fusion | Sep 28, 2021 | Emotion Recognition, Image Description | Unverified | 0 |
| CIDEr-R: Robust Consensus-based Image Description Evaluation | Sep 28, 2021 | Descriptive, Image Description | Code Available | 0 |
| Cross Modification Attention Based Deliberation Model for Image Captioning | Sep 17, 2021 | Decoder, Descriptive | Unverified | 0 |
| SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant | Sep 14, 2021 | Image Description | Unverified | 0 |
| Computer Vision and Conflicting Values: Describing People with Automated Alt Text | May 26, 2021 | Image Description | Unverified | 0 |
| How Do Image Description Systems Describe People? A Targeted Assessment of System Competence in the PEOPLE-domain | Dec 1, 2020 | Image Description | Code Available | 0 |
| Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze | Nov 9, 2020 | Cross-Modal Alignment, Image Captioning | Code Available | 0 |
| EPYNET: Efficient Pyramidal Network for Clothing Segmentation | Oct 13, 2020 | Data Augmentation, Image Description | Unverified | 0 |
| Multimodal Word Sense Disambiguation in Creative Practice | Jul 15, 2020 | Classification, Descriptive | Code Available | 0 |
| On the use of human reference data for evaluating automatic image descriptions | Jun 15, 2020 | Image Description | Unverified | 0 |