| Automated Image Captioning with CNNs and Transformers | Dec 13, 2024 | DescriptiveHyperparameter Optimization | CodeCode Available | 0 | 5 |
| Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part II | Sep 17, 2024 | BenchmarkingDescriptive | CodeCode Available | 0 | 5 |
| IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-training | Oct 11, 2023 | Contrastive LearningDescriptive | CodeCode Available | 0 | 5 |
| Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing | May 6, 2019 | DescriptiveImage Captioning | CodeCode Available | 0 | 5 |
| Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning | Dec 17, 2024 | Dense Video CaptioningDescriptive | CodeCode Available | 0 | 5 |
| Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text | Apr 6, 2016 | DescriptiveLanguage Modeling | CodeCode Available | 0 | 5 |
| H-RANSAC, an algorithmic variant for Homography image transform from featureless point sets: application to video-based football analytics | Oct 7, 2023 | DescriptiveImage Stitching | CodeCode Available | 0 | 5 |
| How Contentious Terms About People and Cultures are Used in Linked Open Data | Nov 13, 2023 | DescriptiveWord Sense Disambiguation | CodeCode Available | 0 | 5 |
| Audio Large Language Models Can Be Descriptive Speech Quality Evaluators | Jan 27, 2025 | Descriptive | CodeCode Available | 0 | 5 |
| A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions | Feb 9, 2025 | DescriptiveMultimodal Deep Learning | CodeCode Available | 0 | 5 |
| A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation | Nov 19, 2024 | DescriptiveDiagnostic | CodeCode Available | 0 | 5 |
| Attribute-based Visual Reprogramming for Image Classification with CLIP | Jan 23, 2025 | AttributeDescriptive | CodeCode Available | 0 | 5 |
| Hierarchical Context-aware Network for Dense Video Event Captioning | Aug 1, 2021 | Descriptive | CodeCode Available | 0 | 5 |
| Hierarchical Deep Multi-modal Network for Medical Visual Question Answering | Sep 27, 2020 | DescriptiveMedical Visual Question Answering | CodeCode Available | 0 | 5 |
| Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Dec 10, 2024 | Autonomous DrivingDescriptive | CodeCode Available | 0 | 5 |
| Attend to You: Personalized Image Captioning with Context Sequence Memory Networks | Apr 21, 2017 | DescriptiveImage Captioning | CodeCode Available | 0 | 5 |
| GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models | Aug 14, 2024 | DescriptiveFont Generation | CodeCode Available | 0 | 5 |
| Harnessing the Power of Prompt-based Techniques for Generating School-Level Questions using Large Language Models | Dec 2, 2023 | DescriptiveQuestion Answering | CodeCode Available | 0 | 5 |
| Graph Representation Learning for Road Type Classification | Jul 16, 2021 | ClassificationDescriptive | CodeCode Available | 0 | 5 |
| Graphite: GRAPH-Induced feaTure Extraction for Point Cloud Registration | Oct 18, 2020 | DescriptiveKeypoint Detection | CodeCode Available | 0 | 5 |
| Greedy Search for Descriptive Spatial Face Features | Jan 7, 2017 | DescriptiveFacial Expression Recognition | CodeCode Available | 0 | 5 |
| Glocal Explanations of Expected Goal Models in Soccer | Aug 29, 2023 | DescriptiveExplainable artificial intelligence | CodeCode Available | 0 | 5 |
| A Machine Learning Approach to Classifying Construction Cost Documents into the International Construction Measurement Standard | Oct 24, 2022 | DescriptiveMulti Class Text Classification | CodeCode Available | 0 | 5 |
| Global Semantic Descriptors for Zero-Shot Action Recognition | Sep 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 | 5 |
| Good News, Everyone! Context driven entity-aware captioning for news images | Apr 2, 2019 | ArticlesDescriptive | CodeCode Available | 0 | 5 |