| Boosting Audio-visual Zero-shot Learning with Large Language Models | Nov 21, 2023 | audio-visual learning, Descriptive | Code Available | 0 |
| Learning Efficient Representations of Neutrino Telescope Events | Oct 17, 2024 | Computational Efficiency, Descriptive | Code Available | 0 |
| Learning English with Peppa Pig | Feb 25, 2022 | Descriptive | Code Available | 0 |
| Overview of PicTropes, a film trope dataset | Sep 28, 2018 | Descriptive | Code Available | 0 |
| A Neural Topical Expansion Framework for Unstructured Persona-oriented Dialogue Generation | Feb 6, 2020 | Descriptive, Dialogue Generation | Code Available | 0 |
| Temporal and Semantic Evaluation Metrics for Foundation Models in Post-Hoc Analysis of Robotic Sub-tasks | Mar 25, 2024 | Descriptive, Motion Planning | Code Available | 0 |
| Automated Image Captioning with CNNs and Transformers | Dec 13, 2024 | Descriptive, Hyperparameter Optimization | Code Available | 0 |
| Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge | Jan 27, 2025 | Anomaly Detection, Autonomous Driving | Code Available | 0 |
| Low-Rank Subspace Override for Unsupervised Domain Adaptation | Jul 2, 2019 | Descriptive, Domain Adaptation | Code Available | 0 |
| ANEA: Automated (Named) Entity Annotation for German Domain-Specific Texts | Dec 13, 2021 | Descriptive, named-entity-recognition | Code Available | 0 |
| Audio Large Language Models Can Be Descriptive Speech Quality Evaluators | Jan 27, 2025 | Descriptive | Code Available | 0 |
| Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature Alignment | Sep 24, 2023 | Descriptive, Domain Adaptation | Code Available | 0 |
| Graph Representation Learning for Road Type Classification | Jul 16, 2021 | Classification, Descriptive | Code Available | 0 |
| Semi-supervised multimodal coreference resolution in image narrations | Oct 20, 2023 | coreference-resolution, Coreference Resolution | Code Available | 0 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | Descriptive, Hallucination | Code Available | 0 |
| Attribute-based Visual Reprogramming for Image Classification with CLIP | Jan 23, 2025 | Attribute, Descriptive | Code Available | 0 |
| Graphite: GRAPH-Induced feaTure Extraction for Point Cloud Registration | Oct 18, 2020 | Descriptive, Keypoint Detection | Code Available | 0 |
| SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text | May 18, 2018 | Descriptive, Image Captioning | Code Available | 0 |
| Less Descriptive yet Discriminative: Quantifying the Properties of Multimodal Referring Utterances via CLIP | May 1, 2022 | Descriptive | Code Available | 0 |
| Attend to You: Personalized Image Captioning with Context Sequence Memory Networks | Apr 21, 2017 | Descriptive, Image Captioning | Code Available | 0 |
| Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought | May 23, 2023 | Descriptive, Video Prediction | Code Available | 0 |
| CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Dec 16, 2024 | Descriptive, Math | Code Available | 0 |
| Good News, Everyone! Context driven entity-aware captioning for news images | Apr 2, 2019 | Articles, Descriptive | Code Available | 0 |
| Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions | Jun 23, 2016 | Cross-Modal Information Retrieval, Cross-Modal Retrieval | Code Available | 0 |
| Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking | Mar 18, 2025 | Descriptive, Instance Segmentation | Code Available | 0 |