| Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions | Nov 13, 2024 | DescriptiveHallucination | CodeCode Available | 0 |
| DYAD: A Descriptive Yet Abjuring Density efficient approximation to linear neural network layers | Dec 11, 2023 | DescriptiveGPU | CodeCode Available | 0 |
| Hierarchical Context-aware Network for Dense Video Event Captioning | Aug 1, 2021 | Descriptive | CodeCode Available | 0 |
| HICEScore: A Hierarchical Metric for Image Captioning Evaluation | Jul 26, 2024 | DescriptiveImage Captioning | CodeCode Available | 0 |
| Uninformed Students: Student-Teacher Anomaly Detection with Discriminative Latent Embeddings | Nov 6, 2019 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 0 |
| Harnessing the Power of Prompt-based Techniques for Generating School-Level Questions using Large Language Models | Dec 2, 2023 | DescriptiveQuestion Answering | CodeCode Available | 0 |
| SEKE: Specialised Experts for Keyword Extraction | Dec 18, 2024 | DescriptiveKeyword Extraction | CodeCode Available | 0 |
| Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Dec 10, 2024 | Autonomous DrivingDescriptive | CodeCode Available | 0 |
| Language-Driven Interactive Shadow Detection | Aug 16, 2024 | DescriptiveShadow Detection | CodeCode Available | 0 |
| GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models | Aug 14, 2024 | DescriptiveFont Generation | CodeCode Available | 0 |
| A Hierarchical Approach for Generating Descriptive Image Paragraphs | Nov 20, 2016 | Dense CaptioningDescriptive | CodeCode Available | 0 |
| Self-attention on Multi-Shifted Windows for Scene Segmentation | Jul 10, 2022 | DescriptiveScene Segmentation | CodeCode Available | 0 |
| DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension | Apr 21, 2018 | DescriptiveReading Comprehension | CodeCode Available | 0 |
| Open Digital Rights Enforcement Framework (ODRE): from descriptive to enforceable policies | Sep 26, 2024 | Descriptive | CodeCode Available | 0 |
| Large-scale Multi-granular Concept Extraction Based on Machine Reading Comprehension | Aug 30, 2022 | DescriptiveKnowledge Graphs | CodeCode Available | 0 |
| Self-optimizing Feature Generation via Categorical Hashing Representation and Hierarchical Reinforcement Crossing | Sep 8, 2023 | Descriptive | CodeCode Available | 0 |
| A Graph Theoretic Approach for Object Shape Representation in Compositional Hierarchies Using a Hybrid Generative-Descriptive Model | Jan 21, 2015 | ClusteringDescriptive | CodeCode Available | 0 |
| Self-supervised Product Quantization for Deep Unsupervised Image Retrieval | Sep 6, 2021 | Contrastive LearningDescriptive | CodeCode Available | 0 |
| Collaborative Auto-encoding for Blind Image Quality Assessment | May 24, 2023 | DecoderDescriptive | CodeCode Available | 0 |
| Bounding and Approximating Intersectional Fairness through Marginal Fairness | Jun 12, 2022 | DescriptiveFairness | CodeCode Available | 0 |
| VREN: Volleyball Rally Dataset with Expression Notation Language | Sep 28, 2022 | Decision MakingDescriptive | CodeCode Available | 0 |
| Greedy Search for Descriptive Spatial Face Features | Jan 7, 2017 | DescriptiveFacial Expression Recognition | CodeCode Available | 0 |
| Dropout Concrete Autoencoder for Band Selection on HSI Scenes | Jan 29, 2024 | Deep LearningDescriptive | CodeCode Available | 0 |
| Overcoming the Identity Mapping Problem in Self-Supervised Hyperspectral Anomaly Detection | Apr 5, 2025 | Anomaly DetectionDescriptive | CodeCode Available | 0 |
| Learning Deep Features for One-Class Classification | Jan 16, 2018 | Anomaly DetectionDescriptive | CodeCode Available | 0 |
| Boosting Audio-visual Zero-shot Learning with Large Language Models | Nov 21, 2023 | audio-visual learningDescriptive | CodeCode Available | 0 |
| Learning Efficient Representations of Neutrino Telescope Events | Oct 17, 2024 | Computational EfficiencyDescriptive | CodeCode Available | 0 |
| Learning English with Peppa Pig | Feb 25, 2022 | Descriptive | CodeCode Available | 0 |
| Overview of PicTropes, a film trope dataset | Sep 28, 2018 | Descriptive | CodeCode Available | 0 |
| A Neural Topical Expansion Framework for Unstructured Persona-oriented Dialogue Generation | Feb 6, 2020 | DescriptiveDialogue Generation | CodeCode Available | 0 |
| Temporal and Semantic Evaluation Metrics for Foundation Models in Post-Hoc Analysis of Robotic Sub-tasks | Mar 25, 2024 | DescriptiveMotion Planning | CodeCode Available | 0 |
| Automated Image Captioning with CNNs and Transformers | Dec 13, 2024 | DescriptiveHyperparameter Optimization | CodeCode Available | 0 |
| Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge | Jan 27, 2025 | Anomaly DetectionAutonomous Driving | CodeCode Available | 0 |
| Low-Rank Subspace Override for Unsupervised Domain Adaptation | Jul 2, 2019 | DescriptiveDomain Adaptation | CodeCode Available | 0 |
| ANEA: Automated (Named) Entity Annotation for German Domain-Specific Texts | Dec 13, 2021 | Descriptivenamed-entity-recognition | CodeCode Available | 0 |
| Audio Large Language Models Can Be Descriptive Speech Quality Evaluators | Jan 27, 2025 | Descriptive | CodeCode Available | 0 |
| Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature Alignment | Sep 24, 2023 | DescriptiveDomain Adaptation | CodeCode Available | 0 |
| Graph Representation Learning for Road Type Classification | Jul 16, 2021 | ClassificationDescriptive | CodeCode Available | 0 |
| Semi-supervised multimodal coreference resolution in image narrations | Oct 20, 2023 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | DescriptiveHallucination | CodeCode Available | 0 |
| Attribute-based Visual Reprogramming for Image Classification with CLIP | Jan 23, 2025 | AttributeDescriptive | CodeCode Available | 0 |
| Graphite: GRAPH-Induced feaTure Extraction for Point Cloud Registration | Oct 18, 2020 | DescriptiveKeypoint Detection | CodeCode Available | 0 |
| SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text | May 18, 2018 | DescriptiveImage Captioning | CodeCode Available | 0 |
| Less Descriptive yet Discriminative: Quantifying the Properties of Multimodal Referring Utterances via CLIP | May 1, 2022 | Descriptive | CodeCode Available | 0 |
| Attend to You: Personalized Image Captioning with Context Sequence Memory Networks | Apr 21, 2017 | DescriptiveImage Captioning | CodeCode Available | 0 |
| Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought | May 23, 2023 | DescriptiveVideo Prediction | CodeCode Available | 0 |
| CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Dec 16, 2024 | DescriptiveMath | CodeCode Available | 0 |
| Good News, Everyone! Context driven entity-aware captioning for news images | Apr 2, 2019 | ArticlesDescriptive | CodeCode Available | 0 |
| Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions | Jun 23, 2016 | Cross-Modal Information RetrievalCross-Modal Retrieval | CodeCode Available | 0 |
| Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking | Mar 18, 2025 | DescriptiveInstance Segmentation | CodeCode Available | 0 |