| General audio tagging with ensembling convolutional neural network and statistical features | Oct 30, 2018 | Audio TaggingDescriptive | CodeCode Available | 1 |
| What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights | May 31, 2024 | DescriptiveSelf-Supervised Learning | CodeCode Available | 1 |
| Dataset Distillation via Vision-Language Category Prototype | Jun 30, 2025 | Dataset DistillationDescriptive | CodeCode Available | 1 |
| GOAL: Global-local Object Alignment Learning | Mar 22, 2025 | DescriptiveObject | CodeCode Available | 1 |
| A Bi-directional Transformer for Musical Chord Recognition | Jul 5, 2019 | Chord RecognitionDescriptive | CodeCode Available | 1 |
| GraphLIME: Local Interpretable Model Explanations for Graph Neural Networks | Jan 17, 2020 | Descriptivefeature selection | CodeCode Available | 1 |
| Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation | Aug 24, 2023 | cross-modal alignmentDescriptive | CodeCode Available | 1 |
| Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models | May 11, 2025 | DescriptiveDiagnostic | CodeCode Available | 1 |
| A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision | Aug 15, 2023 | DescriptiveLanguage Modelling | CodeCode Available | 1 |
| A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks | May 19, 2020 | Descriptive | CodeCode Available | 1 |
| HYDRA: A multimodal deep learning framework for malware classification | May 12, 2020 | ClassificationDeep Learning | CodeCode Available | 1 |
| IDAS: Intent Discovery with Abstractive Summarization | May 31, 2023 | Abstractive Text SummarizationDescriptive | CodeCode Available | 1 |
| Ins-HOI: Instance Aware Human-Object Interactions Recovery | Dec 15, 2023 | DescriptiveDisentanglement | CodeCode Available | 1 |
| InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems | Jun 19, 2025 | BenchmarkingDescriptive | CodeCode Available | 1 |
| Deep Graph Matching under Quadratic Constraint | Mar 11, 2021 | DescriptiveGraph Matching | CodeCode Available | 1 |
| CTRLsum: Towards Generic Controllable Text Summarization | Dec 8, 2020 | ArticlesDescriptive | CodeCode Available | 1 |
| LaMOT: Language-Guided Multi-Object Tracking | Jun 12, 2024 | DescriptiveMulti-Object Tracking | CodeCode Available | 1 |
| Enhancing Monocular 3D Scene Completion with Diffusion Model | Mar 2, 2025 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 1 |
| A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | May 29, 2024 | Autonomous DrivingBoundary Detection | CodeCode Available | 1 |
| ANNdotNET -- deep learning tool on .NET Platform | Sep 23, 2020 | Deep LearningDescriptive | CodeCode Available | 1 |
| Learning Transferable Spatiotemporal Representations from Natural Script Knowledge | Sep 30, 2022 | DescriptiveRepresentation Learning | CodeCode Available | 1 |
| Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Aug 21, 2024 | Bug fixingDescriptive | CodeCode Available | 1 |
| Deep Implicit Statistical Shape Models for 3D Medical Image Delineation | Apr 7, 2021 | DescriptiveLiver Segmentation | CodeCode Available | 1 |
| Meta-Learning Siamese Network for Few-Shot Text Classification | Feb 5, 2023 | ClassificationDescriptive | CodeCode Available | 1 |
| DFR: Deep Feature Reconstruction for Unsupervised Anomaly Segmentation | Dec 13, 2020 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 1 |