| A Hierarchical Deep Learning Approach for Minority Instrument Detection | Jun 26, 2025 | Deep Learning, Diversity | Code Available | 0 |
| Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery | Mar 29, 2025 | Action Understanding, Instrument Recognition | Unverified | 0 |
| M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP | Mar 28, 2025 | Audio captioning, Audio Classification | Code Available | 0 |
| SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence | Mar 13, 2025 | Action Recognition, Instrument Recognition | Unverified | 0 |
| Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning | Feb 17, 2025 | Audio Classification, Audio Tagging | Code Available | 1 |
| MIRFLEX: Music Information Retrieval Feature Library for Extraction | Nov 1, 2024 | Benchmarking, Information Retrieval | Code Available | 1 |
| Deep Learning for Surgical Instrument Recognition and Segmentation in Robotic-Assisted Surgeries: A Systematic Review | Oct 9, 2024 | Instrument Recognition, Surgical tool detection | Unverified | 0 |
| PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery | Sep 2, 2024 | Instrument Recognition | Unverified | 0 |
| I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition | Jul 25, 2024 | Instrument Recognition, Retrieval | Code Available | 0 |
| A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems | Jun 26, 2024 | Audio Source Separation, Decoder | Code Available | 2 |
| WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity | Mar 14, 2024 | Instance Segmentation, Instrument Recognition | Unverified | 0 |
| Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models | Oct 24, 2023 | Audio Classification, Audio Tagging | Code Available | 2 |
| Self-refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data | Jul 24, 2023 | Instrument Recognition, Music Source Separation | Unverified | 0 |
| Transfer Learning and Bias Correction with Pre-trained Audio Embeddings | Jul 20, 2023 | Information Retrieval, Instrument Recognition | Code Available | 1 |
| Audio Embeddings as Teachers for Music Classification | Jun 30, 2023 | Classification, Information Retrieval | Code Available | 1 |
| Surgical Phase and Instrument Recognition: How to identify appropriate Dataset Splits | Jun 29, 2023 | Data Visualization, Instrument Recognition | Code Available | 0 |
| Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks | Jun 7, 2023 | Audio Classification, Audio Tagging | Code Available | 1 |
| Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training | Feb 1, 2023 | Chord Recognition, Instrument Recognition | Unverified | 0 |
| EfficientLEAF: A Faster LEarnable Audio Frontend of Questionable Use | Jul 12, 2022 | Audio Classification, Classification | Code Available | 1 |
| Jointist: Joint Learning for Multi-instrument Transcription and Its Applications | Jun 22, 2022 | Chord Recognition, Instrument Recognition | Unverified | 0 |
| ATST: Audio Representation Learning with Teacher-Student Transformer | Apr 26, 2022 | Audio Classification, Instrument Recognition | Code Available | 1 |
| Efficient Training of Audio Transformers with Patchout | Oct 11, 2021 | Acoustic Scene Classification, Audio Classification | Code Available | 1 |
| ChMusic: A Traditional Chinese Music Dataset for Evaluation of Instrument Recognition | Aug 19, 2021 | Information Retrieval, Instrument Recognition | Code Available | 1 |
| Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds | Jul 24, 2021 | Data Augmentation, Instrument Recognition | Code Available | 0 |
| Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition | Jul 14, 2021 | Few-Shot Learning, Instrument Recognition | Code Available | 1 |