| A Hierarchical Deep Learning Approach for Minority Instrument Detection | Jun 26, 2025 | Deep Learning, Diversity | Code Available | 0 |
| Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery | Mar 29, 2025 | Action Understanding, Instrument Recognition | Unverified | 0 |
| M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP | Mar 28, 2025 | Audio captioning, Audio Classification | Code Available | 0 |
| SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence | Mar 13, 2025 | Action Recognition, Instrument Recognition | Unverified | 0 |
| Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning | Feb 17, 2025 | Audio Classification, Audio Tagging | Code Available | 1 |
| MIRFLEX: Music Information Retrieval Feature Library for Extraction | Nov 1, 2024 | Benchmarking, Information Retrieval | Code Available | 1 |
| Deep Learning for Surgical Instrument Recognition and Segmentation in Robotic-Assisted Surgeries: A Systematic Review | Oct 9, 2024 | Instrument Recognition, Surgical tool detection | Unverified | 0 |
| PitVis-2023 Challenge: Workflow Recognition in Videos of Endoscopic Pituitary Surgery | Sep 2, 2024 | Instrument Recognition | Unverified | 0 |
| I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition | Jul 25, 2024 | Instrument Recognition, Retrieval | Code Available | 0 |
| A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems | Jun 26, 2024 | Audio Source Separation, Decoder | Code Available | 2 |