| ATST: Audio Representation Learning with Teacher-Student Transformer | Apr 26, 2022 | Audio Classification, Instrument Recognition | Code Available | 1 |
| Audio Embeddings as Teachers for Music Classification | Jun 30, 2023 | Classification, Information Retrieval | Code Available | 1 |
| Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks | Jun 7, 2023 | Audio Classification, Audio Tagging | Code Available | 1 |
| Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning | Feb 17, 2025 | Audio Classification, Audio Tagging | Code Available | 1 |
| Efficient Training of Audio Transformers with Patchout | Oct 11, 2021 | Acoustic Scene Classification, Audio Classification | Code Available | 1 |
| WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity | Mar 14, 2024 | Instance Segmentation, Instrument Recognition | Unverified | 0 |
| Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery | Mar 29, 2025 | Action Understanding, Instrument Recognition | Unverified | 0 |
| Cross-modal variational inference for bijective signal-symbol translation | Feb 10, 2020 | Audio Generation, Density Estimation | Unverified | 0 |
| Deep Learning for Surgical Instrument Recognition and Segmentation in Robotic-Assisted Surgeries: A Systematic Review | Oct 9, 2024 | Instrument Recognition, Surgical tool detection | Unverified | 0 |
| Deep Neural Network for Musical Instrument Recognition using MFCCs | May 3, 2021 | General Classification, Instrument Recognition | Unverified | 0 |