SOTAVerified

Instrument Recognition

Papers

Showing 125 of 39 papers

TitleStatusHype
A Hierarchical Deep Learning Approach for Minority Instrument DetectionCode0
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery0
M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAPCode0
SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence0
Masked Latent Prediction and Classification for Self-Supervised Audio Representation LearningCode1
MIRFLEX: Music Information Retrieval Feature Library for ExtractionCode1
Deep Learning for Surgical Instrument Recognition and Segmentation in Robotic-Assisted Surgeries: A Systematic Review0
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery0
I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognitionCode0
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four StemsCode2
WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity0
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio ModelsCode2
Self-refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data0
Transfer Learning and Bias Correction with Pre-trained Audio EmbeddingsCode1
Audio Embeddings as Teachers for Music ClassificationCode1
Surgical Phase and Instrument Recognition: How to identify appropriate Dataset SplitsCode0
Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level TasksCode1
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training0
EfficientLEAF: A Faster LEarnable Audio Frontend of Questionable UseCode1
Jointist: Joint Learning for Multi-instrument Transcription and Its Applications0
ATST: Audio Representation Learning with Teacher-Student TransformerCode1
Efficient Training of Audio Transformers with PatchoutCode1
ChMusic: A Traditional Chinese Music Dataset for Evaluation of Instrument RecognitionCode1
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument soundsCode0
Leveraging Hierarchical Structures for Few-Shot Musical Instrument RecognitionCode1
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1M2D-CLAPAccuracy80.6Unverified
2M2D2 AS+Accuracy79.7Unverified
3M2D ASAccuracy78.7Unverified
4MATPAC (SSL, linear eval)Accuracy74.6Unverified
5melspectAccuracy72.1Unverified
6EfficientLEAFAccuracy71.7Unverified
7LEAFAccuracy69.2Unverified
#ModelMetricClaimedVerifiedStatus
1DyMN-Lmean average precision0.86Unverified
2MATPAC (SSL Model, linear eval)mean average precision0.85Unverified
3EAsT-KD + PaSSTmean average precision0.85Unverified
4EAsT-Final + PaSSTmean average precision0.85Unverified
5PaSSTmean average precision0.84Unverified
#ModelMetricClaimedVerifiedStatus
1SVMF1-score0.81Unverified