| CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | May 23, 2025 | Automatic Speech RecognitionEmotion Recognition | CodeCode Available | 11 |
| FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs | Jul 4, 2024 | Emotion RecognitionEvent Detection | CodeCode Available | 11 |
| Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Feb 9, 2024 | Event DetectionHate Speech Detection | CodeCode Available | 4 |
| Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space | Dec 14, 2024 | Event Detection | CodeCode Available | 4 |
| SocialED: A Python Library for Social Event Detection | Dec 18, 2024 | CPUEvent Detection | CodeCode Available | 4 |
| OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia | Jan 23, 2025 | Emotion RecognitionEvent Detection | CodeCode Available | 3 |
| BirdNET: A deep learning solution for avian diversity monitoring | Jan 27, 2021 | Data AugmentationDeep Learning | CodeCode Available | 2 |
| WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research | Mar 30, 2023 | Audio captioningEvent Detection | CodeCode Available | 2 |
| Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection | Sep 26, 2024 | Event DetectionRepresentation Learning | CodeCode Available | 2 |
| BERTrend: Neural Topic Modeling for Emerging Trends Detection | Nov 8, 2024 | Event Detection | CodeCode Available | 2 |
| Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-making | Nov 18, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding | Sep 25, 2023 | Event Argument ExtractionEvent Detection | CodeCode Available | 2 |
| Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer | Jul 28, 2022 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains | Oct 5, 2024 | DiagnosticEvent Detection | CodeCode Available | 2 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 |
| The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation | Jun 12, 2023 | Event Argument ExtractionEvent Detection | CodeCode Available | 2 |
| MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection | Aug 16, 2024 | Event DetectionSound Event Detection | CodeCode Available | 2 |
| Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection | Mar 27, 2024 | Data AugmentationDomain Adaptation | CodeCode Available | 2 |
| Deep Learning in Latent Space for Video Prediction and Compression | Jun 19, 2021 | Anomaly DetectionDeep Learning | CodeCode Available | 1 |
| DCASENET: A joint pre-trained deep neural network for detecting and classifying acoustic scenes and events | Sep 21, 2020 | Acoustic Scene ClassificationAudio Tagging | CodeCode Available | 1 |
| Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection | Jun 8, 2024 | Event DetectionSound Event Detection | CodeCode Available | 1 |
| Data Efficient Video Transformer for Violence Detection | Jul 17, 2021 | Action RecognitionEvent Detection | CodeCode Available | 1 |
| A Comprehensive Python Library for Deep Learning-Based Event Detection in Multivariate Time Series Data and Information Retrieval in NLP | Oct 25, 2023 | Binary ClassificationDeep Learning | CodeCode Available | 1 |
| Couple Learning for semi-supervised sound event detection | Oct 12, 2021 | Event DetectionSound Event Detection | CodeCode Available | 1 |
| DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection | Jun 29, 2021 | Audio ClassificationDirection of Arrival Estimation | CodeCode Available | 1 |