| CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | May 23, 2025 | Automatic Speech Recognition, Emotion Recognition | Code Available | 11 |
| FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs | Jul 4, 2024 | Emotion Recognition, Event Detection | Code Available | 11 |
| SocialED: A Python Library for Social Event Detection | Dec 18, 2024 | CPU, Event Detection | Code Available | 4 |
| Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space | Dec 14, 2024 | Event Detection | Code Available | 4 |
| Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Feb 9, 2024 | Event Detection, Hate Speech Detection | Code Available | 4 |
| OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia | Jan 23, 2025 | Emotion Recognition, Event Detection | Code Available | 3 |
| BERTrend: Neural Topic Modeling for Emerging Trends Detection | Nov 8, 2024 | Event Detection | Code Available | 2 |
| An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains | Oct 5, 2024 | Diagnostic, Event Detection | Code Available | 2 |
| Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection | Sep 26, 2024 | Event Detection, Representation Learning | Code Available | 2 |
| MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection | Aug 16, 2024 | Event Detection, Sound Event Detection | Code Available | 2 |
| Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection | Mar 27, 2024 | Data Augmentation, Domain Adaptation | Code Available | 2 |
| Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-making | Nov 18, 2023 | Autonomous Driving, Decision Making | Code Available | 2 |
| OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding | Sep 25, 2023 | Event Argument Extraction, Event Detection | Code Available | 2 |
| The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation | Jun 12, 2023 | Event Argument Extraction, Event Detection | Code Available | 2 |
| WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research | Mar 30, 2023 | Audio captioning, Event Detection | Code Available | 2 |
| Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer | Jul 28, 2022 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio Classification, Event Detection | Code Available | 2 |
| BirdNET: A deep learning solution for avian diversity monitoring | Jan 27, 2021 | Data Augmentation, Deep Learning | Code Available | 2 |
| F^3Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos | Apr 11, 2025 | Action Understanding, Event Detection | Code Available | 1 |
| Explainable AI Components for Narrative Map Extraction | Mar 19, 2025 | Event Detection, Explainable artificial intelligence | Code Available | 1 |
| Exploring Performance-Complexity Trade-Offs in Sound Event Detection Models | Mar 14, 2025 | Audio Tagging, Event Detection | Code Available | 1 |
| EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision | Dec 3, 2024 | Event-based vision, Event Detection | Code Available | 1 |
| ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Nov 3, 2024 | Autonomous Driving, Benchmarking | Code Available | 1 |
| Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection | Nov 2, 2024 | Audio Source Separation, Event Detection | Code Available | 1 |
| SeisLM: a Foundation Model for Seismic Waveforms | Oct 21, 2024 | Event Detection, Language Modeling | Code Available | 1 |
| Exploring Text-Queried Sound Event Detection with Audio Source Separation | Sep 20, 2024 | Audio Source Separation, Event Detection | Code Available | 1 |
| Effective Pre-Training of Audio Transformers for Sound Event Detection | Sep 14, 2024 | Data Augmentation, Event Detection | Code Available | 1 |
| Unsupervised Episode Detection for Large-Scale News Events | Aug 9, 2024 | Articles, Event Detection | Code Available | 1 |
| Multi-Iteration Multi-Stage Fine-Tuning of Transformers for Sound Event Detection with Heterogeneous Datasets | Jul 17, 2024 | Event Detection, Sound Event Detection | Code Available | 1 |
| Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage Training | Jul 17, 2024 | Event Detection, Missing Labels | Code Available | 1 |
| WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System | Jul 4, 2024 | Event Detection, Language Modeling | Code Available | 1 |
| Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes | Jun 22, 2024 | Change Detection, Data Augmentation | Code Available | 1 |
| Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic Convolution | Jun 19, 2024 | Event Detection, Sound Event Detection | Code Available | 1 |
| Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection | Jun 8, 2024 | Event Detection, Sound Event Detection | Code Available | 1 |
| Sound Event Bounding Boxes | Jun 6, 2024 | Change Detection, Event Detection | Code Available | 1 |
| Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation | May 16, 2024 | AudioCaps, Event Detection | Code Available | 1 |
| UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization | Apr 4, 2024 | Action Localization, audio-visual event localization | Code Available | 1 |
| Cloud gap-filling with deep learning for improved grassland monitoring | Mar 14, 2024 | Deep Learning, Event Detection | Code Available | 1 |
| Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection | Jan 10, 2024 | Event Detection, Sound Event Detection | Code Available | 1 |
| Hierarchical and Incremental Structural Entropy Minimization for Unsupervised Social Event Detection | Dec 19, 2023 | Event Detection, Graph Neural Network | Code Available | 1 |
| Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection | Dec 14, 2023 | Data Augmentation, Event Detection | Code Available | 1 |
| w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training | Dec 12, 2023 | Event Detection, Sound Event Detection | Code Available | 1 |
| Event Detection in Time Series: Universal Deep Learning Approach | Nov 27, 2023 | Binary Classification, Deep Learning | Code Available | 1 |
| MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation | Nov 15, 2023 | All, Event Argument Extraction | Code Available | 1 |
| A Comprehensive Python Library for Deep Learning-Based Event Detection in Multivariate Time Series Data and Information Retrieval in NLP | Oct 25, 2023 | Binary Classification, Deep Learning | Code Available | 1 |
| Fine-tune the pretrained ATST model for sound event detection | Sep 15, 2023 | Event Detection, Self-Supervised Learning | Code Available | 1 |
| Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning | Sep 2, 2023 | Contrastive Learning, Data Augmentation | Code Available | 1 |
| CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference | Aug 7, 2023 | Event Detection, Language Modeling | Code Available | 1 |
| Post-Processing Independent Evaluation of Sound Event Detection Systems | Jun 27, 2023 | Event Detection, Sound Event Detection | Code Available | 1 |
| Sentence-level Event Detection without Triggers via Prompt Learning and Machine Reading Comprehension | Jun 25, 2023 | Event Detection, Machine Reading Comprehension | Code Available | 1 |