| CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | May 23, 2025 | Automatic Speech Recognition, Emotion Recognition | Code Available | 11 |
| FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs | Jul 4, 2024 | Emotion Recognition, Event Detection | Code Available | 11 |
| Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space | Dec 14, 2024 | Event Detection | Code Available | 4 |
| Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Feb 9, 2024 | Event Detection, Hate Speech Detection | Code Available | 4 |
| SocialED: A Python Library for Social Event Detection | Dec 18, 2024 | CPU, Event Detection | Code Available | 4 |
| OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia | Jan 23, 2025 | Emotion Recognition, Event Detection | Code Available | 3 |
| Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection | Sep 26, 2024 | Event Detection, Representation Learning | Code Available | 2 |
| An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains | Oct 5, 2024 | Diagnostic, Event Detection | Code Available | 2 |
| Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection | Mar 27, 2024 | Data Augmentation, Domain Adaptation | Code Available | 2 |
| WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research | Mar 30, 2023 | Audio captioning, Event Detection | Code Available | 2 |
| BERTrend: Neural Topic Modeling for Emerging Trends Detection | Nov 8, 2024 | Event Detection | Code Available | 2 |
| The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation | Jun 12, 2023 | Event Argument Extraction, Event Detection | Code Available | 2 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio Classification, Event Detection | Code Available | 2 |
| BirdNET: A deep learning solution for avian diversity monitoring | Jan 27, 2021 | Data Augmentation, Deep Learning | Code Available | 2 |
| MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection | Aug 16, 2024 | Event Detection, Sound Event Detection | Code Available | 2 |
| OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding | Sep 25, 2023 | Event Argument Extraction, Event Detection | Code Available | 2 |
| Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer | Jul 28, 2022 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-making | Nov 18, 2023 | Autonomous Driving, Decision Making | Code Available | 2 |
| Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection | Jan 10, 2024 | Event Detection, Sound Event Detection | Code Available | 1 |
| Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection | Mar 29, 2022 | Event Detection, Sound Event Detection | Code Available | 1 |
| Fusing Event-based and RGB camera for Robust Object Detection in Adverse Conditions | Mar 30, 2022 | 3D Object Detection, Adversarial Attack | Code Available | 1 |
| Fine-tune the pretrained ATST model for sound event detection | Sep 15, 2023 | Event Detection, Self-Supervised Learning | Code Available | 1 |
| Few-shot bioacoustic event detection at the DCASE 2023 challenge | Jun 15, 2023 | Event Detection, Few-Shot Learning | Code Available | 1 |
| F^3Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos | Apr 11, 2025 | Action Understanding, Event Detection | Code Available | 1 |
| Few-Shot Event Detection with Prototypical Amortized Conditional Random Field | Dec 4, 2020 | Event Detection | Code Available | 1 |
| FilterAugment: An Acoustic Environmental Data Augmentation Method | Oct 7, 2021 | Data Augmentation, Event Detection | Code Available | 1 |
| Frequency Dependent Sound Event Detection for DCASE 2022 Challenge Task 4 | Jun 23, 2022 | Event Detection, Sound Event Detection | Code Available | 1 |
| A Comprehensive Python Library for Deep Learning-Based Event Detection in Multivariate Time Series Data and Information Retrieval in NLP | Oct 25, 2023 | Binary Classification, Deep Learning | Code Available | 1 |
| Few-shot Event Detection: An Empirical Study and a Unified View | May 3, 2023 | Event Detection | Code Available | 1 |
| Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-supervised Sound Event Detection | Mar 11, 2021 | Event Detection, Sound Event Detection | Code Available | 1 |
| Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection | Dec 14, 2023 | Data Augmentation, Event Detection | Code Available | 1 |
| EventGraph: Event Extraction as Semantic Graph Parsing | Oct 16, 2022 | Event Detection, Event Extraction | Code Available | 1 |
| Enhanced Language Representation with Label Knowledge for Span Extraction | Nov 1, 2021 | Event Detection, NER | Code Available | 1 |
| EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision | Dec 3, 2024 | Event-based vision, Event Detection | Code Available | 1 |
| Effective Pre-Training of Audio Transformers for Sound Event Detection | Sep 14, 2024 | Data Augmentation, Event Detection | Code Available | 1 |
| DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection | Jun 29, 2021 | Audio Classification, Direction of Arrival Estimation | Code Available | 1 |
| Deep Learning in Latent Space for Video Prediction and Compression | Jun 19, 2021 | Anomaly Detection, Deep Learning | Code Available | 1 |
| Embed2Detect: Temporally Clustered Embedded Words for Event Detection in Social Media | Jun 10, 2020 | Clustering, Event Detection | Code Available | 1 |
| Explainable AI Components for Narrative Map Extraction | Mar 19, 2025 | Event Detection, Explainable artificial intelligence | Code Available | 1 |
| DCASENET: A joint pre-trained deep neural network for detecting and classifying acoustic scenes and events | Sep 21, 2020 | Acoustic Scene Classification, Audio Tagging | Code Available | 1 |
| Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection | Jun 8, 2024 | Event Detection, Sound Event Detection | Code Available | 1 |
| A dataset for Audio-Visual Sound Event Detection in Movies | Feb 14, 2023 | Event Detection, Self-Driving Cars | Code Available | 1 |
| End-to-End Neural Event Coreference Resolution | Sep 17, 2020 | coreference-resolution, Coreference Resolution | Code Available | 1 |
| A Deep Learning Approach for the Segmentation of Electroencephalography Data in Eye Tracking Applications | Jun 17, 2022 | EEG, Electroencephalogram (EEG) | Code Available | 1 |
| ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection | Oct 29, 2020 | Event Detection, Sound Event Detection | Code Available | 1 |
| Edge-Enhanced Graph Convolution Networks for Event Detection with Syntactic Relation | Feb 25, 2020 | Event Detection, Relation | Code Available | 1 |
| Exploring Text-Queried Sound Event Detection with Audio Source Separation | Sep 20, 2024 | Audio Source Separation, Event Detection | Code Available | 1 |
| CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference | Aug 7, 2023 | Event Detection, Language Modeling | Code Available | 1 |
| Few-shot bioacoustic event detection at the DCASE 2022 challenge | Jul 14, 2022 | Event Detection, Sound Event Detection | Code Available | 1 |
| CNN Architectures for Large-Scale Audio Classification | Sep 29, 2016 | Audio Classification, Event Detection | Code Available | 1 |