| Moshi: a speech-text foundation model for real-time dialogue | Sep 17, 2024 | Action DetectionActivity Detection | CodeCode Available | 9 |
| pyannote.audio: neural building blocks for speaker diarization | Nov 4, 2019 | Action DetectionActivity Detection | CodeCode Available | 3 |
| audino: A Modern Annotation Tool for Audio and Speech | Jun 9, 2020 | Action DetectionActivity Detection | CodeCode Available | 2 |
| Exploiting Temporal Side Information in Massive IoT Connectivity | Jan 5, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| NAS-VAD: Neural Architecture Search for Voice Activity Detection | Jan 22, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Harvesting Ambient RF for Presence Detection Through Deep Learning | Feb 13, 2020 | Action DetectionActivity Detection | CodeCode Available | 1 |
| WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments | Jun 13, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Low-Latency Speech Separation Guided Diarization for Telephone Conversations | Apr 5, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization | Nov 12, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| VoxLingua107: a Dataset for Spoken Language Recognition | Nov 25, 2020 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering | Jun 27, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| ROAD: The ROad event Awareness Dataset for Autonomous Driving | Feb 23, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| A Hybrid CNN-BiLSTM Voice Activity Detector | Mar 5, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation | Jun 6, 2024 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation | Oct 24, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| AV Taris: Online Audio-Visual Speech Recognition | Dec 14, 2020 | Action DetectionActivity Detection | CodeCode Available | 1 |
| HGCN: Harmonic gated compensation network for speech enhancement | Jan 30, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Learning spectro-temporal representations of complex sounds with parameterized neural networks | Mar 12, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Classification of Abnormal Hand Movement for Aiding in Autism Detection: Machine Learning Study | Aug 18, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Online speaker diarization of meetings guided by speech separation | Jan 30, 2024 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm | Jun 3, 2025 | Action DetectionActivity Detection | CodeCode Available | 1 |
| VANPY: Voice Analysis Framework | Feb 17, 2025 | Action DetectionActivity Detection | CodeCode Available | 1 |
| X-Vector based voice activity detection for multi-genre broadcast speech-to-text | Dec 9, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| End-to-end speaker segmentation for overlap-aware resegmentation | Apr 8, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings | Mar 7, 2023 | Action DetectionActivity Detection | CodeCode Available | 1 |
| SG-VAD: Stochastic Gates Based Speech Activity Detection | Oct 28, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| MM-ALT: A Multimodal Automatic Lyric Transcription System | Jul 13, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Multitask Detection of Speaker Changes, Overlapping Speech and Voice Activity Using wav2vec 2.0 | Oct 26, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network | Dec 19, 2024 | Action DetectionAction Recognition | CodeCode Available | 1 |
| ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development | Jul 17, 2023 | Action DetectionActivity Detection | CodeCode Available | 1 |
| A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vessels | Jul 12, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-Occurrence | Nov 2, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications | Oct 12, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |
| An End-to-End Architecture for Keyword Spotting and Voice Activity Detection | Nov 28, 2016 | Action DetectionActivity Detection | CodeCode Available | 1 |
| An Acoustic Emission Activity Detection Method based on Short-Term Waveform Features: Application to Metallic Components under Uniaxial Tensile Test | Jun 26, 2019 | Action DetectionActivity Detection | —Unverified | 0 |
| Algorithm Unrolling for Massive Access via Deep Neural Network with Theoretical Guarantee | Jun 19, 2021 | Action DetectionActivity Detection | —Unverified | 0 |
| Access Delay Constrained Activity Detection in Massive Random Access | Nov 4, 2021 | Action DetectionActivity Detection | —Unverified | 0 |
| A Hybrid Graph Network for Complex Activity Detection in Video | Oct 26, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| Activity Detection for Grant-Free NOMA in Massive IoT Networks | Dec 23, 2022 | Action DetectionActivity Detection | —Unverified | 0 |
| AAD: Adaptive Anomaly Detection through traffic surveillance videos | Aug 29, 2018 | Action DetectionActivity Detection | —Unverified | 0 |
| Activity Detection And Modeling Using Smart Meter Data: Concept And Case Studies | Oct 26, 2020 | Action DetectionActivity Detection | —Unverified | 0 |
| A Flexible Framework for Grant-Free Random Access in Cell-Free Massive MIMO Systems | Nov 14, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion | Jun 2, 2025 | Action DetectionActivity Detection | —Unverified | 0 |
| Active Privacy-Utility Trade-off Against Inference in Time-Series Data Sharing | Feb 11, 2022 | Action DetectionActivity Detection | —Unverified | 0 |
| Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals | Jan 14, 2022 | Action DetectionActivity Detection | —Unverified | 0 |
| Advanced Image Segmentation Techniques for Neural Activity Detection via C-fos Immediate Early Gene Expression | Dec 13, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| Accelerating Coordinate Descent via Active Set Selection for Device Activity Detection for Multi-Cell Massive Random Access | Apr 27, 2021 | Action DetectionActivity Detection | —Unverified | 0 |
| A Time-Frequency based Suspicious Activity Detection for Anti-Money Laundering | Nov 17, 2020 | Action DetectionActivity Detection | —Unverified | 0 |
| Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence | Apr 18, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments | Oct 6, 2020 | Action DetectionActivity Detection | —Unverified | 0 |