| Moshi: a speech-text foundation model for real-time dialogue | Sep 17, 2024 | Action Detection, Activity Detection | Code Available | 9 |
| pyannote.audio: neural building blocks for speaker diarization | Nov 4, 2019 | Action Detection, Activity Detection | Code Available | 3 |
| audino: A Modern Annotation Tool for Audio and Speech | Jun 9, 2020 | Action Detection, Activity Detection | Code Available | 2 |
| Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm | Jun 3, 2025 | Action Detection, Activity Detection | Code Available | 1 |
| VANPY: Voice Analysis Framework | Feb 17, 2025 | Action Detection, Activity Detection | Code Available | 1 |
| WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network | Dec 19, 2024 | Action Detection, Action Recognition | Code Available | 1 |
| InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation | Jun 6, 2024 | Action Detection, Activity Detection | Code Available | 1 |
| Online speaker diarization of meetings guided by speech separation | Jan 30, 2024 | Action Detection, Activity Detection | Code Available | 1 |
| ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development | Jul 17, 2023 | Action Detection, Activity Detection | Code Available | 1 |
| TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings | Mar 7, 2023 | Action Detection, Activity Detection | Code Available | 1 |
| Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization | Nov 12, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| SG-VAD: Stochastic Gates Based Speech Activity Detection | Oct 28, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| Multitask Detection of Speaker Changes, Overlapping Speech and Voice Activity Using wav2vec 2.0 | Oct 26, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation | Oct 24, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| MM-ALT: A Multimodal Automatic Lyric Transcription System | Jul 13, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vessels | Jul 12, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering | Jun 27, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| Low-Latency Speech Separation Guided Diarization for Telephone Conversations | Apr 5, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| HGCN: Harmonic gated compensation network for speech enhancement | Jan 30, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| NAS-VAD: Neural Architecture Search for Voice Activity Detection | Jan 22, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| Exploiting Temporal Side Information in Massive IoT Connectivity | Jan 5, 2022 | Action Detection, Activity Detection | Code Available | 1 |
| X-Vector based voice activity detection for multi-genre broadcast speech-to-text | Dec 9, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-Occurrence | Nov 2, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications | Oct 12, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| Classification of Abnormal Hand Movement for Aiding in Autism Detection: Machine Learning Study | Aug 18, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments | Jun 13, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| End-to-end speaker segmentation for overlap-aware resegmentation | Apr 8, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| Learning spectro-temporal representations of complex sounds with parameterized neural networks | Mar 12, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| A Hybrid CNN-BiLSTM Voice Activity Detector | Mar 5, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| ROAD: The ROad event Awareness Dataset for Autonomous Driving | Feb 23, 2021 | Action Detection, Activity Detection | Code Available | 1 |
| AV Taris: Online Audio-Visual Speech Recognition | Dec 14, 2020 | Action Detection, Activity Detection | Code Available | 1 |
| VoxLingua107: a Dataset for Spoken Language Recognition | Nov 25, 2020 | Action Detection, Activity Detection | Code Available | 1 |
| Harvesting Ambient RF for Presence Detection Through Deep Learning | Feb 13, 2020 | Action Detection, Activity Detection | Code Available | 1 |
| An End-to-End Architecture for Keyword Spotting and Voice Activity Detection | Nov 28, 2016 | Action Detection, Activity Detection | Code Available | 1 |
| CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment | Jun 25, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Distributed Activity Detection for Cell-Free Hybrid Near-Far Field Communications | Jun 17, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion | Jun 2, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Joint Activity Detection and Channel Estimation for Massive Connectivity: Where Message Passing Meets Score-Based Generative Priors | May 31, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM | May 29, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Robust Activity Detection for Massive Random Access | May 21, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Improving endpoint detection in end-to-end streaming ASR for conversational speech | May 19, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Multi-Stage Speaker Diarization for Noisy Classrooms | May 16, 2025 | Action Detection, Activity Detection | Code Available | 0 |
| MicroNAS: An Automated Framework for Developing a Fall Detection System | Apr 10, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Fast MLE and MAPE-Based Device Activity Detection for Grant-Free Access via PSCA and PSCA-Net | Mar 19, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Federated Learning for Secure and Efficient Device Activity Detection in mMTC Networks | Mar 14, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Lightweight Learning for Grant-Free Activity Detection in Cell-Free Massive MIMO Networks | Mar 14, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Robust Learning-Based Sparse Recovery for Device Activity Detection in Grant-Free Random Access Cell-Free Massive MIMO: Enhancing Resilience to Impairments | Mar 13, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors | Mar 4, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Optimizing Large Language Models for ESG Activity Detection in Financial Texts | Feb 28, 2025 | Action Detection, Activity Detection | Code Available | 0 |
| Mixture of Experts-augmented Deep Unfolding for Activity Detection in IRS-aided Systems | Feb 27, 2025 | Action Detection, Activity Detection | Unverified | 0 |