Activity Detection

Detecting activities in extended videos.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 380 papers

Title	Date	Tasks	Status	Hype
In-Ear-Voice: Towards Milli-Watt Audio Enhancement With Bone-Conduction Microphones for In-Ear Sensing Platforms	Sep 5, 2023	Action DetectionActivity Detection	—Unverified	0
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023	Aug 15, 2023	Action DetectionActivity Detection	—Unverified	0
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations	Aug 14, 2023	Action DetectionActivity Detection	CodeCode Available	0
An enhanced system for the detection and active cancellation of snoring signals	Jul 31, 2023	Action DetectionActivity Detection	—Unverified	0
ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development	Jul 17, 2023	Action DetectionActivity Detection	CodeCode Available	1
Long-term Conversation Analysis: Exploring Utility and Privacy	Jun 28, 2023	Action DetectionActivity Detection	CodeCode Available	0
Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features	Jun 7, 2023	Action DetectionActivity Detection	—Unverified	0
Parallel Neurosymbolic Integration with Concordia	Jun 1, 2023	Action DetectionActivity Detection	—Unverified	0
SVVAD: Personal Voice Activity Detection for Speaker Verification	May 31, 2023	Action DetectionActivity Detection	—Unverified	0
Building Accurate Low Latency ASR for Streaming Voice Search	May 29, 2023	Action DetectionActivity Detection	—Unverified	0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access	May 21, 2023	Action DetectionActivity Detection	—Unverified	0
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction	May 21, 2023	Action DetectionActivity Detection	—Unverified	0
FunASR: A Fundamental End-to-End Speech Recognition Toolkit	May 18, 2023	Action DetectionActivity Detection	—Unverified	0
Deep Learning for Asynchronous Massive Access with Data Frame Length Diversity	May 12, 2023	Action DetectionActivity Detection	—Unverified	0
Joint Activity Detection and Channel Estimation for Clustered Massive Machine Type Communications	May 4, 2023	Action DetectionActivity Detection	—Unverified	0
Cooperative Multi-Cell Massive Access with Temporally Correlated Activity	Apr 19, 2023	Action DetectionActivity Detection	—Unverified	0
Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence	Apr 18, 2023	Action DetectionActivity Detection	—Unverified	0
Grant-free Massive Random Access with Retransmission: Receiver Optimization and Performance Analysis	Apr 12, 2023	Action DetectionActivity Detection	—Unverified	0
Evaluation of Noise Reduction Methods for Sentence Recognition by Sinhala Speaking Listeners	Mar 31, 2023	Action DetectionActivity Detection	CodeCode Available	0
Better Together: Dialogue Separation and Voice Activity Detection for Audio Personalization in TV	Mar 23, 2023	Action DetectionActivity Detection	—Unverified	0
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations	Mar 21, 2023	Action DetectionActivity Detection	—Unverified	0
A processing framework to access large quantities of whispered speech found in ASMR	Mar 13, 2023	Action DetectionActivity Detection	—Unverified	0
Multi-Task Sub-Band Network For Deep Residual Echo Suppression	Mar 11, 2023	Action DetectionActivity Detection	—Unverified	0
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings	Mar 7, 2023	Action DetectionActivity Detection	CodeCode Available	1
Improving Transformer-based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads	Mar 2, 2023	Action DetectionActivity Detection	—Unverified	0
Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation	Feb 20, 2023	Action DetectionActivity Detection	—Unverified	0
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description	Jan 17, 2023	Action DetectionActivity Detection	—Unverified	0
KIDS: kinematics-based (in)activity detection and segmentation in a sleep case study	Jan 4, 2023	Action DetectionActivity Detection	—Unverified	0
Activity Detection for Grant-Free NOMA in Massive IoT Networks	Dec 23, 2022	Action DetectionActivity Detection	—Unverified	0
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks	Dec 14, 2022	Action DetectionActivity Detection	—Unverified	0
Trajectory-User Linking Is Easier Than You Think	Dec 14, 2022	Action DetectionActivity Detection	—Unverified	0
BC-VAD: A Robust Bone Conduction Voice Activity Detection	Dec 6, 2022	Action DetectionActivity Detection	—Unverified	0
Proximal Gradient-Based Unfolding for Massive Random Access in IoT Networks	Dec 4, 2022	Action DetectionActivity Detection	—Unverified	0
Joint Estimation of Clustered User Activity and Correlated Channels with Unknown Covariance in mMTC	Nov 30, 2022	Action DetectionActivity Detection	—Unverified	0
Multi-timescale Event Detection in Nonintrusive Load Monitoring based on MDL Principle	Nov 19, 2022	Action DetectionActivity Detection	—Unverified	0
Token Turing Machines	Nov 16, 2022	Action DetectionActivity Detection	—Unverified	0
On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches	Nov 16, 2022	Action DetectionActivity Detection	—Unverified	0
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization	Nov 12, 2022	Action DetectionActivity Detection	CodeCode Available	1
Two-stream Multi-dimensional Convolutional Network for Real-time Violence Detection	Nov 8, 2022	Action DetectionActivity Detection	—Unverified	0
OFDM-Based Massive Connectivity for LEO Satellite Internet of Things	Oct 31, 2022	Action DetectionActivity Detection	—Unverified	0
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction	Oct 28, 2022	Action DetectionActivity Detection	—Unverified	0
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition	Oct 28, 2022	Action DetectionActivity Detection	—Unverified	0
SG-VAD: Stochastic Gates Based Speech Activity Detection	Oct 28, 2022	Action DetectionActivity Detection	CodeCode Available	1
Multitask Detection of Speaker Changes, Overlapping Speech and Voice Activity Using wav2vec 2.0	Oct 26, 2022	Action DetectionActivity Detection	CodeCode Available	1
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge	Oct 26, 2022	Action DetectionActivity Detection	—Unverified	0
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation	Oct 24, 2022	Action DetectionActivity Detection	CodeCode Available	1
Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization	Oct 14, 2022	Action DetectionActive Speaker Detection	—Unverified	0
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022	Oct 4, 2022	Action DetectionActivity Detection	—Unverified	0
Learnable Acoustic Frontends in Bird Activity Detection	Oct 3, 2022	Action DetectionActivity Detection	—Unverified	0
Signed Latent Factors for Spamming Activity Detection	Sep 28, 2022	Action DetectionActivity Detection	—Unverified	0

Show:10 25 50

← PrevPage 3 of 8Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN-BiLSTM_best	ROC-AUC	95.14	—	Unverified
2	CNN-BiLSTM_small	ROC-AUC	95.13	—	Unverified
3	SG-VAD (ours)	ROC-AUC	94.3	—	Unverified
4	ADA-VAD	ROC-AUC	79.1	—	Unverified