CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training May 23, 2025 Automatic Speech Recognition Emotion Recognition
Code Code Available 11OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia Jan 23, 2025 Emotion Recognition Event Detection
Code Code Available 3EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark Jun 11, 2024 Cross-corpus Emotion Recognition
Code Code Available 3emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation Dec 23, 2023 Emotion Recognition Self-Supervised Learning
Code Code Available 3Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All
Code Code Available 3EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification May 26, 2025 Emotion Recognition regression
Code Code Available 2BLSP-Emo: Towards Empathetic Large Speech-Language Models Jun 6, 2024 Emotion Recognition Instruction Following
Code Code Available 2EMO-SUPERB: An In-depth Look at Speech Emotion Recognition Feb 20, 2024 Emotion Recognition Self-Supervised Learning
Code Code Available 2LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Oct 7, 2023 Audio captioning Automatic Speech Recognition
Code Code Available 2Dawn of the transformer era in speech emotion recognition: closing the valence gap Mar 14, 2022 Cross-corpus Emotion Recognition
Code Code Available 2AST: Audio Spectrogram Transformer Apr 5, 2021 Audio Classification Audio Tagging
Code Code Available 2Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought Feb 25, 2025 Emotion Recognition Language Modeling
Code Code Available 1SigWavNet: Learning Multiresolution Signal Wavelet Network for Speech Emotion Recognition Feb 1, 2025 Denoising Emotion Recognition
Code Code Available 1SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition Aug 14, 2024 Automatic Speech Recognition Benchmarking
Code Code Available 1Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results Jun 20, 2024 Attribute Emotion Recognition
Code Code Available 1Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer Mar 26, 2024 Emotion Recognition Speech Emotion Recognition
Code Code Available 1emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition Mar 21, 2024 Emotion Recognition Neural Architecture Search
Code Code Available 1Speech Emotion Recognition Via CNN-Transformer and Multidimensional Attention Mechanism Mar 7, 2024 Emotion Recognition Speech Emotion Recognition
Code Code Available 1Frame-level emotional state alignment method for speech emotion recognition Dec 27, 2023 Emotion Recognition Speech Emotion Recognition
Code Code Available 1Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection Aug 7, 2023 Continual Learning Emotion Recognition
Code Code Available 1Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition Aug 4, 2023 Cross-corpus Domain Adaptation
Code Code Available 1Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition Jul 20, 2023 Emotion Recognition Speech Emotion Recognition
Code Code Available 1Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition Jun 26, 2023 Data Augmentation Emotion Recognition
Code Code Available 1Speech Emotion Diarization: Which Emotion Appears When? Jun 22, 2023 Emotion Recognition speaker-diarization
Code Code Available 1Enhancing Speech Emotion Recognition Through Differentiable Architecture Search May 23, 2023 Emotion Recognition Neural Architecture Search
Code Code Available 1A vector quantized masked autoencoder for speech emotion recognition Apr 21, 2023 Emotion Recognition Self-Supervised Learning
Code Code Available 1DWFormer: Dynamic Window transFormer for Speech Emotion Recognition Mar 3, 2023 Emotion Recognition Speech Emotion Recognition
Code Code Available 1SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing Feb 27, 2023 Alzheimer's Disease Detection Emotion Recognition
Code Code Available 1EmoGator: A New Open Source Vocal Burst Dataset with Baseline Machine Learning Classification Methodologies Jan 2, 2023 Emotion Recognition Speech Emotion Recognition
Code Code Available 1Large Raw Emotional Dataset with Aggregation Mechanism Dec 23, 2022 Emotion Recognition Speech Emotion Recognition
Code Code Available 1A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora Nov 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition Nov 14, 2022 Speech Emotion Recognition
Code Code Available 1SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers Nov 4, 2022 Cross-corpus Emotion Recognition
Code Code Available 1GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition Oct 28, 2022 Emotion Recognition Representation Learning
Code Code Available 1Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation Apr 12, 2022 Emotion Recognition Speech Emotion Recognition
Code Code Available 1MMER: Multimodal Multi-task Learning for Speech Emotion Recognition Mar 31, 2022 Emotion Recognition Multimodal Emotion Recognition
Code Code Available 1Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information Mar 29, 2022 Emotion Recognition Speech Emotion Recognition
Code Code Available 1SepTr: Separable Transformer for Audio Spectrogram Processing Mar 17, 2022 Audio Classification Speech Emotion Recognition
Code Code Available 1Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling Mar 15, 2022 Emotion Recognition Federated Learning
Code Code Available 1Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning Feb 5, 2022 Emotion Recognition Federated Learning
Code Code Available 1A proposal for Multimodal Emotion Recognition using aural transformers and Action Units on RAVDESS dataset Dec 30, 2021 Autonomous Driving Emotion Recognition
Code Code Available 1Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings Dec 26, 2021 Attribute Emotion Recognition
Code Code Available 1Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition Oct 12, 2021 Emotion Recognition Speech Emotion Recognition
Code Code Available 1Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset Oct 9, 2021 Deep Learning Emotion Recognition
Code Code Available 1SERAB: A multi-lingual benchmark for speech emotion recognition Oct 7, 2021 Benchmarking Emotion Recognition
Code Code Available 1Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition Oct 7, 2021 Emotion Recognition Speech Emotion Recognition
Code Code Available 1Speech Emotion Recognition with Multi-Task Learning Sep 6, 2021 Emotion Classification Emotion Recognition
Code Code Available 1Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention Jun 8, 2021 Emotion Classification Emotion Recognition
Code Code Available 1Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings Apr 8, 2021 Emotion Recognition Speech Emotion Recognition
Code Code Available 1EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition Mar 10, 2021 Emotion Recognition Speech Emotion Recognition
Code Code Available 1