SyntAct: A Synthesized Database of Basic Emotions Jun 1, 2022 Emotion Recognition Speech Emotion Recognition
— Unverified 0Synthesizing Audio for Hindi WordNet Jan 1, 2018 Speech Synthesis
— Unverified 0Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition Oct 21, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Audio Deepfake Attribution: An Initial Dataset and Investigation Aug 21, 2022 Audio Generation Binary Classification
— Unverified 0Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System Nov 1, 2022 Face Generation Speech Synthesis
— Unverified 0TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos Nov 19, 2020 speech-recognition Speech Recognition
— Unverified 0Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages Nov 1, 2022 Chunking Rhythm
— Unverified 0Text-aware and Context-aware Expressive Audiobook Speech Synthesis Jun 9, 2024 Contrastive Learning Language Modeling
— Unverified 0Text-free non-parallel many-to-many voice conversion using normalising flows Mar 15, 2022 Normalising Flows Speech Synthesis
— Unverified 0Text Generation with Speech Synthesis for ASR Data Augmentation May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis Mar 27, 2023 All Automatic Speech Recognition
— Unverified 0Text Normalization and Unit Selection for a Memory Based Non Uniform Unit Selection TTS in Malayalam Dec 1, 2015 Speech Synthesis Text Normalization
— Unverified 0Texto4Science: a Quebec French Database of Annotated Short Text Messages May 1, 2012 Speech Synthesis Text-To-Speech Synthesis
— Unverified 0Text-To-Speech for Languages without an Orthography Dec 1, 2012 Speech Synthesis text-to-speech
— Unverified 0Text-to-Speech Pipeline for Swiss German -- A comparison May 31, 2023 Speech Synthesis text-to-speech
— Unverified 0Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder Dec 16, 2022 Representation Learning Speech Synthesis
— Unverified 0Text-To-Speech Synthesis In The Wild Sep 13, 2024 Benchmarking Speaker Recognition
— Unverified 0ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech Nov 5, 2019 Person Recognition Speaker Verification
— Unverified 0The AV-LASYN Database : A synchronous corpus of audio and 3D facial marker data for audio-visual laughter synthesis May 1, 2014 Dimensionality Reduction Speech Synthesis
— Unverified 0The Cohort and Speechify Libraries for Rapid Construction of Speech Enabled Applications for Android Sep 1, 2015 Action Detection Speech Recognition
— Unverified 0The DeepZen Speech Synthesis System for Blizzard Challenge 2023 Aug 30, 2023 Sentence Speech Synthesis
— Unverified 0The Deterministic plus Stochastic Model of the Residual Signal and its Applications Dec 29, 2019 Speaker Identification Speech Synthesis
— Unverified 0The Development of the Multilingual LUNA Corpus for Spoken Language System Porting May 1, 2014 Machine Translation Speech Synthesis
— Unverified 0The dramatic piece reader for the blind and visually impaired Aug 1, 2013 Speech Synthesis
— Unverified 0The FruitShell French synthesis system at the Blizzard 2023 Challenge Sep 1, 2023 Data Augmentation Speech Synthesis
— Unverified 0The Future of Spoken Dialogue Systems is in their Past: Long-Term Adaptive, Conversational Assistants Jun 1, 2012 Language Modelling Speech Recognition
— Unverified 0The Herme Database of Spontaneous Multimodal Human-Robot Dialogues May 1, 2012 Gesture Recognition Speech Recognition
— Unverified 0Towards Developing State-of-the-Art TTS Synthesisers for 13 Indian Languages with Signal Processing aided Alignments Oct 31, 2022 Speech Synthesis
— Unverified 0The InproTK 2012 release Jun 1, 2012 Dialogue Management Speech Recognition
— Unverified 0The MMASCS multi-modal annotated synchronous corpus of audio, video, facial motion and tongue motion data of normal, fast and slow speech May 1, 2014 Speech Synthesis
— Unverified 0The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance Apr 11, 2022 Speaker Verification Speech Synthesis
— Unverified 0The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement Nov 14, 2022 Data Augmentation Speech Enhancement
— Unverified 0The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge Jul 24, 2015 Speaker Verification Speech Synthesis
— Unverified 0The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach Oct 14, 2019 Expressive Speech Synthesis Sociology
— Unverified 0The Virtual Doctor: An Interactive Artificial Intelligence based on Deep Learning for Non-Invasive Prediction of Diabetes Mar 9, 2019 Prognosis speech-recognition
— Unverified 01000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis Jun 17, 2024 Diversity Speech Synthesis
— Unverified 0The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units Oct 12, 2020 Speech Synthesis
— Unverified 0TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition Aug 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis Jun 13, 2024 Quantization Speech Synthesis
— Unverified 0Toward accessible comics for blind and low vision readers Jul 11, 2024 Optical Character Recognition Prompt Engineering
— Unverified 0PromptTTS: Controllable Text-to-Speech with Text Descriptions Nov 22, 2022 Decoder Speech Synthesis
Code Code Available 0Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity Nov 2, 2021 Cross-Lingual Transfer speech-recognition
Code Code Available 0Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging Jul 12, 2021 Prediction Speech Synthesis
Code Code Available 0Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale Jun 23, 2023 In-Context Learning Speech Synthesis
Code Code Available 0Tools and resources for Romanian text-to-speech and speech-to-text applications Feb 15, 2018 speech-recognition Speech Recognition
Code Code Available 0Jejueo Datasets for Machine Translation and Speech Synthesis Nov 27, 2019 Machine Translation Speech Synthesis
Code Code Available 0A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution Oct 27, 2022 Speech Synthesis
Code Code Available 0ChatGPT in the context of precision agriculture data analytics Nov 10, 2023 Language Modelling speech-recognition
Code Code Available 0CaloFlow II: Even Faster and Still Accurate Generation of Calorimeter Showers with Normalizing Flows Oct 21, 2021 Speech Synthesis
Code Code Available 0Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement
Code Code Available 0