Speech-to-Text

Papers

Showing 251–300 of 403 papers

Title	Date	Tasks	Status	Hype
Training end-to-end speech-to-text models on mobile phones	Dec 7, 2021	CPU, Speech-to-Text	Unverified	0
Improve Sinhala Speech Recognition Through e2e LF-MMI Model	Dec 1, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting	Dec 1, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility	Dec 1, 2021	Distant Speech Recognition, Position	Unverified	0
Cross Attention Augmented Transducer Networks for Simultaneous Translation	Nov 1, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1
Scribosermo: Fast Speech-to-Text models for German and other Languages	Oct 15, 2021	Speech Recognition, Speech-to-Text	Code Available	0
Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems	Oct 13, 2021	Sentence, Simultaneous Speech-to-Text Translation	Unverified	0
Comparison of SVD and factorized TDNN approaches for speech to text	Oct 13, 2021	Speech-to-Text	Unverified	0
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation	Oct 11, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Automated Testing of AI Models	Oct 7, 2021	Fairness, Speech-to-Text	Unverified	0
EdiTTS: Score-based Editing for Controllable Text-to-Speech	Oct 6, 2021	Speech Synthesis, Speech-to-Text	Code Available	1
Late reverberation suppression using U-nets	Oct 5, 2021	Decoder, Speech Dereverberation	Code Available	1
Challenges and Opportunities of Speech Recognition for Bengali Language	Sep 27, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Audio Interval Retrieval using Convolutional Neural Networks	Sep 21, 2021	Audio Classification, Retrieval	Unverified	0
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition	Sep 19, 2021	Language Modeling, Language Modelling	Unverified	0
Infusing Future Information into Monotonic Attention Through Language Models	Sep 7, 2021	Language Modeling, Language Modelling	Unverified	0
Speech Emotion Recognition with Multi-Task Learning	Sep 6, 2021	Emotion Classification, Emotion Recognition	Code Available	1
One TTS Alignment To Rule Them All	Aug 23, 2021	All, Speech Synthesis	Code Available	1
With One Voice: Composing a Travel Voice Assistant from Re-purposed Models	Aug 4, 2021	BIG-bench Machine Learning, named-entity-recognition	Unverified	0
Corpus Creation and Evaluation for Speech-to-Text and Speech Translation	Aug 1, 2021	Machine Translation, Speech-to-Text	Unverified	0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text	Aug 1, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models	Aug 1, 2021	Decoder, Speech-to-Text	Unverified	0
A Large-Scale Chinese Multimodal NER Dataset with Speech Clues	Aug 1, 2021	named-entity-recognition, Named Entity Recognition	Code Available	1
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task	Jul 12, 2021	Decoder, Knowledge Distillation	Unverified	0
Kosp2e: Korean Speech to English Translation Corpus	Jul 6, 2021	speech-recognition, Speech Recognition	Code Available	1
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021	Jul 1, 2021	Data Augmentation, Speech-to-Text	Unverified	0
Towards Automatic Speech to Sign Language Generation	Jun 24, 2021	Speech-to-Text, Text Generation	Code Available	1
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling	Jun 21, 2021	speech-recognition, Speech Recognition	Unverified	0
Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR	Jun 11, 2021	Simultaneous Speech-to-Text Translation, Speech-to-Text	Unverified	0
TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS	Jun 10, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
On the Design of Strategic Task Recommendations for Sustainable Crowdsourcing-Based Content Moderation	Jun 4, 2021	Recommendation Systems, Speech-to-Text	Unverified	0
Findings of the Second Workshop on Automatic Simultaneous Translation	Jun 1, 2021	Machine Translation, Speech-to-Text	Unverified	0
Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering	Jun 1, 2021	Knowledge Graphs, Question Answering	Unverified	0
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation	May 11, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1
What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice	May 10, 2021	speech-recognition, Speech Recognition	Unverified	0
Learning Shared Semantic Space for Speech-to-Text Translation	May 7, 2021	Machine Translation, Speech-to-Text	Code Available	1
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect	May 7, 2021	Benchmarking, Speech-to-Text	Unverified	0
Bridging the gap between streaming and non-streaming ASR systems by distilling ensembles of CTC and RNN-T models	Apr 25, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
End-to-end Speech Translation via Cross-modal Progressive Training	Apr 21, 2021	Machine Translation, Speech-to-Text	Code Available	1
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers	Apr 21, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition	Apr 5, 2021	speech-recognition, Speech Recognition	Code Available	0
Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems	Mar 15, 2021	Speech-to-Text	Unverified	0
Towards Robust Speech-to-Text Adversarial Attack	Mar 15, 2021	Adversarial Attack, Room Impulse Response (RIR)	Unverified	0
Towards the evaluation of automatic simultaneous speech translation from a communicative perspective	Mar 15, 2021	automatic-speech-translation, Informativeness	Unverified	0
Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech	Feb 25, 2021	Scene Classification, Speech-to-Text	Unverified	0
NUVA: A Naming Utterance Verifier for Aphasia Treatment	Feb 10, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Audio Adversarial Examples: Attacks Using Vocal Masks	Feb 4, 2021	Adversarial Attack, Speech-to-Text	Unverified	0
Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center	Jan 31, 2021	Graph Neural Network, Speech-to-Text	Unverified	0
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge	Jan 29, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm	Jan 14, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0

