Speech-to-Text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 403 papers

Title	Date	Tasks	Status
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers	Apr 21, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition	Apr 5, 2021	speech-recognitionSpeech Recognition	CodeCode Available
Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems	Mar 15, 2021	Speech-to-Text	—Unverified
Towards the evaluation of automatic simultaneous speech translation from a communicative perspective	Mar 15, 2021	automatic-speech-translationInformativeness	—Unverified
Towards Robust Speech-to-Text Adversarial Attack	Mar 15, 2021	Adversarial AttackRoom Impulse Response (RIR)	—Unverified
Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech	Feb 25, 2021	Scene ClassificationSpeech-to-Text	—Unverified
NUVA: A Naming Utterance Verifier for Aphasia Treatment	Feb 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Audio Adversarial Examples: Attacks Using Vocal Masks	Feb 4, 2021	Adversarial AttackSpeech-to-Text	—Unverified
Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center	Jan 31, 2021	Graph Neural NetworkSpeech-to-Text	—Unverified
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge	Jan 29, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm	Jan 14, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Exploring Transfer Learning For End-to-End Spoken Language Understanding	Dec 15, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Incorporating Domain Knowledge To Improve Topic Segmentation Of Long MOOC Lecture Videos	Dec 8, 2020	Language ModelingLanguage Modelling	—Unverified
End to End ASR System with Automatic Punctuation Insertion	Dec 3, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Attentively Embracing Noise for Robust Latent Representation in BERT	Dec 1, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
mask-Net: Learning Context Aware Invariant Features using Adversarial Forgetting (Student Abstract)	Nov 25, 2020	Speech-to-Text	CodeCode Available
A low latency ASR-free end to end spoken language understanding system	Nov 10, 2020	Speech-to-TextSpoken Language Understanding	—Unverified
Effectively pretraining a speech translation decoder with Machine Translation data	Nov 1, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Bridging the Modality Gap for Speech-to-Text Translation	Oct 28, 2020	DecoderSpeech-to-Text	—Unverified
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models	Oct 24, 2020	Cross-Lingual TransferDecoder	—Unverified
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation	Oct 22, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Class-Conditional Defense GAN Against End-to-End Speech Attacks	Oct 22, 2020	Generative Adversarial NetworkSentence	—Unverified
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks	Oct 21, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin	Oct 21, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines	Oct 19, 2020	Cross-Lingual Information RetrievalInformation Retrieval	—Unverified
Ensemble Chinese End-to-End Spoken Language Understanding for Abnormal Event Detection from audio stream	Oct 19, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
fairseq S2T: Fast Speech-to-Text Modeling with fairseq	Oct 11, 2020	Machine TranslationMulti-Task Learning	CodeCode Available
End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands	Sep 22, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Contextualized Translation of Automatically Segmented Speech	Aug 5, 2020	SegmentationSentence	CodeCode Available
Adversarial Attacks against Neural Networks in Audio Domain: Exploiting Principal Components	Jul 14, 2020	ClassificationGeneral Classification	—Unverified
Contextualized Spoken Word Representations from Convolutional Autoencoders	Jul 6, 2020	Speech-to-TextWord Embeddings	—Unverified
SimulSpeech: End-to-End Simultaneous Speech to Text Translation	Jul 1, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
End-to-End Simultaneous Translation System for IWSLT2020 Using Modality Agnostic Meta-Learning	Jul 1, 2020	Meta-LearningSpeech-to-Text	—Unverified
End-to-End Offline Speech Translation System for IWSLT 2020 using Modality Agnostic Meta-Learning	Jul 1, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Self-Supervised Representations Improve End-to-End Speech Translation	Jun 22, 2020	Cross-Lingual Transferspeech-recognition	—Unverified
Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset	Jun 15, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation	Jun 9, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020	May 24, 2020	Data AugmentationDecoder	—Unverified
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation	May 17, 2020	Computational Efficiencyspeech-recognition	—Unverified
SpiCE: A New Open-Access Corpus of Conversational Bilingual Speech in Cantonese and English	May 1, 2020	SentenceSpeech-to-Text	—Unverified
Subtitles to Segmentation: Improving Low-Resource Speech-to-TextTranslation Pipelines	May 1, 2020	Cross-Lingual Information RetrievalInformation Retrieval	—Unverified
Crossing the SSH Bridge with Interview Data	May 1, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Jointly Trained Transformers models for Spoken Language Translation	Apr 25, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Cloud-Based Face and Speech Recognition for Access Control Applications	Apr 23, 2020	Face Recognitionspeech-recognition	—Unverified
Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi	Apr 21, 2020	Machine TranslationSpeech-to-Text	—Unverified
Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset	Apr 13, 2020	Gaze PredictionSpeech-to-Text	—Unverified
The Spotify Podcast Dataset	Apr 8, 2020	Speech-to-Text	—Unverified
A.I. based Embedded Speech to Text Using Deepspeech	Feb 25, 2020	Raspberry Pi 3speech-recognition	—Unverified
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding	Dec 16, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation	Dec 6, 2019	FormMachine Translation	CodeCode Available

Show:10 25 50

← PrevPage 7 of 9Next →

No leaderboard results yet.