| Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers | Apr 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition | Apr 5, 2021 | speech-recognitionSpeech Recognition | CodeCode Available | 0 |
| Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems | Mar 15, 2021 | Speech-to-Text | —Unverified | 0 |
| Towards the evaluation of automatic simultaneous speech translation from a communicative perspective | Mar 15, 2021 | automatic-speech-translationInformativeness | —Unverified | 0 |
| Towards Robust Speech-to-Text Adversarial Attack | Mar 15, 2021 | Adversarial AttackRoom Impulse Response (RIR) | —Unverified | 0 |
| Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech | Feb 25, 2021 | Scene ClassificationSpeech-to-Text | —Unverified | 0 |
| NUVA: A Naming Utterance Verifier for Aphasia Treatment | Feb 10, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Audio Adversarial Examples: Attacks Using Vocal Masks | Feb 4, 2021 | Adversarial AttackSpeech-to-Text | —Unverified | 0 |
| Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center | Jan 31, 2021 | Graph Neural NetworkSpeech-to-Text | —Unverified | 0 |
| BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | Jan 29, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm | Jan 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploring Transfer Learning For End-to-End Spoken Language Understanding | Dec 15, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Incorporating Domain Knowledge To Improve Topic Segmentation Of Long MOOC Lecture Videos | Dec 8, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| End to End ASR System with Automatic Punctuation Insertion | Dec 3, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Attentively Embracing Noise for Robust Latent Representation in BERT | Dec 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| mask-Net: Learning Context Aware Invariant Features using Adversarial Forgetting (Student Abstract) | Nov 25, 2020 | Speech-to-Text | CodeCode Available | 0 |
| A low latency ASR-free end to end spoken language understanding system | Nov 10, 2020 | Speech-to-TextSpoken Language Understanding | —Unverified | 0 |
| Effectively pretraining a speech translation decoder with Machine Translation data | Nov 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bridging the Modality Gap for Speech-to-Text Translation | Oct 28, 2020 | DecoderSpeech-to-Text | —Unverified | 0 |
| Multilingual Speech Translation with Efficient Finetuning of Pretrained Models | Oct 24, 2020 | Cross-Lingual TransferDecoder | —Unverified | 0 |
| MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation | Oct 22, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Class-Conditional Defense GAN Against End-to-End Speech Attacks | Oct 22, 2020 | Generative Adversarial NetworkSentence | —Unverified | 0 |
| A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks | Oct 21, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin | Oct 21, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines | Oct 19, 2020 | Cross-Lingual Information RetrievalInformation Retrieval | —Unverified | 0 |
| Ensemble Chinese End-to-End Spoken Language Understanding for Abnormal Event Detection from audio stream | Oct 19, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| fairseq S2T: Fast Speech-to-Text Modeling with fairseq | Oct 11, 2020 | Machine TranslationMulti-Task Learning | CodeCode Available | 0 |
| End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands | Sep 22, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Contextualized Translation of Automatically Segmented Speech | Aug 5, 2020 | SegmentationSentence | CodeCode Available | 0 |
| Adversarial Attacks against Neural Networks in Audio Domain: Exploiting Principal Components | Jul 14, 2020 | ClassificationGeneral Classification | —Unverified | 0 |
| Contextualized Spoken Word Representations from Convolutional Autoencoders | Jul 6, 2020 | Speech-to-TextWord Embeddings | —Unverified | 0 |
| SimulSpeech: End-to-End Simultaneous Speech to Text Translation | Jul 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Simultaneous Translation System for IWSLT2020 Using Modality Agnostic Meta-Learning | Jul 1, 2020 | Meta-LearningSpeech-to-Text | —Unverified | 0 |
| End-to-End Offline Speech Translation System for IWSLT 2020 using Modality Agnostic Meta-Learning | Jul 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-Supervised Representations Improve End-to-End Speech Translation | Jun 22, 2020 | Cross-Lingual Transferspeech-recognition | —Unverified | 0 |
| Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset | Jun 15, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation | Jun 9, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020 | May 24, 2020 | Data AugmentationDecoder | —Unverified | 0 |
| Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation | May 17, 2020 | Computational Efficiencyspeech-recognition | —Unverified | 0 |
| SpiCE: A New Open-Access Corpus of Conversational Bilingual Speech in Cantonese and English | May 1, 2020 | SentenceSpeech-to-Text | —Unverified | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-TextTranslation Pipelines | May 1, 2020 | Cross-Lingual Information RetrievalInformation Retrieval | —Unverified | 0 |
| Crossing the SSH Bridge with Interview Data | May 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Jointly Trained Transformers models for Spoken Language Translation | Apr 25, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Cloud-Based Face and Speech Recognition for Access Control Applications | Apr 23, 2020 | Face Recognitionspeech-recognition | —Unverified | 0 |
| Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi | Apr 21, 2020 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset | Apr 13, 2020 | Gaze PredictionSpeech-to-Text | —Unverified | 0 |
| The Spotify Podcast Dataset | Apr 8, 2020 | Speech-to-Text | —Unverified | 0 |
| A.I. based Embedded Speech to Text Using Deepspeech | Feb 25, 2020 | Raspberry Pi 3speech-recognition | —Unverified | 0 |
| Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding | Dec 16, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation | Dec 6, 2019 | FormMachine Translation | CodeCode Available | 0 |