Speech-to-Text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 403 papers

Title	Date	Tasks	Status	Hype
Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks	Aug 25, 2022	Machine TranslationPart-Of-Speech Tagging	—Unverified	0
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech	Aug 10, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Extending RNN-T-based speech recognition systems with emotion and language classification	Jul 28, 2022	Emotion ClassificationEmotion Recognition	—Unverified	0
RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks	Jul 14, 2022	Speech-to-Text	—Unverified	0
M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation	Jul 3, 2022	DecoderSpeech-to-Text	CodeCode Available	0
Language Model Augmented Monotonic Attention for Simultaneous Translation	Jul 1, 2022	Language ModelingLanguage Modelling	—Unverified	0
System Description on Automatic Simultaneous Translation Workshop	Jul 1, 2022	SentenceSpeech-to-Text	—Unverified	0
Findings of the Third Workshop on Automatic Simultaneous Translation	Jul 1, 2022	Speech-to-TextTranslation	—Unverified	0
Swiss German Speech to Text system evaluation	Jul 1, 2022	Speech-to-Text	—Unverified	0
Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models	Jun 29, 2022	Intent ClassificationSlot Filling	CodeCode Available	0
Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network	Jun 17, 2022	speech-recognitionSpeech Recognition	—Unverified	0
Revisiting End-to-End Speech-to-Text Translation From Scratch	Jun 9, 2022	Decoderspeech-recognition	—Unverified	0
The Nós Project: Opening routes for the Galician language in the field of language technologies	Jun 1, 2022	Cultural Vocal Bursts Intensity PredictionMachine Translation	—Unverified	0
Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool	Jun 1, 2022	Sign Language TranslationSpeech-to-Text	—Unverified	0
A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking	Jun 1, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Clinical Dialogue Transcription Error Correction using Seq2Seq Models	May 26, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Semantic-preserved Communication System for Highly Efficient Speech Transmission	May 25, 2022	Semantic Communicationspeech-recognition	—Unverified	0
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit	May 20, 2022	AllAutomatic Speech Recognition (ASR)	CodeCode Available	6
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation	May 17, 2022	Representation LearningRetrieval	—Unverified	0
Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language	May 6, 2022	speech-recognitionSpeech Recognition	—Unverified	0
Cross-modal Contrastive Learning for Speech Translation	May 5, 2022	Contrastive LearningRetrieval	CodeCode Available	1
Design of a novel Korean learning application for efficient pronunciation correction	May 4, 2022	Sentencespeech-recognition	—Unverified	0
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages	May 2, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation	May 1, 2022	SegmentationSimultaneous Speech-to-Text Translation	—Unverified	0
NAIST Simultaneous Speech-to-Text Translation System for IWSLT 2022	May 1, 2022	SegmentationSimultaneous Speech-to-Text Translation	—Unverified	0
The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation	May 1, 2022	ChunkingSentence	—Unverified	0
The AISP-SJTU Simultaneous Translation System for IWSLT 2022	May 1, 2022	Speech-to-TextTranslation	—Unverified	0
LibriS2S: A German-English Speech-to-Speech Translation Corpus	Apr 22, 2022	Speech-to-Speech TranslationSpeech-to-Text	CodeCode Available	0
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment	Apr 22, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation	Apr 6, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems	Apr 4, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents	Apr 3, 2022	speech-recognitionSpeech Recognition	—Unverified	0
The MIT Voice Name System	Mar 28, 2022	Speech-to-Text	—Unverified	0
A Dataset for Speech Emotion Recognition in Greek Theatrical Plays	Mar 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0
XTREME-S: Evaluating Cross-lingual Speech Representations	Mar 21, 2022	Representation LearningRetrieval	—Unverified	0
STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation	Mar 20, 2022	Machine TranslationSpeech-to-Text	CodeCode Available	1
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing	Mar 18, 2022	Representation LearningSpeaker Verification	CodeCode Available	1
A combined approach to the analysis of speech conversations in a contact center domain	Mar 12, 2022	Speech-to-Text	—Unverified	0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems	Mar 10, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Which French speech recognition system for assistant robots?	Mar 4, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments	Feb 21, 2022	Data AugmentationPhoneme Recognition	CodeCode Available	0
Punctuation restoration in Swedish through fine-tuned KB-BERT	Feb 14, 2022	Language ModellingPunctuation Restoration	—Unverified	0
Semantic-aware Speech to Text Transmission with Redundancy Removal	Feb 7, 2022	Semantic CommunicationSpeech-to-Text	—Unverified	0
Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility	Feb 5, 2022	Speech EnhancementSpeech-to-Text	—Unverified	0
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation	Jan 11, 2022	SentenceSpeech-to-Speech Translation	CodeCode Available	2
A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture	Jan 6, 2022	Speech-to-Texttext-to-speech	CodeCode Available	0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition	Dec 23, 2021	BenchmarkingDeep Learning	CodeCode Available	0
Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement	Dec 21, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Cross-modal Contrastive Learning for Speech Translation	Dec 17, 2021	Contrastive LearningRetrieval	—Unverified	0
X-Vector based voice activity detection for multi-genre broadcast speech-to-text	Dec 9, 2021	Action DetectionActivity Detection	CodeCode Available	1

Show:10 25 50

← PrevPage 5 of 9Next →

No leaderboard results yet.