SOTAVerified

Speech-to-Text

Papers

Showing 151–200 of 403 papers

| Title | Status | Hype |
| --- | --- | --- |
| Audio Adversarial Examples: Attacks Using Vocal Masks | | 0 |
| Comparison of SVD and factorized TDNN approaches for speech to text | | 0 |
| Acquisition of high-quality images for camera calibration in robotics applications via speech prompts | | 0 |
| Compact Speech Translation Models via Discrete Speech Units Pretraining | | 0 |
| Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks | | 0 |
| Open Brain AI. Automatic Language Assessment | | 0 |
| Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers | | 0 |
| Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center | | 0 |
| Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model | | 0 |
| Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition | | 0 |
| Hands-Free VR | | 0 |
| Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language | | 0 |
| Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation | | 0 |
| How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System? | | 0 |
| How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | | 0 |
| Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks | | 0 |
| Findings of the Third Workshop on Automatic Simultaneous Translation | | 0 |
| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | | 0 |
| Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility | | 0 |
| Improved Cross-Lingual Transfer Learning For Automatic Speech Translation | | 0 |
| Improve Sinhala Speech Recognition Through e2e LF-MMI Model | | 0 |
| Improving Autoregressive NLP Tasks via Modular Linearized Attention | | 0 |
| Findings of the Second Workshop on Automatic Simultaneous Translation | | 0 |
| Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech | | 0 |
| Fast Labeling and Transcription with the Speechalyzer Toolkit | | 0 |
| Attention-Based End-to-End Speech Recognition on Voice Search | | 0 |
| Improving Metrics for Speech Translation | | 0 |
| Improving RNN-Transducers with Acoustic LookAhead | | 0 |
| CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders | | 0 |
| Improving Speech Recognition Accuracy Using Custom Language Models with the Vosk Toolkit | | 0 |
| Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task | | 0 |
| Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach | | 0 |
| Extending RNN-T-based speech recognition systems with emotion and language classification | | 0 |
| IMS-Speech: A Speech to Text Tool | | 0 |
| AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments | | 0 |
| Exploring Transfer Learning For End-to-End Spoken Language Understanding | | 0 |
| Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset | | 0 |
| Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems | | 0 |
| Infusing Future Information into Monotonic Attention Through Language Models | | 0 |
| Multilingual Speech Translation with Efficient Finetuning of Pretrained Models | | 0 |
| A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation | | 0 |
| Interpreting Strategies Annotation in the WAW Corpus | | 0 |
| Investigating Decoder-only Large Language Models for Speech-to-text Translation | | 0 |
| Jointly Trained Transformers models for Spoken Language Translation | | 0 |
| Language Model Augmented Monotonic Attention for Simultaneous Translation | | 0 |
| Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages | | 0 |
| I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs | | 0 |
| Existential Crisis: A Social Robot's Reason for Being | | 0 |
| Evaluation of real-time transcriptions using end-to-end ASR models | | 0 |
| CMU's IWSLT 2024 Simultaneous Speech Translation System | | 0 |
Page 4 of 9

No leaderboard results yet.