SOTAVerified

Speech-to-Text

Papers

Showing 101-150 of 403 papers

| Title | Status | Hype |
| --- | --- | --- |
| Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model | | 0 |
| Deepfake audio as a data augmentation technique for training automatic speech to text transcription models | | 0 |
| Deep Learning Based Natural Language Processing for End to End Speech Translation | | 0 |
| Multilingual Speech Translation with Efficient Finetuning of Pretrained Models | | 0 |
| Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents | | 0 |
| Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing | | 0 |
| Cross-modal Contrastive Learning for Speech Translation | | 0 |
| Design of a novel Korean learning application for efficient pronunciation correction | | 0 |
| Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions | | 0 |
| Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution | | 0 |
| Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition | | 0 |
| Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum | | 0 |
| Digits micro-model for accurate and secure transactions | | 0 |
| Direct Punjabi to English speech translation using discrete units | | 0 |
| Crossing the SSH Bridge with Interview Data | | 0 |
| Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR | | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | | 0 |
| Automated Testing of AI Models | | 0 |
| Analyzing ASR pretraining for low-resource speech-to-text translation | | 0 |
| A Case Study on Filtering for End-to-End Speech Translation | | 0 |
| Effectively pretraining a speech translation decoder with Machine Translation data | | 0 |
| Efficient Monotonic Multihead Attention | | 0 |
| Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center | | 0 |
| Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility | | 0 |
| Challenges and Opportunities of Speech Recognition for Bengali Language | | 0 |
| Improve Sinhala Speech Recognition Through e2e LF-MMI Model | | 0 |
| Hands-Free VR | | 0 |
| CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving | | 0 |
| AudioPaLM: A Large Language Model That Can Speak and Listen | | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | | 0 |
| Corpus Creation and Evaluation for Speech-to-Text and Speech Translation | | 0 |
| A low latency ASR-free end to end spoken language understanding system | | 0 |
| Conversational Recommendation System using NLP and Sentiment Analysis | | 0 |
| Audio Interval Retrieval using Convolutional Neural Networks | | 0 |
| AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation | | 0 |
| Contextualized Spoken Word Representations from Convolutional Autoencoders | | 0 |
| Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model | | 0 |
| Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems | | 0 |
| Advancing STT for Low-Resource Real-World Speech | | 0 |
| Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages | | 0 |
| Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language | | 0 |
| Open Brain AI. Automatic Language Assessment | | 0 |
| Audio Adversarial Examples: Attacks Using Vocal Masks | | 0 |
| Comparison of SVD and factorized TDNN approaches for speech to text | | 0 |
| Compact Speech Translation Models via Discrete Speech Units Pretraining | | 0 |
| Acquisition of high-quality images for camera calibration in robotics applications via speech prompts | | 0 |
| Findings of the Third Workshop on Automatic Simultaneous Translation | | 0 |
| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | | 0 |
| Findings of the Second Workshop on Automatic Simultaneous Translation | | 0 |
| Fast Labeling and Transcription with the Speechalyzer Toolkit | | 0 |
Page 3 of 9

No leaderboard results yet.