SOTAVerified

Speech-to-Text

Papers

Showing 301–350 of 403 papers

| Title | Status | Hype |
|---|---|---|
| Attention-Based End-to-End Speech Recognition on Voice Search | | 0 |
| Audio Adversarial Examples: Attacks Using Vocal Masks | | 0 |
| Audio Interval Retrieval using Convolutional Neural Networks | | 0 |
| AudioPaLM: A Large Language Model That Can Speak and Listen | | 0 |
| Automated Testing of AI Models | | 0 |
| A Voice Controlled E-Commerce Web Application | | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | | 0 |
| BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | | 0 |
| Bridging the gap between streaming and non-streaming ASR systems by distilling ensembles of CTC and RNN-T models | | 0 |
| Bridging the Modality Gap for Speech-to-Text Translation | | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | | 0 |
| Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? | | 0 |
| Challenges and Opportunities of Speech Recognition for Bengali Language | | 0 |
| Characterizing Financial Market Coverage using Artificial Intelligence | | 0 |
| CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training | | 0 |
| Class-Conditional Defense GAN Against End-to-End Speech Attacks | | 0 |
| Cross-lingual topic prediction for speech using translations | | 0 |
| Clinical Dialogue Transcription Error Correction using Seq2Seq Models | | 0 |
| Cloud-Based Face and Speech Recognition for Access Control Applications | | 0 |
| CMU's IWSLT 2024 Simultaneous Speech Translation System | | 0 |
| CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders | | 0 |
| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | | 0 |
| Compact Speech Translation Models via Discrete Speech Units Pretraining | | 0 |
| Comparison of SVD and factorized TDNN approaches for speech to text | | 0 |
| Open Brain AI. Automatic Language Assessment | | 0 |
| Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model | | 0 |
| Contextualized Spoken Word Representations from Convolutional Autoencoders | | 0 |
| Conversational Recommendation System using NLP and Sentiment Analysis | | 0 |
| Corpus Creation and Evaluation for Speech-to-Text and Speech Translation | | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | | 0 |
| CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving | | 0 |
| Crossing the SSH Bridge with Interview Data | | 0 |
| Cross-modal Contrastive Learning for Speech Translation | | 0 |
| Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing | | 0 |
| Multilingual Speech Translation with Efficient Finetuning of Pretrained Models | | 0 |
| CTC Alignments Improve Autoregressive Translation | | 0 |
| CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR | | 0 |
| DARTS: Dialectal Arabic Transcription System | | 0 |
| Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning | | 0 |
| Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems | | 0 |
| DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems | | 0 |
| Deepfake audio as a data augmentation technique for training automatic speech to text transcription models | | 0 |
| Deep Learning Based Natural Language Processing for End to End Speech Translation | | 0 |
| Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents | | 0 |
| Design of a novel Korean learning application for efficient pronunciation correction | | 0 |
| Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network | | 0 |
| Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution | | 0 |
| Development of Natural Language Processing Tools for Cook Islands Māori | | 0 |
| Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum | | 0 |
| Digits micro-model for accurate and secure transactions | | 0 |

No leaderboard results yet.