SOTAVerified

Automatic Speech Recognition

Papers

Showing 21012150 of 3174 papers

TitleStatusHype
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation0
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation0
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge0
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction0
Lexicon and Attention based Handwritten Text Recognition System0
Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform0
Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions0
Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems0
Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR0
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition0
LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect0
LinTO Platform: A Smart Open Voice Assistant for Business Environments0
LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models0
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models0
Listening while Speaking: Speech Chain by Deep Learning0
Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End0
LiSTra, Automatic Speech Translation: English to Lingala case study0
LiSTra Automatic Speech Translation: English to Lingala Case Study0
Literary and Colloquial Dialect Identification for Tamil using Acoustic Features0
LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data0
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale0
LLM-based phoneme-to-grapheme for phoneme-based speech recognition0
LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR0
LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization0
Local Feature or Mel Frequency Cepstral Coefficients - Which One is Better for MLN-Based Bangla Speech Recognition?0
Locality enhanced dynamic biasing and sampling strategies for contextual ASR0
Local Monotonic Attention Mechanism for End-to-End Speech and Language Processing0
LoCoML: A Framework for Real-World ML Inference Pipelines0
Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features0
LongFNT: Long-form Speech Recognition with Factorized Neural Transducer0
Incorporating VAD into ASR System by Multi-task Learning0
Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers0
Looking Enhances Listening: Recovering Missing Speech Using Images0
Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use0
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR0
Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent0
Loss Prediction: End-to-End Active Learning Approach For Speech Recognition0
Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models0
Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech0
Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications0
Low Latency ASR for Simultaneous Speech Translation0
Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation0
Low-rank Gradient Approximation For Memory-Efficient On-device Training of Deep Neural Network0
Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning0
Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training0
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System0
Low-Resource Machine Transliteration Using Recurrent Neural Networks of Asian Languages0
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition0
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Network0
LUPET: Incorporating Hierarchical Information Path into Multilingual ASR0
Show:102550
← PrevPage 43 of 64Next →

No leaderboard results yet.