SOTAVerified

Language Modeling

Papers

Showing 1055110600 of 14182 papers

TitleStatusHype
Balancing Average and Worst-case Accuracy in Multitask Learning0
Learning Compact Metrics for MTCode1
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition0
Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems0
On a Benefit of Mask Language Modeling: Robustness to Simplicity Bias0
Unsupervised Neural Machine Translation with Generative Language Models Only0
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric0
Breaking the Softmax Bottleneck for Sequential Recommender Systems with Dropout and Decoupling0
Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits0
Automatic Text Extractive Summarization Based on Graph and Pre-trained Language Model Attention0
Long Expressive Memory for Sequence ModelingCode1
Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot LearningCode1
Improving Multi-Party Dialogue Discourse Parsing via Domain IntegrationCode1
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic FactorsCode1
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrograms0
Layer-wise Pruning of Transformer Attention Heads for Efficient Language ModelingCode1
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition0
Back from the future: bidirectional CTC decoding using future information in speech recognition0
Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddingsCode1
Beam Search with Bidirectional Strategies for Neural Response Generation0
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition0
Cut the CARP: Fishing for zero-shot story evaluation0
ABC: Attention with Bounded-memory Control0
8-bit Optimizers via Block-wise QuantizationCode3
Language Modeling using LMUs: 10x Better Data Efficiency or Improved Scaling Compared to Transformers0
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition0
Attention Augmented Convolutional Transformer for Tabular Time-series0
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts0
JuriBERT: A Masked-Language Model Adaptation for French Legal TextCode1
Contextualized Semantic Distance between Highly Overlapped TextsCode0
Leveraging Information Bottleneck for Scientific Document Summarization0
Revisiting Self-Training for Few-Shot Learning of Language ModelCode1
Stochastic Anderson Mixing for Nonconvex Stochastic Optimization0
Generative Adversarial Networks based on Mixed-Attentions for Citation Intent Classification in Scientific Publications0
A Study on Contextualized Language Modeling for Machine Reading Comprehension0
Exploiting Low-Resource Code-Switching Data to Mandarin-English Speech Recognition Systems0
Improving Punctuation Restoration for Speech Transcripts via External Data0
Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning0
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens0
Span Labeling Approach for Vietnamese and Chinese Word Segmentation0
MatSciBERT: A Materials Domain Language Model for Text Mining and Information ExtractionCode1
SlovakBERT: Slovak Masked Language ModelCode1
Deep Neural Compression Via Concurrent Pruning and Self-Distillation0
Focused Contrastive Training for Test-based Constituency Analysis0
BERT got a Date: Introducing Transformers to Temporal TaggingCode1
GenTAL: Generative Denoising Skip-gram Transformer for Unsupervised Binary Code Similarity Detection0
Analyzing the Implicit Position Encoding Ability of Transformer Decoder0
Fairness in Representation for Multilingual NLP: Insights from Controlled Experiments on Conditional Language Modeling0
Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation0
Rethinking Client Reweighting for Selfish Federated Learning0
Show:102550
← PrevPage 212 of 284Next →

No leaderboard results yet.