SOTAVerified

Language Modeling

Papers

Showing 31313140 of 14182 papers

TitleStatusHype
XLNet: Generalized Autoregressive Pretraining for Language UnderstandingCode1
How multilingual is Multilingual BERT?Code1
Does It Make Sense? And Why? A Pilot Study for Sense Making and ExplanationCode1
Adapting Text Embeddings for Causal InferenceCode1
Stochastic Gradient Methods with Layer-wise Adaptive Moments for Training of Deep NetworksCode1
Discrete Flows: Invertible Generative Models of Discrete DataCode1
Adaptive Attention Span in TransformersCode1
A Surprisingly Robust Trick for Winograd Schema ChallengeCode1
How to Fine-Tune BERT for Text Classification?Code1
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data AugmentationCode1
Show:102550
← PrevPage 314 of 1419Next →

No leaderboard results yet.