SOTAVerified

Language Modeling

Papers

Showing 1195112000 of 14182 papers

TitleStatusHype
Misinformation Has High PerplexityCode0
BERT Loses Patience: Fast and Robust Inference with Early ExitCode1
Language Models as Fact Checkers?0
A Dataset and Benchmarks for Multimedia Social Analysis0
GMAT: Global Memory Augmentation for TransformersCode0
Tensorized Transformer for Dynamical Systems Modeling0
Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers0
Contextual RNN-T For Open Domain ASR0
Cross-model Back-translated Distillation for Unsupervised Machine TranslationCode0
Segatron: Segment-aware Transformer for Language Modeling and Understanding0
Position Masking for Language Models0
FlauBERT : des mod\`eles de langue contextualis\'es pr\'e-entra\^ \'es pour le fran (FlauBERT : Unsupervised Language Model Pre-training for French)Code0
Contextualized French Language Models for Biomedical Named Entity Recognition0
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features0
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading0
Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLPCode1
Language Models are Few-Shot LearnersCode3
Unsupervised Relation Extraction from Language Models using Constrained Cloze Completion0
Self-Training for Unsupervised Parsing with PRPN0
Syntactic Structure Distillation Pretraining For Bidirectional Encoders0
TIME: Text and Image Mutual-Translation Adversarial Networks0
qDKT: Question-centric Deep Knowledge Tracing0
When does MAML Work the Best? An Empirical Study on Model-Agnostic Meta-Learning in NLP Applications0
Living Machines: A study of atypical animacyCode0
Improving Segmentation for Technical Support ProblemsCode0
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition0
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning0
Text-to-Text Pre-Training for Data-to-Text TasksCode1
Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems0
BERTweet: A pre-trained language model for English TweetsCode1
Investigation of Large-Margin Softmax in Neural Language Modeling0
Early Stage LM Integration Using Local and Global Log-Linear Combination0
Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion0
Iterative Pseudo-Labeling for Speech RecognitionCode0
Table Search Using a Deep Contextualized Language ModelCode1
Yseop at SemEval-2020 Task 5: Cascaded BERT Language Model for Counterfactual Statement Analysis0
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems0
GPT-too: A language-model-first approach for AMR-to-text generationCode1
How much complexity does an RNN architecture need to learn syntax-sensitive dependencies?Code0
Conformer: Convolution-augmented Transformer for Speech RecognitionCode3
MicroNet for Efficient Language ModelingCode1
Spelling Error Correction with Soft-Masked BERTCode1
Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model0
Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning0
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation0
Parallel Corpus Filtering via Pre-trained Language Models0
A Mixture of h-1 Heads is Better than h Heads0
Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized EncodingCode1
AttViz: Online exploration of self-attention for transparent neural language modelingCode0
Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance ApproachCode0
Show:102550
← PrevPage 240 of 284Next →

No leaderboard results yet.