SOTAVerified

Language Modeling

Papers

Showing 30513100 of 14182 papers

TitleStatusHype
Train No Evil: Selective Masking for Task-Guided Pre-TrainingCode1
Adaptive Attention Span in Computer VisionCode1
Transform and Tell: Entity-Aware News Image CaptioningCode1
Fast and Accurate Deep Bidirectional Language Representations for Unsupervised LearningCode1
SPECTER: Document-level Representation Learning using Citation-informed TransformersCode1
TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented DialogueCode1
PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned GenerationCode1
AMR Parsing via Graph-Sequence Iterative InferenceCode1
Unsupervised Commonsense Question Answering with Self-TalkCode1
Injecting Numerical Reasoning Skills into Language ModelsCode1
Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic FidelityCode1
Downstream Model Design of Pre-trained Language Model for Relation Extraction TaskCode1
Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer LearningCode1
Byte Pair Encoding is Suboptimal for Language Model PretrainingCode1
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-based Question AnsweringCode1
Sparse Text GenerationCode1
SelfORE: Self-supervised Relational Feature Learning for Open Relation ExtractionCode1
Optimus: Organizing Sentences via Pre-trained Modeling of a Latent SpaceCode1
MemCap: Memorizing Style Knowledge for Image CaptioningCode1
Felix: Flexible Text Editing Through Tagging and InsertionCode1
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than GeneratorsCode1
Beheshti-NER: Persian Named Entity Recognition Using BERTCode1
Efficient Content-Based Sparse Attention with Routing TransformersCode1
ReZero is All You Need: Fast Convergence at Large DepthCode1
ProGen: Language Modeling for Protein GenerationCode1
RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation SystemCode1
Talking-Heads AttentionCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
Understanding Contexts Inside Robot and Human Manipulation Tasks through a Vision-Language Model and Ontology System in a Video StreamCode1
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-TrainingCode1
Fill in the BLANC: Human-free quality estimation of document summariesCode1
Addressing Some Limitations of Transformers with Feedback MemoryCode1
LAMBERT: Layout-Aware (Language) Modeling for information extractionCode1
SentenceMIM: A Latent Variable Language ModelCode1
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and GenerationCode1
Transformer on a DietCode1
How Much Knowledge Can You Pack Into the Parameters of a Language Model?Code1
REALM: Retrieval-Augmented Language Model Pre-TrainingCode1
Blank Language ModelsCode1
Time-aware Large Kernel ConvolutionsCode1
Parsing as PretrainingCode1
Explaining Relationships Between Scientific DocumentsCode1
Adversarial Training for Aspect-Based Sentiment Analysis with BERTCode1
DUMA: Reading Comprehension with Transposition ThinkingCode1
Scaling Laws for Neural Language ModelsCode1
Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on GeneralizationCode1
A Simple Baseline to Semi-Supervised Domain Adaptation for Machine TranslationCode1
Exploiting Cloze Questions for Few Shot Text Classification and Natural Language InferenceCode1
Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue SystemsCode1
RobBERT: a Dutch RoBERTa-based Language ModelCode1
Show:102550
← PrevPage 62 of 284Next →

No leaderboard results yet.