SOTAVerified

Language Modeling

Papers

Showing 1215112200 of 14182 papers

TitleStatusHype
Sparse Text GenerationCode1
SelfORE: Self-supervised Relational Feature Learning for Open Relation ExtractionCode1
Exploring Early Prediction of Buyer-Seller Negotiation Outcomes0
Semi-supervised acoustic and language model training for English-isiZulu code-switched speech recognition0
Optimus: Organizing Sentences via Pre-trained Modeling of a Latent SpaceCode1
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2Code0
Syntax-driven Iterative Expansion Language Models for Controllable Text Generation0
CG-BERT: Conditional Text Generation with BERT for Generalized Few-shot Intent Detection0
BAE: BERT-based Adversarial Examples for Text ClassificationCode2
MemCap: Memorizing Style Knowledge for Image CaptioningCode1
Adversarial Transfer Learning for Punctuation Restoration0
NukeBERT: A Pre-trained language model for Low Resource Nuclear DomainCode0
Meta Fine-Tuning Neural Language Models for Multi-Domain Text MiningCode0
Abstractive Text Summarization based on Language Model Conditioning and Locality ModelingCode0
Common-Knowledge Concept Recognition for SEVACode0
Felix: Flexible Text Editing Through Tagging and InsertionCode1
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than GeneratorsCode1
SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection0
Dynamic Sampling and Selective Masking for Communication-Efficient Federated Learning0
TNT-KID: Transformer-based Neural Tagger for Keyword IdentificationCode0
Beheshti-NER: Persian Named Entity Recognition Using BERTCode1
Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies0
Self-Supervised Log ParsingCode2
Key Phrase Classification in Complex Assignments0
Finnish Language Modeling with Deep Transformer Models0
Meta-CoTGAN: A Meta Cooperative Training Paradigm for Improving Adversarial Text Generation0
Hybrid Autoregressive Transducer (hat)0
Efficient Content-Based Sparse Attention with Routing TransformersCode1
ReZero is All You Need: Fast Convergence at Large DepthCode1
ProGen: Language Modeling for Protein GenerationCode1
What the [MASK]? Making Sense of Language-Specific BERT Models0
Talking-Heads AttentionCode1
RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation SystemCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language ModelCode2
XGPT: Cross-modal Generative Pre-Training for Image Captioning0
Understanding Contexts Inside Robot and Human Manipulation Tasks through a Vision-Language Model and Ontology System in a Video StreamCode1
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-TrainingCode1
A Deep Generative Model for Fragment-Based Molecule GenerationCode0
Using a thousand optimization tasks to learn hyperparameter search strategies0
Quantized Neural Network Inference with Precision Batching0
Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units0
Sparse Sinkhorn AttentionCode0
A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition0
Object Relational Graph with Teacher-Recommended Learning for Video Captioning0
A more abstractive summarization model0
Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity0
Semi-Supervised Speech Recognition via Local Prior MatchingCode3
Sequence Preserving Network Traffic Generation0
Fill in the BLANC: Human-free quality estimation of document summariesCode1
Show:102550
← PrevPage 244 of 284Next →

No leaderboard results yet.