SOTAVerified

Language Modeling

Papers

Showing 1110111150 of 14182 papers

TitleStatusHype
Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead?Code1
On Sampling-Based Training Criteria for Neural Language Modeling0
Pre-training for Spoken Language Understanding with Joint Textual and Phonetic Representation Learning0
Improving Biomedical Pretrained Language Models with KnowledgeCode1
Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language ModelCode1
B-PROP: Bootstrapped Pre-training with Representative Words Prediction for Ad-hoc RetrievalCode0
Differentiable Model Compression via Pseudo Quantization NoiseCode1
BERTić -- The Transformer Language Model for Bosnian, Croatian, Montenegrin and Serbian0
ELECTRAMed: a new pre-trained language representation model for biomedical NLPCode1
Operationalizing a National Digital Library: The Case for a Norwegian Transformer ModelCode1
When FastText Pays Attention: Efficient Estimation of Word Representations using Constrained Positional WeightingCode0
Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training0
Go Forth and Prosper: Language Modeling with Ancient Textual HistoryCode0
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal ConversationsCode1
On the Influence of Masking Policies in Intermediate Pre-training0
Probing Across Time: What Does RoBERTa Know and When?Code1
Text2App: A Framework for Creating Android Apps from Text DescriptionsCode1
Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema0
Enriching a Model's Notion of Belief using a Persistent Memory0
A Masked Segmental Language Model for Unsupervised Natural Language SegmentationCode0
Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language ModelsCode0
Detecting Polarized Topics Using Partisanship-aware Contextualized Topic EmbeddingsCode0
KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation ExtractionCode1
How to Train BERT with an Academic BudgetCode1
Bilingual alignment transfers to multilingual alignment for unsupervised parallel text miningCode0
SINA-BERT: A pre-trained Language Model for Analysis of Medical Texts in Persian0
Quantifying Gender Bias Towards Politicians in Cross-Lingual Language ModelsCode0
Time-Stamped Language Model: Teaching Language Models to Understand the Flow of EventsCode1
UDALM: Unsupervised Domain Adaptation through Language ModelingCode0
Mean-Squared Accuracy of Good-Turing Estimator0
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little0
TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding LearningCode1
Event Detection as Question Answering with Entity InformationCode0
IGA : An Intent-Guided Authoring AssistantCode0
Large-Scale Self- and Semi-Supervised Learning for Speech Translation0
K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-CommerceCode1
Learning How to Ask: Querying LMs with Mixtures of Soft PromptsCode1
EAT: Enhanced ASR-TTS for Self-supervised Speech RecognitionCode0
What's in your Head? Emergent Behaviour in Multi-Task Transformer Models0
Transformer-based Methods for Recognizing Ultra Fine-grained Entities (RUFES)0
Restoring and Mining the Records of the Joseon Dynasty via Neural Language Modeling and Machine Translation0
Paragraph-level Simplification of Medical TextsCode1
On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic DependenciesCode1
Estimating Subjective Crowd-Evaluations as an Additional Objective to Improve Natural Language Generation0
Building a Swedish Open-Domain Conversational Language ModelCode0
Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models0
Lookup-Table Recurrent Language Models for Long Tail Speech Recognition0
Language model fusion for streaming end to end speech recognition0
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LMCode0
Extended Parallel Corpus for Amharic-English Machine Translation0
Show:102550
← PrevPage 223 of 284Next →

No leaderboard results yet.