SOTAVerified

Language Modeling

Papers

Showing 97019750 of 14182 papers

TitleStatusHype
Formulating Few-shot Fine-tuning Towards Language Model Pre-training: A Pilot Study on Named Entity RecognitionCode0
Enhancing Continual Learning with Global Prototypes: Counteracting Negative Representation Drift0
RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-EncoderCode2
On the Role of Bidirectionality in Language Model Pre-Training0
PoeLM: A Meter- and Rhyme-Controllable Language Model for Unsupervised Poetry GenerationCode0
PERT: A New Solution to Pinyin to Character Conversion TaskCode0
On Measuring Social Biases in Prompt-Based Multi-Task LearningCode1
Improving Short Text Classification With Augmented Data Using GPT-30
Challenges in Measuring Bias via Open-Ended Language GenerationCode0
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in BanglaCode1
The Diminishing Returns of Masked Language Models to Science0
Supporting Vision-Language Model Inference with Causality-pruning Knowledge Prompt0
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language ModelsCode1
Looking for a Handsome Carpenter! Debiasing GPT-3 Job AdvertisementsCode0
Prompt Tuning for Discriminative Pre-trained Language ModelsCode1
The Geometry of Multilingual Language Model RepresentationsCode1
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models0
Housekeep: Tidying Virtual Households using Commonsense ReasoningCode1
Named Entity Linking with Entity Representation by Multiple Embeddings0
Scenario-based Multi-product Advertising Copywriting Generation for E-Commerce0
DeepStruct: Pretraining of Language Models for Structure PredictionCode1
Multilingual Normalization of Temporal Expressions with Masked Language ModelsCode0
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes0
Visually-Augmented Language ModelingCode1
KERPLE: Kernelized Relative Positional Embedding for Length ExtrapolationCode1
Progressive Class Semantic Matching for Semi-supervised Text ClassificationCode0
RankGen: Improving Text Generation with Large Ranking ModelsCode1
Foundation Posteriors for Approximate Probabilistic Inference0
Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models0
Automatic Spoken Language Identification using a Time-Delay Neural Network0
Are Prompt-based Models Clueless?0
GPoeT-2: A GPT-2 Based Poem GeneratorCode0
Feature Aggregation in Zero-Shot Cross-Lingual Transfer Using Multilingual BERT0
TiBERT: Tibetan Pre-trained Language Model0
PathologyBERT -- Pre-trained Vs. A New Transformer Language Model for Pathology Domain0
Bootstrapping Text Anonymization Models with Distant SupervisionCode0
Controlling Translation Formality Using Pre-trained Multilingual Language Models0
TIE: Topological Information Enhanced Structural Reading Comprehension on Web PagesCode1
Weakly Supervised Text Classification using Supervision Signals from a Language ModelCode1
Localized Vision-Language Matching for Open-vocabulary Object DetectionCode1
Efficient and Training-Free Control of Language Generation0
AdaVAE: Exploring Adaptive GPT-2s in Variational Auto-Encoders for Language ModelingCode1
A Generalist AgentCode2
Towards the Generation of Musical Explanations with GPT-3Code0
Towards Unified Prompt Tuning for Few-shot Text Classification0
An Empirical Study Of Self-supervised Learning Approaches For Object Detection With TransformersCode0
Aggregating Pairwise Semantic Differences for Few-Shot Claim Veracity Classification0
DistilProtBert: A distilled protein language model used to distinguish between real proteins and their randomly shuffled counterpartsCode1
From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More EffectiveCode1
Human Language ModelingCode1
Show:102550
← PrevPage 195 of 284Next →

No leaderboard results yet.