SOTAVerified

Language Modeling

Papers

Showing 23512375 of 14182 papers

TitleStatusHype
Knowledge Graph Generation From TextCode1
AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African LanguagesCode1
Investigating Fairness Disparities in Peer Review: A Language Model Enhanced ApproachCode1
KGLM: Integrating Knowledge Graph Structure in Language Models for Link PredictionCode1
Fine-Tuning Language Models via Epistemic Neural NetworksCode1
Contextual information integration for stance detection via cross-attentionCode1
LMentry: A Language Model Benchmark of Elementary Language TasksCode1
Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language ModelCode1
Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks AdaptivelyCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5Code1
Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic ChangeCode1
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular ControlCode1
L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep LearningCode1
Differentiable Data Augmentation for Contrastive Sentence Representation LearningCode1
RoChBert: Towards Robust BERT Fine-tuning for ChineseCode1
Leveraging Label Correlations in a Multi-label Setting: A Case Study in EmotionCode1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust LearningCode1
Truncation Sampling as Language Model DesmoothingCode1
Will we run out of data? Limits of LLM scaling based on human-generated dataCode1
N-gram Is Back: Residual Learning of Neural Text Generation with n-gram Language ModelCode1
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuningCode1
Synthetic Text Generation with Differential Privacy: A Simple and Practical RecipeCode1
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR PredictionCode1
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry WritingCode1
Show:102550
← PrevPage 95 of 568Next →

No leaderboard results yet.