SOTAVerified

Language Modeling

Papers

Showing 23512400 of 14182 papers

TitleStatusHype
Knowledge Graph Generation From TextCode1
AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African LanguagesCode1
Investigating Fairness Disparities in Peer Review: A Language Model Enhanced ApproachCode1
KGLM: Integrating Knowledge Graph Structure in Language Models for Link PredictionCode1
Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language ModelCode1
Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks AdaptivelyCode1
Contextual information integration for stance detection via cross-attentionCode1
Fine-Tuning Language Models via Epistemic Neural NetworksCode1
LMentry: A Language Model Benchmark of Elementary Language TasksCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5Code1
Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic ChangeCode1
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular ControlCode1
L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep LearningCode1
Differentiable Data Augmentation for Contrastive Sentence Representation LearningCode1
RoChBert: Towards Robust BERT Fine-tuning for ChineseCode1
Leveraging Label Correlations in a Multi-label Setting: A Case Study in EmotionCode1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust LearningCode1
Truncation Sampling as Language Model DesmoothingCode1
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuningCode1
Will we run out of data? Limits of LLM scaling based on human-generated dataCode1
N-gram Is Back: Residual Learning of Neural Text Generation with n-gram Language ModelCode1
Synthetic Text Generation with Differential Privacy: A Simple and Practical RecipeCode1
A single-cell gene expression language modelCode1
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry WritingCode1
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR PredictionCode1
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text GenerationCode1
Code4Struct: Code Generation for Few-Shot Event Structure PredictionCode1
Language Model Pre-Training with Sparse Latent TypingCode1
Generative Prompt Tuning for Relation ClassificationCode1
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model InfillingCode1
Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long SequencesCode1
InforMask: Unsupervised Informative Masking for Language Model PretrainingCode1
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal ProofsCode1
Tele-Knowledge Pre-training for Fault AnalysisCode1
Improving Aspect Sentiment Quad Prediction via Template-Order Data AugmentationCode1
The Devil in Linear TransformerCode1
Continued Pretraining for Better Zero- and Few-Shot PromptabilityCode1
Language Model Decomposition: Quantifying the Dependency and Correlation of Language ModelsCode1
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment AnalysisCode1
RARR: Researching and Revising What Language Models Say, Using Language ModelsCode1
Knowledge Prompting in Pre-trained Language Model for Natural Language UnderstandingCode1
Construction Repetition Reduces Information Rate in DialogueCode1
CAB: Comprehensive Attention Benchmarking on Long Sequence ModelingCode1
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text GenerationCode1
Extracting Cultural Commonsense Knowledge at ScaleCode1
M2D2: A Massively Multi-domain Language Modeling DatasetCode1
Language Model Decoding as Likelihood-Utility AlignmentCode1
ImaginaryNet: Learning Object Detectors without Real Images and AnnotationsCode1
AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-TuningCode1
Show:102550
← PrevPage 48 of 284Next →

No leaderboard results yet.