SOTAVerified

Language Modeling

Papers

Showing 2851-2900 of 14182 papers

Title | Status | Hype
GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training | Code | 1
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining | Code | 1
DOBF: A Deobfuscation Pre-Training Objective for Programming Languages | Code | 1
End-to-end Audio-visual Speech Recognition with Conformers | Code | 1
Unsupervised Extractive Summarization using Pointwise Mutual Information | Code | 1
Proof Artifact Co-training for Theorem Proving with Language Models | Code | 1
AuGPT: Auxiliary Tasks and Data Augmentation for End-To-End Dialogue with Pre-Trained Language Models | Code | 1
Unifying Vision-and-Language Tasks via Text Generation | Code | 1
Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript | Code | 1
Generative Spoken Language Modeling from Raw Audio | Code | 1
LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content | Code | 1
PolyLM: Learning about Polysemy through Language Modeling | Code | 1
EGFI: Drug-Drug Interaction Extraction and Generation with Fusion of Enriched Entity and Sentence Information | Code | 1
CPT: Efficient Deep Neural Network Training via Cyclic Precision | Code | 1
PalmTree: Learning an Assembly Language Model for Instruction Embedding | Code | 1
Persistent Anti-Muslim Bias in Large Language Models | Code | 1
Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement Learning | Code | 1
Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing | Code | 1
Multitask Learning for Emotion and Personality Detection | Code | 1
PhoNLP: A joint multi-task learning model for Vietnamese part-of-speech tagging, named entity recognition and dependency parsing | Code | 1
Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events | Code | 1
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation | Code | 1
CDLM: Cross-Document Language Modeling | Code | 1
Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers | Code | 1
Discovering Autoregressive Orderings with Variational Inference | Code | 1
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Code | 1
Not All Memories are Created Equal: Learning to Expire | Code | 1
WARP: Word-level Adversarial ReProgramming | Code | 1
K-PLUG: Knowledge-Injected Pre-Trained Language Model for Natural Language Understanding and Generation | Code | 1
Shortformer: Better Language Modeling using Shorter Inputs | Code | 1
Unified Mandarin TTS Front-end Based on Distilled BERT Model | Code | 1
AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Code | 1
AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Code | 1
Generating Query Focused Summaries from Query-Free Resources | Code | 1
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning | Code | 1
RealFormer: Transformer Likes Residual Attention | Code | 1
Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training | Code | 1
Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language Model | Code | 1
Extracting Training Data from Large Language Models | Code | 1
Towards Neural Programming Interfaces | Code | 1
Fusing Context Into Knowledge Graph for Commonsense Question Answering | Code | 1
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption | Code | 1
Pre-training Protein Language Models with Label-Agnostic Binding Pairs Enhances Performance in Downstream Tasks | Code | 1
Multi-Task Learning for Knowledge Graph Completion with Pre-trained Language Models | Code | 1
End-to-End Automatic Speech Recognition for Gujarati | Code | 1
Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection | Code | 1
Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet | Code | 1
Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework | Code | 1
CPM: A Large-scale Generative Chinese Pre-trained Language Model | Code | 1
SentiX: A Sentiment-Aware Pre-Trained Model for Cross-Domain Sentiment Analysis | Code | 1

No leaderboard results yet.