SOTAVerified

Language Modeling

Papers

Showing 1020110250 of 14182 papers

TitleStatusHype
AcTune: Uncertainty-aware Active Self-Training for Semi-Supervised Active Learning with Pretrained Language ModelsCode1
DOCmT5: Document-Level Pretraining of Multilingual Language Models0
Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge0
Lacuna Reconstruction: Self-supervised Pre-training for Low-Resource Historical Document Transcription0
Learning To Retrieve Prompts for In-Context LearningCode1
Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems0
Goal-Directed Story Generation: Augmenting Generative Language Models with Reinforcement Learning0
UNIREX: A Unified Learning Framework for Language Model Rationale ExtractionCode1
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation0
Assisted Text Annotation Using Active Learning to Achieve High Quality with Little Effort0
Applying SoftTriple Loss for Supervised Language Model Fine Tuning0
Improving Conversational Recommendation Systems' Quality with Context-Aware Item Meta InformationCode1
Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings0
Value Retrieval with Arbitrary Queries for Form-like DocumentsCode1
Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language ModelingCode0
Towards Interactive Language Modeling0
Deciphering antibody affinity maturation with language models and weakly supervised learningCode1
Few-shot Multi-hop Question Answering over Knowledge Base0
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model0
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework0
CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising0
From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model CompressionCode0
Large Language Models are not Models of Natural Language: they are Corpus Models0
Controlled Cue Generation for Play Scripts0
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts0
Surfer100: Generating Surveys From Web Resources, Wikipedia-style0
Step-unrolled Denoising Autoencoders for Text GenerationCode1
Efficient and Reliable Overlay Networks for Decentralized Federated Learning0
Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks0
Discourse-Aware Soft Prompting for Text Generation0
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation0
MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based FinetuningCode1
From Scattered Sources to Comprehensive Technology Landscape: A Recommendation-based Retrieval Approach0
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents0
MLP Architectures for Vision-and-Language Modeling: An Empirical StudyCode1
Zero-Shot Recommendation as Language ModelingCode1
JABER and SABER: Junior and Senior Arabic BERt0
A deep language model to predict metabolic network equilibria0
GKS: Graph-based Knowledge Selector for Task-oriented Dialog System0
Automated Story Generation as Question-Answering0
Quantifying Adaptability in Pre-trained Language Models with 500 TasksCode1
An Effective GCN-based Hierarchical Multi-label classification for Protein Function Prediction0
DIBERT: Dependency Injected Bidirectional Encoder Representations from TransformersCode0
Gaudí: Conversational Interactions with Deep Representations to Generate Image Collections0
Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning0
Causal Distillation for Language ModelsCode1
Representation Learning for Conversational Data using Discourse Mutual Information Maximization0
Single-Shot Black-Box Adversarial Attacks Against Malware Detectors: A Causal Language Model Approach0
Siamese BERT-based Model for Web Search Relevance Ranking Evaluated on a New Czech DatasetCode1
Show:102550
← PrevPage 205 of 284Next →

No leaderboard results yet.