SOTAVerified

Language Modeling

Papers

Showing 23012325 of 14182 papers

TitleStatusHype
CTRL: A Conditional Transformer Language Model for Controllable GenerationCode1
CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text GenerationCode1
Character-Aware Neural Language ModelsCode1
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language ModelsCode1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance ScalingCode1
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across ModalitiesCode1
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based PolishingCode1
A Critical Analysis of Biased Parsers in Unsupervised ParsingCode1
A Simple Contrastive Learning Objective for Alleviating Neural Text DegenerationCode1
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?Code1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNACode1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed LanguageCode1
DARTS: Differentiable Architecture SearchCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
Aioli: A Unified Optimization Framework for Language Model Data MixingCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction TuningCode1
Knowledge-enhanced Visual-Language Pretraining for Computational PathologyCode1
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge AcquisitionCode1
Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender SystemsCode1
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web VideosCode1
Language Modeling with Editable External KnowledgeCode1
Knowledge Distillation for BERT Unsupervised Domain AdaptationCode1
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent ClassificationCode1
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language ModelsCode1
Show:102550
← PrevPage 93 of 568Next →

No leaderboard results yet.