SOTAVerified

Language Modeling

Papers

Showing 24012450 of 14182 papers

TitleStatusHype
A Critical Analysis of Biased Parsers in Unsupervised ParsingCode1
ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction TuningCode1
EndoChat: Grounded Multimodal Large Language Model for Endoscopic SurgeryCode1
Generative Prompt Tuning for Relation ClassificationCode1
Enhancing Biomedical Relation Extraction with DirectionalityCode1
Generative Spoken Language Modeling from Raw AudioCode1
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web VideosCode1
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question AnsweringCode1
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!Code1
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language ModelsCode1
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign UsersCode1
Enabling Language Models to Fill in the BlanksCode1
Empower Entity Set Expansion via Language Model ProbingCode1
ARS: Automatic Routing Solver with Large Language ModelsCode1
Empowering Large Language Model Agents through Action LearningCode1
Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language InterfacesCode1
GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language ModelCode1
GLADIS: A General and Large Acronym Disambiguation BenchmarkCode1
EmojiLM: Modeling the New Emoji LanguageCode1
Emotion-Aware Transformer Encoder for Empathetic Dialogue GenerationCode1
Empowering Large Language Model for Continual Video Question Answering with Collaborative PromptingCode1
GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue GenerationCode1
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product OperatorsCode1
Enhancing Chinese Pre-trained Language Model via Heterogeneous Linguistics GraphCode1
Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model EvaluationCode1
Extracting Cultural Commonsense Knowledge at ScaleCode1
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text GenerationCode1
COCO-LM: Correcting and Contrasting Text Sequences for Language Model PretrainingCode1
Code4Struct: Code Generation for Few-Shot Event Structure PredictionCode1
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model EvaluationCode1
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than GeneratorsCode1
ELECTRAMed: a new pre-trained language representation model for biomedical NLPCode1
ELI5: Long Form Question AnsweringCode1
CodeArt: Better Code Models by Attention Regularization When Symbols Are LackingCode1
Efficient recurrent architectures through activity sparsity and sparse back-propagation through timeCode1
Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt VerbalizerCode1
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head CheckpointsCode1
Gradient Ascent Post-training Enhances Language Model GeneralizationCode1
Emergent Analogical Reasoning in Large Language ModelsCode1
Efficient Pre-training of Masked Language Model via Concept-based Curriculum MaskingCode1
GraphLLM: Boosting Graph Reasoning Ability of Large Language ModelCode1
Graph Neural Prompting with Large Language ModelsCode1
EGFI: Drug-Drug Interaction Extraction and Generation with Fusion of Enriched Entity and Sentence InformationCode1
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific LiteratureCode1
Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge EncodingCode1
GraPPa: Grammar-Augmented Pre-Training for Table Semantic ParsingCode1
G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable RecommendationCode1
Efficient Nearest Neighbor Language ModelsCode1
ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical NotesCode1
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded HallucinationsCode1
Show:102550
← PrevPage 49 of 284Next →

No leaderboard results yet.