SOTAVerified

Language Modeling

Papers

Showing 2351–2400 of 14182 papers

Title | Status | Hype
Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation | Code | 1
EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Code | 1
End-to-end lyrics Recognition with Voice to Singing Style Transfer | Code | 1
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer | Code | 1
Enabling Language Models to Fill in the Blanks | Code | 1
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! | Code | 1
Chinese Lexical Simplification | Code | 1
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators | Code | 1
Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction | Code | 1
Enhancing RL Safety with Counterfactual LLM Reasoning | Code | 1
Emotion-Aware Transformer Encoder for Empathetic Dialogue Generation | Code | 1
Empower Entity Set Expansion via Language Model Probing | Code | 1
FLEX: Unifying Evaluation for Few-Shot NLP | Code | 1
Chinese Spelling Correction as Rephrasing Language Model | Code | 1
Fly-Swat or Cannon? Cost-Effective Language Model Choice via Meta-Modeling | Code | 1
FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | Code | 1
EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling | Code | 1
EMMA: Efficient Visual Alignment in Multi-Modal LLMs | Code | 1
EmojiLM: Modeling the New Emoji Language | Code | 1
Empowering Large Language Model Agents through Action Learning | Code | 1
A Kernel-Based View of Language Model Fine-Tuning | Code | 1
Foundation Transformers | Code | 1
Character-Aware Neural Language Models | Code | 1
f-PO: Generalizing Preference Optimization with f-divergence Minimization | Code | 1
Emergent Analogical Reasoning in Large Language Models | Code | 1
A Critical Analysis of Biased Parsers in Unsupervised Parsing | Code | 1
ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction Tuning | Code | 1
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network | Code | 1
Clover: Towards A Unified Video-Language Alignment and Fusion Model | Code | 1
Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models | Code | 1
Frustratingly Simple Pretraining Alternatives to Masked Language Modeling | Code | 1
Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Code | 1
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos | Code | 1
Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer | Code | 1
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models | Code | 1
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users | Code | 1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning | Code | 1
Gandalf the Red: Adaptive Security for LLMs | Code | 1
ELI5: Long Form Question Answering | Code | 1
Gated Linear Attention Transformers with Hardware-Efficient Training | Code | 1
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation | Code | 1
Efficient recurrent architectures through activity sparsity and sparse back-propagation through time | Code | 1
ARS: Automatic Routing Solver with Large Language Models | Code | 1
Generalization through Memorization: Nearest Neighbor Language Models | Code | 1
ELECTRAMed: a new pre-trained language representation model for biomedical NLP | Code | 1
CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking | Code | 1
CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering | Code | 1
Citekit: A Modular Toolkit for Large Language Model Citation Generation | Code | 1
CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation | Code | 1
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Code | 1
Page 48 of 284

No leaderboard results yet.