SOTAVerified

Language Modeling

Papers

Showing 14511500 of 14182 papers

TitleStatusHype
hmBERT: Historical Multilingual Language Models for Named Entity RecognitionCode1
History Matters: Temporal Knowledge Editing in Large Language ModelCode1
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language modelCode1
CDLM: Cross-Document Language ModelingCode1
Hierarchical Transformers Are More Efficient Language ModelsCode1
High-Dimension Human Value Representation in Large Language ModelsCode1
AMPERSAND: Argument Mining for PERSuAsive oNline DiscussionsCode1
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language ModelingCode1
CriticEval: Evaluating Large Language Model as CriticCode1
How does the pre-training objective affect what large language models learn about linguistic properties?Code1
Hexatagging: Projective Dependency Parsing as TaggingCode1
Crafting Large Language Models for Enhanced InterpretabilityCode1
HetSeq: Distributed GPU Training on Heterogeneous InfrastructureCode1
Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)Code1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World ClaimsCode1
Heterogeneous Graph Reasoning for Fact Checking over Texts and TablesCode1
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language ModelCode1
CPM: A Large-scale Generative Chinese Pre-trained Language ModelCode1
CPLLM: Clinical Prediction with Large Language ModelsCode1
A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue SystemsCode1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and GenerationCode1
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward HackingCode1
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from TextCode1
Cross-model Control: Improving Multiple Large Language Models in One-time TrainingCode1
Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue SystemsCode1
CPT: Efficient Deep Neural Network Training via Cyclic PrecisionCode1
Automatic Controllable Product Copywriting for E-CommerceCode1
AdaSplash: Adaptive Sparse Flash AttentionCode1
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language ModelCode1
Counterfactual Token Generation in Large Language ModelsCode1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and OptimizationCode1
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-trainingCode1
Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model PredictionsCode1
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry WritingCode1
CREAM: Consistency Regularized Self-Rewarding Language ModelsCode1
Automatic Evaluation of Attribution by Large Language ModelsCode1
How far is Language Model from 100% Few-shot Named Entity Recognition in Medical DomainCode1
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate SpeechCode1
Hallucinations in Large Multilingual Translation ModelsCode1
Critic-Guided Decoding for Controlled Text GenerationCode1
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model BiasCode1
Cross-Align: Modeling Deep Cross-lingual Interactions for Word AlignmentCode1
AMR Parsing via Graph-Sequence Iterative InferenceCode1
Automatic Label Sequence Generation for Prompting Sequence-to-sequence ModelsCode1
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue CoreferenceCode1
Handwritten Mathematical Expression Recognition with Bidirectionally Trained TransformerCode1
cosFormer: Rethinking Softmax in AttentionCode1
Automatic Model Selection with Large Language Models for ReasoningCode1
HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials ScienceCode1
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model InfillingCode1
Show:102550
← PrevPage 30 of 284Next →

No leaderboard results yet.