SOTAVerified

Language Modeling

Papers

Showing 21012150 of 14182 papers

TitleStatusHype
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model BiasCode1
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code GenerationCode1
CDLM: Cross-Document Language ModelingCode1
Cross-Align: Modeling Deep Cross-lingual Interactions for Word AlignmentCode1
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language ModelCode1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoECode1
CriticEval: Evaluating Large Language Model as CriticCode1
Critic-Guided Decoding for Controlled Text GenerationCode1
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language ModelingCode1
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language ModelCode1
AbGPT: De Novo Antibody Design via Generative Language ModelingCode1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and OptimizationCode1
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed MetadataCode1
Iterative Few-shot Semantic Segmentation from Image Label TextCode1
ITER: Iterative Transformer-based Entity Recognition and Relation ExtractionCode1
A Study of Generative Large Language Model for Medical Research and HealthcareCode1
Crafting Large Language Models for Enhanced InterpretabilityCode1
CREAM: Consistency Regularized Self-Rewarding Language ModelsCode1
IterVM: Iterative Vision Modeling Module for Scene Text RecognitionCode1
CPT: Efficient Deep Neural Network Training via Cyclic PrecisionCode1
Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language ModelsCode1
CPM: A Large-scale Generative Chinese Pre-trained Language ModelCode1
A Better Way to Do Masked Language Model ScoringCode1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and GenerationCode1
IvyGPT: InteractiVe Chinese pathwaY language model in medical domainCode1
Counterfactual Token Generation in Large Language ModelsCode1
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from TextCode1
Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3Code1
KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation ExtractionCode1
AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language ModelsCode1
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language ModelingCode1
Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model RecommendationCode1
Counterfactual Data Augmentation for Neural Machine TranslationCode1
InvestLM: A Large Language Model for Investment using Financial Domain Instruction TuningCode1
cosFormer: Rethinking Softmax in AttentionCode1
Investigating Fairness Disparities in Peer Review: A Language Model Enhanced ApproachCode1
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOMCode1
IoT-LM: Large Multisensory Language Models for the Internet of ThingsCode1
Is Child-Directed Speech Effective Training Data for Language Models?Code1
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model InfillingCode1
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply ChainsCode1
Interpreting Song Lyrics with an Audio-Informed Pre-trained Language ModelCode1
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language ModelsCode1
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-TuningCode1
Invariant Language ModelingCode1
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model GenerationCode1
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based PolishingCode1
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue CoreferenceCode1
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language ModelsCode1
Copy Is All You NeedCode1
Show:102550
← PrevPage 43 of 284Next →

No leaderboard results yet.