SOTAVerified

Language Modeling

Papers

Showing 34263450 of 14182 papers

TitleStatusHype
GradPower: Powering Gradients for Faster Language Model Pre-Training0
FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model EvaluationCode0
From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning0
TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data SynthesisCode0
Transformers Are Universally Consistent0
Intuitionistic Fuzzy Sets for Large Language Model Data Annotation: A Novel Approach to Side-by-Side Preference Labeling0
Circuit Stability Characterizes Language Model GeneralizationCode0
HardTests: Synthesizing High-Quality Test Cases for LLM Coding0
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and FinetuningCode0
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation0
MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform0
Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreementCode0
Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model0
SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference0
FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model CompressionCode0
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners0
An Empirical Study of Federated Prompt Learning for Vision Language Model0
Discriminative Policy Optimization for Token-Level Reward ModelsCode0
Beam-Guided Knowledge Replay for Knowledge-Rich Image Captioning using Vision-Language Model0
VLM-RRT: Vision Language Model Guided RRT Search for Autonomous UAV Navigation0
Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data0
Reducing Latency in LLM-Based Natural Language Commands Processing for Robot Navigation0
Spoken Language Modeling with Duration-Penalized Self-Supervised UnitsCode0
Learning Parametric Distributions from Samples and PreferencesCode0
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training0
Show:102550
← PrevPage 138 of 568Next →

No leaderboard results yet.