SOTAVerified

Language Modeling

Papers

Showing 17511775 of 14182 papers

TitleStatusHype
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model0
Prompt-based Depth Pruning of Large Language Models0
Reviving The Classics: Active Reward Modeling in Large Language Model AlignmentCode2
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUsCode2
JingFang: A Traditional Chinese Medicine Large Language Model of Expert-Level Medical Diagnosis and Syndrome Differentiation-Based Treatment0
Flatten Graphs as Sequences: Transformers are Scalable Graph Generators0
Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models0
Analyzing Similarity Metrics for Data Selection for Language Model Pretraining0
Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales0
Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD VariantsCode0
LLM-USO: Large Language Model-based Universal Sizing Optimizer0
MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving0
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling0
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues0
Knowledge Synthesis of Photosynthesis Research Using a Large Language Model0
Eliciting Language Model Behaviors with Investigator Agents0
InfoBridge: Mutual Information estimation via Bridge Matching0
Scaling Embedding Layers in Language Models0
Learning to Learn Weight Generation via Local Consistency Diffusion0
Scalable Language Models with Posterior Inference of Latent Thought Vectors0
The Differences Between Direct Alignment Algorithms are a Blur0
FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model0
Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging0
QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-TuningCode0
Show:102550
← PrevPage 71 of 568Next →

No leaderboard results yet.