SOTAVerified

Language Modeling

Papers

Showing 17511800 of 14182 papers

TitleStatusHype
Prompt-based Depth Pruning of Large Language Models0
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model0
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUsCode2
JingFang: A Traditional Chinese Medicine Large Language Model of Expert-Level Medical Diagnosis and Syndrome Differentiation-Based Treatment0
Reviving The Classics: Active Reward Modeling in Large Language Model AlignmentCode2
LLM-USO: Large Language Model-based Universal Sizing Optimizer0
Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales0
Flatten Graphs as Sequences: Transformers are Scalable Graph Generators0
MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving0
Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD VariantsCode0
Analyzing Similarity Metrics for Data Selection for Language Model Pretraining0
Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models0
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues0
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling0
Knowledge Synthesis of Photosynthesis Research Using a Large Language Model0
Eliciting Language Model Behaviors with Investigator Agents0
InfoBridge: Mutual Information estimation via Bridge Matching0
Scaling Embedding Layers in Language Models0
Learning to Learn Weight Generation via Local Consistency Diffusion0
Scalable Language Models with Posterior Inference of Latent Thought Vectors0
The Differences Between Direct Alignment Algorithms are a Blur0
QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-TuningCode0
Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement0
Explaining Context Length Scaling and Bounds for Language ModelsCode0
Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging0
FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model0
An Inquiry into Datacenter TCO for LLM Inference with FP80
Position: Towards a Responsible LLM-empowered Multi-Agent Systems0
Fine-Tuning Discrete Diffusion Models with Policy Gradient MethodsCode1
Simulating Rumor Spreading in Social Networks using LLM AgentsCode1
ConditionNET: Learning Preconditions and Effects for Execution Monitoring0
Language Models Use Trigonometry to Do Addition0
Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search0
Agent-Based Uncertainty Awareness Improves Automated Radiology Report Labeling with an Open-Source Large Language Model0
Vision-centric Token Compression in Large Language Model0
Decision-informed Neural Networks with Large Language Model Integration for Portfolio Optimization0
LLM Safety Alignment is Divergence Estimation in DisguiseCode0
Avoiding exp(R_max) scaling in RLHF through Preference-based ExplorationCode0
LIBRA: Measuring Bias of Large Language Model from a Local ContextCode0
A statistically consistent measure of Semantic Variability using Language Models0
Doing More with Less -- Implementing Routing Strategies in Large Language Model-Based Systems: An Extended Survey0
INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation0
Speculative Ensemble: Fast Large Language Model Ensemble via SpeculationCode1
Enhancing Token Filtering Efficiency in Large Language Model Training with Collider0
MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-ProcessingCode2
OrcaLoca: An LLM Agent Framework for Software Issue Localization0
Resolving Editing-Unlearning Conflicts: A Knowledge Codebook Framework for Large Language Model Updating0
Can AI Solve the Peer Review Crisis? A Large Scale Cross Model Experiment of LLMs' Performance and Biases in Evaluating over 1000 Economics Papers0
Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach0
Show:102550
← PrevPage 36 of 284Next →

No leaderboard results yet.