SOTAVerified

Language Modeling

Papers

Showing 35513600 of 14182 papers

TitleStatusHype
QwenLong-CPRS: Towards -LLMs with Dynamic Context Optimization0
Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language ModelsCode0
Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling0
ELDeR: Getting Efficient LLMs through Data-Driven Regularized Layer-wise Pruning0
keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span DetectionCode0
Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied EnvironmentsCode0
Simulating Macroeconomic Expectations using LLM Agents0
Large language model as user daily behavior data generator: balancing population diversity and individual personality0
NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache0
Taming LLMs with Negative Samples: A Reference-Free Framework to Evaluate Presentation Content with Actionable Feedback0
Selection Mechanisms for Sequence Modeling using Linear State Space Models0
SATURN: SAT-based Reinforcement Learning to Unleash Language Model ReasoningCode0
INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling0
Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine0
Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question AnsweringCode0
Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation QualityCode0
DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation0
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning0
Small-to-Large Generalization: Data Influences Models Consistently Across Scale0
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs0
Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning0
CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning0
How do Scaling Laws Apply to Knowledge Graph Engineering Tasks? The Impact of Model Size on Large Language Model Performance0
TensorAR: Refinement is All You Need in Autoregressive Image Generation0
Incremental Sequence Classification with Temporal Consistency0
Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models0
Large Language Model-Empowered Interactive Load Forecasting0
PaTH Attention: Position Encoding via Accumulating Householder Transformations0
Attention with Trained Embeddings Provably Selects Important Tokens0
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning0
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks0
MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing0
On Multilingual Encoder Language Model Compression for Low-Resource Languages0
EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human ActionsCode0
A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLPCode0
Latent Principle Discovery for Language Model Self-Improvement0
CASTILLO: Characterizing Response Length Distributions of Large Language ModelsCode0
Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector0
Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions0
Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning0
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation0
Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning0
Ensembling Sparse Autoencoders0
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective0
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response TheoryCode0
Diagnosing our datasets: How does my language model learn clinical information?Code0
Revealing Language Model Trajectories via Kullback-Leibler Divergence0
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering0
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language ModelCode0
Likelihood Variance as Text Importance for Resampling Texts to Map Language Models0
Show:102550
← PrevPage 72 of 284Next →

No leaderboard results yet.