| Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models | May 22, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Large Language Model-Empowered Interactive Load Forecasting | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PaTH Attention: Position Encoding via Accumulating Householder Transformations | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Attention with Trained Embeddings Provably Selects Important Tokens | May 22, 2025 | Binary ClassificationLanguage Modeling | —Unverified | 0 |
| Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks | May 22, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On Multilingual Encoder Language Model Compression for Low-Resource Languages | May 22, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions | May 22, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP | May 22, 2025 | Continual PretrainingDiagnostic | CodeCode Available | 0 |
| Latent Principle Discovery for Language Model Self-Improvement | May 22, 2025 | ClusteringLanguage Modeling | —Unverified | 0 |
| CASTILLO: Characterizing Response Length Distributions of Large Language Models | May 22, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 |
| Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector | May 21, 2025 | Bias DetectionIn-Context Learning | —Unverified | 0 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ensembling Sparse Autoencoders | May 21, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective | May 21, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Diagnosing our datasets: How does my language model learn clinical information? | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Revealing Language Model Trajectories via Kullback-Leibler Divergence | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering | May 21, 2025 | counterfactualDenoising | —Unverified | 0 |
| Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Likelihood Variance as Text Importance for Resampling Texts to Map Language Models | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |