| QwenLong-CPRS: Towards -LLMs with Dynamic Context Optimization | May 23, 2025 | 4kLanguage Modeling | —Unverified | 0 |
| Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models | May 23, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling | May 23, 2025 | Autonomous DrivingCollision Avoidance | —Unverified | 0 |
| ELDeR: Getting Efficient LLMs through Data-Driven Regularized Layer-wise Pruning | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection | May 23, 2025 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Simulating Macroeconomic Expectations using LLM Agents | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large language model as user daily behavior data generator: balancing population diversity and individual personality | May 23, 2025 | Data AugmentationDiversity | —Unverified | 0 |
| NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Taming LLMs with Negative Samples: A Reference-Free Framework to Evaluate Presentation Content with Actionable Feedback | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Selection Mechanisms for Sequence Modeling using Linear State Space Models | May 23, 2025 | Fault DetectionLanguage Modeling | —Unverified | 0 |
| SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning | May 22, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | May 22, 2025 | Causal InferenceDrug Discovery | —Unverified | 0 |
| Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering | May 22, 2025 | Global FactsLanguage Modeling | CodeCode Available | 0 |
| Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality | May 22, 2025 | Abstractive Text SummarizationInformativeness | CodeCode Available | 0 |
| DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Small-to-Large Generalization: Data Influences Models Consistently Across Scale | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Edge-First Language Model Inference: Models, Metrics, and Tradeoffs | May 22, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How do Scaling Laws Apply to Knowledge Graph Engineering Tasks? The Impact of Model Size on Large Language Model Performance | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TensorAR: Refinement is All You Need in Autoregressive Image Generation | May 22, 2025 | AllImage Generation | —Unverified | 0 |
| Incremental Sequence Classification with Temporal Consistency | May 22, 2025 | ClassificationLanguage Modeling | —Unverified | 0 |
| Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models | May 22, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Large Language Model-Empowered Interactive Load Forecasting | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PaTH Attention: Position Encoding via Accumulating Householder Transformations | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Attention with Trained Embeddings Provably Selects Important Tokens | May 22, 2025 | Binary ClassificationLanguage Modeling | —Unverified | 0 |
| Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks | May 22, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On Multilingual Encoder Language Model Compression for Low-Resource Languages | May 22, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions | May 22, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP | May 22, 2025 | Continual PretrainingDiagnostic | CodeCode Available | 0 |
| Latent Principle Discovery for Language Model Self-Improvement | May 22, 2025 | ClusteringLanguage Modeling | —Unverified | 0 |
| CASTILLO: Characterizing Response Length Distributions of Large Language Models | May 22, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 |
| Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector | May 21, 2025 | Bias DetectionIn-Context Learning | —Unverified | 0 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ensembling Sparse Autoencoders | May 21, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective | May 21, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Diagnosing our datasets: How does my language model learn clinical information? | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Revealing Language Model Trajectories via Kullback-Leibler Divergence | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering | May 21, 2025 | counterfactualDenoising | —Unverified | 0 |
| Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Likelihood Variance as Text Importance for Resampling Texts to Map Language Models | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |