| Scalable Language Models with Posterior Inference of Latent Thought Vectors | Feb 3, 2025 | Decoder, Language Modeling | Unverified | 0 |
| The Differences Between Direct Alignment Algorithms are a Blur | Feb 3, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model | Feb 3, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning | Feb 3, 2025 | Data Valuation, Language Modeling | Code Available | 0 |
| Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging | Feb 3, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Explaining Context Length Scaling and Bounds for Language Models | Feb 3, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement | Feb 3, 2025 | Computational Efficiency, Diversity | Unverified | 0 |
| ConditionNET: Learning Preconditions and Effects for Execution Monitoring | Feb 3, 2025 | Anomaly Detection, Language Modeling | Unverified | 0 |
| An Inquiry into Datacenter TCO for LLM Inference with FP8 | Feb 3, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods | Feb 3, 2025 | Language Modeling, Language Modelling | Code Available | 1 |