| The Differences Between Direct Alignment Algorithms are a Blur | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scalable Language Models with Posterior Inference of Latent Thought Vectors | Feb 3, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Position: Towards a Responsible LLM-empowered Multi-Agent Systems | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| QLESS: A Quantized Approach for Data Valuation and Selection in Large Language Model Fine-Tuning | Feb 3, 2025 | Data ValuationLanguage Modeling | CodeCode Available | 0 |
| Scaling Embedding Layers in Language Models | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Explaining Context Length Scaling and Bounds for Language Models | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Learning to Learn Weight Generation via Local Consistency Diffusion | Feb 3, 2025 | Domain GeneralizationFew-Shot Learning | —Unverified | 0 |
| InfoBridge: Mutual Information estimation via Bridge Matching | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |