| PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs | Mar 12, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Reinforcement Learning is all You Need | Mar 12, 2025 | All, Language Modeling | Unverified | 0 |
| Why LLMs Cannot Think and How to Fix It | Mar 12, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| BAMBI: Developing Baby Language Models for Italian | Mar 12, 2025 | Language Acquisition, Language Modeling | Unverified | 0 |
| SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability | Mar 12, 2025 | Disentanglement, Language Modeling | Unverified | 0 |
| NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model | Mar 12, 2025 | Hallucination, Language Modeling | Code Available | 0 |
| Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | Mar 12, 2025 | CPU, GPU | Unverified | 0 |
| Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | Mar 12, 2025 | Deep Learning, Few-Shot Learning | Unverified | 0 |
| SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Mar 12, 2025 | Autonomous Driving, Bench2Drive | Code Available | 3 |
| Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents | Mar 11, 2025 | Information Retrieval, Language Modeling | Code Available | 0 |
| D3PO: Preference-Based Alignment of Discrete Diffusion Models | Mar 11, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | Mar 11, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Understanding the Quality-Diversity Trade-off in Diffusion Language Models | Mar 11, 2025 | Diversity, Language Modeling | Code Available | 0 |
| Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback | Mar 11, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| LongProLIP: A Probabilistic Vision-Language Model with Long Context Text | Mar 11, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Training Plug-n-Play Knowledge Modules with Deep Context Distillation | Mar 11, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | Chatbot, Language Modeling | Code Available | 1 |
| A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models | Mar 11, 2025 | Decision Making, global-optimization | Unverified | 0 |
| Position-Aware Depth Decay Decoding (D^3): Boosting Large Language Model Inference Efficiency | Mar 11, 2025 | GSM8K, Language Modeling | Unverified | 0 |
| Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations | Mar 11, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| OASIS: Order-Augmented Strategy for Improved Code Search | Mar 11, 2025 | Code Search, Language Modeling | Unverified | 0 |
| Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study | Mar 11, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Mar 11, 2025 | counterfactual, Language Modeling | Code Available | 1 |
| Mellow: a small audio language model for reasoning | Mar 11, 2025 | Audio captioning, Language Modeling | Code Available | 2 |
| Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation | Mar 11, 2025 | Domain Generalization, Language Modeling | Code Available | 0 |