| Token Weighting for Long-Range Language Modeling | Mar 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation | Mar 12, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Toward a method for LLM-enabled Indoor Navigation | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Medical Large Language Model Benchmarks Should Prioritize Construct Validity | Mar 12, 2025 | Clinical KnowledgeLanguage Modeling | —Unverified | 0 |
| Leveraging Knowledge Graphs and LLMs for Context-Aware Messaging | Mar 12, 2025 | Entity LinkingEvent Detection | —Unverified | 0 |
| Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BAMBI: Developing Baby Language Models for Italian | Mar 12, 2025 | Language AcquisitionLanguage Modeling | —Unverified | 0 |
| Global Position Aware Group Choreography using Large Language Model | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback | Mar 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Accelerating MoE Model Inference with Expert Sharding | Mar 11, 2025 | DecoderGPU | —Unverified | 0 |