| Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate | Jun 4, 2025 | Language Modeling | Code Available | 0 |
| Rectified Sparse Attention | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| EuroLLM-9B: Technical Report | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | Jun 4, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| A Statistical Physics of Language Model Reasoning | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactual, Econometrics | Unverified | 0 |
| Go-Browse: Training Web Agents with Structured Exploration | Jun 4, 2025 | Efficient Exploration, Language Modeling | Unverified | 0 |