| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactualEconometrics | —Unverified | 0 |
| POSS: Position Specialist Generates Better Draft for Speculative Decoding | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RewardAnything: Generalizable Principle-Following Reward Models | Jun 4, 2025 | Instruction FollowingLarge Language Model | CodeCode Available | 1 |
| EuroLLM-9B: Technical Report | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback | Jun 4, 2025 | Large Language Model | —Unverified | 0 |
| GEM: Empowering LLM for both Embedding Generation and Language Understanding | Jun 4, 2025 | DecoderLarge Language Model | —Unverified | 0 |
| MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | Jun 4, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions | Jun 4, 2025 | Data AugmentationDiversity | —Unverified | 0 |
| "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Seed-Coder: Let the Code Model Curate Data for Itself | Jun 4, 2025 | Code CompletionCode Generation | CodeCode Available | 4 |