| ANPMI: Assessing the True Comprehension Capabilities of LLMs for Multiple Choice Questions | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Kanana: Compute-efficient Bilingual Language Models | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Gender Bias in German Machine Translation | Feb 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Improving Representation Learning of Complex Critical Care Data with ICU-BERT | Feb 26, 2025 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Large Language Model Driven Agents for Simulating Echo Chamber Formation | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems | Feb 25, 2025 | Bayesian OptimizationHyperparameter Optimization | —Unverified | 0 |
| AMPO: Active Multi-Preference Optimization | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | Feb 25, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |