| ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study | Dec 19, 2024 | Astronomy, Domain Adaptation | Code Available | 0 | 5 |
| CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions | Oct 4, 2024 | Instruction Following, MMLU | Code Available | 0 | 5 |
| Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models | Dec 2, 2024 | MMLU, Multiple-choice | Code Available | 0 | 5 |
| Empowering Cross-lingual Abilities of Instruction-tuned Large Language Models by Translation-following demonstrations | Aug 27, 2023 | Instruction Following, MMLU | Code Available | 0 | 5 |
| Inconsistencies in Masked Language Models | Dec 30, 2022 | LAMBADA, MMLU | Code Available | 0 | 5 |
| EmPO: Emotion Grounding for Empathetic Response Generation through Preference Optimization | Jun 27, 2024 | Diversity, Empathetic Response Generation | Code Available | 0 | 5 |
| Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations | Jul 7, 2025 | Attribute, MMLU | Code Available | 0 | 5 |
| Probing then Editing Response Personality of Large Language Models | Apr 14, 2025 | MMLU | Code Available | 0 | 5 |
| ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation | Jun 16, 2024 | Continual Learning, GSM8K | Code Available | 0 | 5 |
| ChatBench: From Static Benchmarks to Human-AI Evaluation | Mar 22, 2025 | Math, MMLU | Code Available | 0 | 5 |