| LLMs and the Abstraction and Reasoning Corpus: Successes, Failures, and the Importance of Object-based Representations | May 26, 2023 | ARC, Language Modelling | Code Available | 1 | 5 |
| LLM-SR: Scientific Equation Discovery via Programming with Large Language Models | Apr 29, 2024 | Equation Discovery, Interpretable Machine Learning | Code Available | 1 | 5 |
| LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Nov 11, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis | Oct 23, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins | Sep 19, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | May 29, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 | 5 |
| AuditWen: An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer Generation, Language Modeling | Code Available | 1 | 5 |
| EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Jun 10, 2025 | Binary Classification, Financial Analysis | Code Available | 1 | 5 |