| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AuditWen:An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Jun 10, 2025 | Binary ClassificationFinancial Analysis | CodeCode Available | 1 |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Jun 24, 2024 | Chinese Spell CheckingLanguage Modeling | CodeCode Available | 1 |
| Gandalf the Red: Adaptive Security for LLMs | Jan 14, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 |
| From Text to Pixel: Advancing Long-Context Understanding in MLLMs | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models | Apr 9, 2024 | FairnessGPU | CodeCode Available | 1 |
| FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions | May 28, 2023 | AttributeImage Captioning | CodeCode Available | 1 |