| EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Jun 10, 2025 | Binary ClassificationFinancial Analysis | CodeCode Available | 1 |
| MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning | Jun 10, 2025 | Allgraph construction | —Unverified | 0 |
| CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models | Jun 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models | Jun 10, 2025 | Backdoor AttackKeyword Spotting | —Unverified | 0 |
| DeepForm: Reasoning Large Language Model for Communication System Formulation | Jun 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Your Agent Can Defend Itself against Backdoor Attacks | Jun 10, 2025 | Large Language Model | —Unverified | 0 |
| LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs | Jun 10, 2025 | Large Language ModelMath | —Unverified | 0 |
| Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness | Jun 10, 2025 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills | Jun 10, 2025 | AnatomyImage Generation | —Unverified | 0 |
| Towards Secure and Private Language Models for Nuclear Power Plants | Jun 10, 2025 | GPULanguage Modeling | —Unverified | 0 |