| A Survey on Large Language Model based Human-Agent Systems | May 1, 2025 | Human Agent CollaborationLanguage Modeling | CodeCode Available | 0 |
| LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems | May 1, 2025 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation | Apr 30, 2025 | DiagnosticLarge Language Model | CodeCode Available | 1 |
| Confidence in Large Language Model Evaluation: A Bayesian Approach to Limited-Sample Challenges | Apr 30, 2025 | Bayesian InferenceLanguage Model Evaluation | —Unverified | 0 |
| DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Apr 30, 2025 | Automated Theorem ProvingLarge Language Model | CodeCode Available | 5 |
| Does the Prompt-based Large Language Model Recognize Students' Demographics and Introduce Bias in Essay Scoring? | Apr 30, 2025 | Automated Essay ScoringFairness | —Unverified | 0 |
| Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning | Apr 30, 2025 | Large Language ModelMotion Planning | —Unverified | 0 |
| Consistency-aware Fake Videos Detection on Short Video Platforms | Apr 30, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 0 |
| MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Apr 30, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning | Apr 30, 2025 | Large Language Model | —Unverified | 0 |