| QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation | Mar 7, 2025 | Language Modeling | Unverified | 0 |
| GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation | Mar 7, 2025 | Large Language Model, Medical Report Generation | Code Available | 0 |
| TPU-Gen: LLM-Driven Custom Tensor Processing Unit Generator | Mar 7, 2025 | Large Language Model, RAG | Unverified | 0 |
| Leveraging Approximate Caching for Faster Retrieval-Augmented Generation | Mar 7, 2025 | Language Modeling | Unverified | 0 |
| Better Process Supervision with Bi-directional Rewarding Signals | Mar 6, 2025 | Language Modeling | Unverified | 0 |
| Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model | Mar 6, 2025 | General Knowledge, Image Captioning | Code Available | 2 |
| AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management | Mar 6, 2025 | Language Modeling | Unverified | 0 |
| Measuring temporal effects of agent knowledge by date-controlled tool use | Mar 6, 2025 | Language Modeling | Unverified | 0 |
| KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney Disease | Mar 6, 2025 | Chunking, Language Modeling | Code Available | 0 |
| PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks | Mar 6, 2025 | Document Understanding, Language Modeling | Unverified | 0 |