| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | HallucinationLarge Language Model | CodeCode Available | 0 | 5 |
| Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro | Jan 1, 2025 | Data AugmentationLanguage Modeling | CodeCode Available | 0 | 5 |
| Know Your Needs Better: Towards Structured Understanding of Marketer Demands with Analogical Reasoning Augmented LLMs | Jan 9, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 | 5 |
| SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation | Oct 17, 2024 | GSM8KLanguage Modeling | CodeCode Available | 0 | 5 |
| StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation | Aug 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Language Model Behavior: A Comprehensive Survey | Mar 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| KL Penalty Control via Perturbation for Direct Preference Optimization | Feb 18, 2025 | ChatbotLanguage Modeling | CodeCode Available | 0 | 5 |
| Adaptive Graph Pruning for Multi-Agent Communication | Jun 3, 2025 | Code GenerationLarge Language Model | CodeCode Available | 0 | 5 |
| KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney Disease | Mar 6, 2025 | ChunkingLanguage Modeling | CodeCode Available | 0 | 5 |