| Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework | Sep 24, 2024 | Benchmarkingcounterfactual | CodeCode Available | 0 |
| XTRUST: On the Multilingual Trustworthiness of Large Language Models | Sep 24, 2024 | EthicsFairness | CodeCode Available | 1 |
| Planning in the Dark: LLM-Symbolic Planning Pipeline without Experts | Sep 24, 2024 | Hallucination | —Unverified | 0 |
| AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Support | Sep 24, 2024 | HallucinationQuestion Answering | —Unverified | 0 |
| Long-horizon Embodied Planning with Implicit Logical Inference and Hallucination Mitigation | Sep 24, 2024 | DiversityHallucination | —Unverified | 0 |
| Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection | Sep 24, 2024 | HallucinationSemantic Parsing | —Unverified | 0 |
| Parse Trees Guided LLM Prompt Compression | Sep 23, 2024 | Hallucination | CodeCode Available | 0 |
| A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor? | Sep 23, 2024 | HallucinationMedQA | —Unverified | 0 |
| Enhancing Scientific Reproducibility Through Automated BioCompute Object Creation Using Retrieval-Augmented Generation from Publications | Sep 23, 2024 | HallucinationLong-Context Understanding | —Unverified | 0 |
| Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization | Sep 22, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 0 |