| OpsEval: A Comprehensive IT Operations Benchmark Suite for Large Language Models | Oct 11, 2023 | HallucinationIn-Context Learning | CodeCode Available | 1 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation | Oct 4, 2023 | HallucinationText Generation | CodeCode Available | 1 |
| HallE-Control: Controlling Object Hallucination in Large Multimodal Models | Oct 3, 2023 | AttributeDecoder | CodeCode Available | 1 |
| LLM Lies: Hallucinations are not Bugs, but Features as Adversarial Examples | Oct 2, 2023 | Hallucination | CodeCode Available | 1 |
| BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models | Oct 2, 2023 | HallucinationRetrieval | CodeCode Available | 1 |
| Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Oct 1, 2023 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation | Sep 29, 2023 | 3D Object DetectionAttribute | CodeCode Available | 1 |
| Self-supervised Cross-view Representation Reconstruction for Change Captioning | Sep 28, 2023 | Caption GenerationHallucination | CodeCode Available | 1 |
| Lyra: Orchestrating Dual Correction in Automated Theorem Proving | Sep 27, 2023 | Automated Theorem ProvingHallucination | CodeCode Available | 1 |