| Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Jun 28, 2023 | HallucinationKnowledge Graphs | CodeCode Available | 5 |
| Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | Jun 26, 2023 | HallucinationVisual Question Answering | CodeCode Available | 2 |
| Evidence for Reduced Sensory Precision and Increased Reliance on Priors in Hallucination-Prone Individuals in a General Population Sample | Jun 24, 2023 | Hallucination | —Unverified | 0 |
| IERL: Interpretable Ensemble Representation Learning -- Combining CrowdSourced Knowledge and Distributed Semantic Representations | Jun 24, 2023 | Ensemble LearningHallucination | —Unverified | 0 |
| ToolQA: A Dataset for LLM Question Answering with External Tools | Jun 23, 2023 | HallucinationQuestion Answering | CodeCode Available | 2 |
| A Survey on Multimodal Large Language Models | Jun 23, 2023 | HallucinationIn-Context Learning | —Unverified | 0 |
| Hallucination is the last thing you need | Jun 20, 2023 | Fact CheckingHallucination | —Unverified | 0 |
| Vision Transformer with Attention Map Hallucination and FFN Compaction | Jun 19, 2023 | Dimensionality ReductionHallucination | —Unverified | 0 |
| Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond | Jun 16, 2023 | BenchmarkingEvidence Selection | CodeCode Available | 1 |
| Pushing the Limits of ChatGPT on NLP Tasks | Jun 16, 2023 | Dependency ParsingEvent Extraction | —Unverified | 0 |
| Explaining Legal Concepts with Augmented Large Language Models (GPT-4) | Jun 15, 2023 | HallucinationInformation Retrieval | —Unverified | 0 |
| KoLA: Carefully Benchmarking World Knowledge of Large Language Models | Jun 15, 2023 | BenchmarkingHallucination | CodeCode Available | 1 |
| LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models | Jun 15, 2023 | HallucinationImage Captioning | CodeCode Available | 2 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Trapping LLM Hallucinations Using Tagged Context Prompts | Jun 9, 2023 | Hallucination | —Unverified | 0 |
| Defocus to focus: Photo-realistic bokeh rendering by fusing defocus and radiance priors | Jun 7, 2023 | Hallucination | —Unverified | 0 |
| Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning | Jun 6, 2023 | Hallucinationreinforcement-learning | CodeCode Available | 0 |
| Do Language Models Know When They're Hallucinating References? | May 29, 2023 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| An Investigation of Evaluation Metrics for Automated Medical Note Generation | May 27, 2023 | Graph EmbeddingHallucination | CodeCode Available | 0 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision MakingHallucination | CodeCode Available | 1 |
| Getting Sick After Seeing a Doctor? Diagnosing and Mitigating Knowledge Conflicts in Event Temporal Reasoning | May 24, 2023 | counterfactualData Augmentation | CodeCode Available | 0 |
| Enabling Large Language Models to Generate Text with Citations | May 24, 2023 | HallucinationRetrieval | CodeCode Available | 2 |
| Lawyer LLaMA Technical Report | May 24, 2023 | ArticlesHallucination | CodeCode Available | 2 |
| Gorilla: Large Language Model Connected with Massive APIs | May 24, 2023 | HallucinationLanguage Modeling | CodeCode Available | 6 |
| RefGPT: Dialogue Generation of GPT, by GPT, and for GPT | May 24, 2023 | Dialogue GenerationHallucination | CodeCode Available | 1 |