| Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration | Jul 11, 2023 | Hallucination, Logic Grid Puzzle | Code Available | 4 | 5 |
| Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models | Feb 12, 2024 | Hallucination, Object Localization | Code Available | 4 | 5 |
| ReAct: Synergizing Reasoning and Acting in Language Models | Oct 6, 2022 | Decision Making, Fact Verification | Code Available | 4 | 5 |
| A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges | Jan 4, 2025 | Fairness, Hallucination | Code Available | 4 | 5 |
| Retrieval-Augmented Generation for Large Language Models: A Survey | Dec 18, 2023 | Hallucination, RAG | Code Available | 4 | 5 |
| G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering | Feb 12, 2024 | Common Sense Reasoning, Graph Classification | Code Available | 4 | 5 |
| LLM-Enhanced Data Management | Feb 4, 2024 | Hallucination, Management | Code Available | 4 | 5 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8k, GPU | Code Available | 4 | 5 |
| Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models | Jul 30, 2023 | Hallucination, Prompt Engineering | Code Available | 4 | 5 |
| Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling | Nov 1, 2023 | Hallucination, Knowledge Distillation | Code Available | 4 | 5 |