| Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond | Oct 3, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| FELM: Benchmarking Factuality Evaluation of Large Language Models | Oct 1, 2023 | BenchmarkingMath | CodeCode Available | 1 |
| Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration | Sep 30, 2023 | World Knowledge | CodeCode Available | 1 |
| Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment | Sep 30, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Augmenting LLMs with Knowledge: A survey on hallucination prevention | Sep 28, 2023 | HallucinationLanguage Modeling | —Unverified | 0 |
| Analyzing the Efficacy of an LLM-Only Approach for Image-based Document Question Answering | Sep 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Physics of Language Models: Part 3.1, Knowledge Storage and Extraction | Sep 25, 2023 | Question AnsweringSentence | CodeCode Available | 1 |
| Bravo MaRDI: A Wikibase Powered Knowledge Graph on Mathematics | Sep 20, 2023 | World Knowledge | CodeCode Available | 0 |
| Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering | Sep 20, 2023 | Graph Question AnsweringLanguage Modeling | CodeCode Available | 1 |
| Reformulating Sequential Recommendation: Learning Dynamic User Interest with Content-enriched Language Modeling | Sep 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |