| Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation | Aug 20, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| Making Language Models Better Tool Learners with Execution Feedback | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems | May 23, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Sep 26, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 1 | 5 |
| Evolutionary Large Language Model for Automated Feature Transformation | May 25, 2024 | Efficient ExplorationEvolutionary Algorithms | CodeCode Available | 1 | 5 |
| Excuse me, sir? Your language model is leaking (information) | Jan 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication | Dec 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Non-Linear Inference Time Intervention: Improving LLM Truthfulness | Mar 27, 2024 | Large Language ModelMultiple-choice | CodeCode Available | 1 | 5 |
| ExaRanker: Explanation-Augmented Neural Ranker | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Study of Generative Large Language Model for Medical Research and Healthcare | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |