| Inference-Time Intervention: Eliciting Truthful Answers from a Language Model | Jun 6, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Can Large Language Model Agents Simulate Human Trust Behavior? | Feb 7, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Improving Factuality and Reasoning in Language Models through Multiagent Debate | May 23, 2023 | Few-Shot Learning, Language Modeling | Code Available | 2 |
| Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback | May 17, 2023 | In-Context Learning, Language Modeling | Code Available | 2 |
| Improved Representation Steering for Language Models | May 27, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction | Mar 27, 2024 | Image Captioning, Language Modeling | Code Available | 2 |
| Improve Vision Language Model Chain-of-thought Reasoning | Oct 21, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents | Mar 5, 2024 | Benchmarking, Language Modeling | Code Available | 2 |
| iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Ignore Previous Prompt: Attack Techniques For Language Models | Nov 17, 2022 | Adversarial Attack, Adversarial Text | Code Available | 2 |