| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | May 22, 2025 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 1 | 5 |
| Democratizing Reasoning Ability: Tailored Learning from Large Language Model | Oct 20, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation | Dec 20, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| MISR: Measuring Instrumental Self-Reasoning in Frontier Models | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Generator-Retriever-Generator Approach for Open-Domain Question Answering | Jul 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | CodeCode Available | 1 | 5 |
| MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection | Dec 20, 2024 | Cancer ClassificationChatbot | CodeCode Available | 1 | 5 |