| Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V | Oct 29, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 | 5 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | CodeCode Available | 1 | 5 |
| Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design | Oct 22, 2023 | Computational chemistryInstruction Following | CodeCode Available | 1 | 5 |
| MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model | Oct 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Democratizing Reasoning Ability: Tailored Learning from Large Language Model | Oct 20, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Jun 28, 2025 | Image SegmentationLarge Language Model | CodeCode Available | 1 | 5 |
| Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning | Mar 2, 2025 | Large Language ModelMulti-Instance Retrieval | CodeCode Available | 1 | 5 |
| Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection | Dec 15, 2022 | Deep LearningGraph Learning | CodeCode Available | 1 | 5 |
| Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics | Jul 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |