| Enhancing RL Safety with Counterfactual LLM Reasoning | Sep 16, 2024 | counterfactualLanguage Modeling | CodeCode Available | 1 | 5 |
| CompeteAI: Understanding the Competition Dynamics in Large Language Model-based Agents | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Matching Patients to Clinical Trials with Large Language Models | Jul 27, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Oct 22, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 | 5 |
| MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Aug 21, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval | Oct 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 | 5 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Sep 11, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| Composing Parameter-Efficient Modules with Arithmetic Operations | Jun 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Compositional Chain-of-Thought Prompting for Large Multimodal Models | Nov 27, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledge | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive LearningExtractive Summarization | CodeCode Available | 1 | 5 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | DiagnosticLarge Language Model | CodeCode Available | 1 | 5 |
| CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Jul 30, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 1 | 5 |
| Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion | May 26, 2025 | DenoisingImage Generation | CodeCode Available | 1 | 5 |
| Democratizing Reasoning Ability: Tailored Learning from Large Language Model | Oct 20, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | CodeCode Available | 1 | 5 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family | Mar 14, 2023 | Knowledge Base Question AnsweringLanguage Modeling | CodeCode Available | 1 | 5 |
| Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation | Aug 20, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Making Language Models Better Tool Learners with Execution Feedback | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection | Dec 15, 2022 | Deep LearningGraph Learning | CodeCode Available | 1 | 5 |