| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image CaptioningLanguage Modeling | CodeCode Available | 1 | 5 |
| ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences | Nov 10, 2023 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | May 29, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Multimodal AI predicts clinical outcomes of drug combinations from preclinical data | Mar 4, 2025 | Large Language Model | CodeCode Available | 1 | 5 |
| ChemMLLM: Chemical Multimodal Large Language Model | May 22, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ChemLLM: A Chemical Large Language Model | Feb 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach | Nov 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model | Mar 31, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition | Feb 18, 2025 | Emotion RecognitionLarge Language Model | CodeCode Available | 1 | 5 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Aug 21, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| RTLLM: An Open-Source Benchmark for Design RTL Generation with Large Language Model | Aug 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion | Feb 20, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 | 5 |
| Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models | Nov 1, 2024 | Decision MakingInformativeness | CodeCode Available | 1 | 5 |
| Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models | May 5, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 | 5 |
| MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 | 5 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | CodeCode Available | 1 | 5 |
| MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction | Jun 18, 2024 | Drug DiscoveryGraph Neural Network | CodeCode Available | 1 | 5 |