| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | Diagnostic, Large Language Model | Code Available | 1 | 5 |
| DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Oct 4, 2023 | Arithmetic Reasoning, Language Modeling | Code Available | 1 | 5 |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0), Fact Checking | Code Available | 1 | 5 |
| DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Jun 21, 2025 | Autonomous Driving, Descriptive | Code Available | 1 | 5 |
| MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception | May 11, 2025 | Emotion Classification, Large Language Model | Code Available | 1 | 5 |
| MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledge | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DrugAssist: A Large Language Model for Molecule Optimization | Dec 28, 2023 | Drug Discovery, Language Modeling | Code Available | 1 | 5 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| MedFILIP: Medical Fine-grained Language-Image Pre-training | Jan 18, 2025 | Contrastive Learning, Diagnostic | Code Available | 1 | 5 |
| Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation | Oct 22, 2024 | Large Language Model, Multimodal Large Language Model | Code Available | 1 | 5 |
| A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Mar 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Measuring General Intelligence with Generated Games | May 12, 2025 | In-Context Learning, Large Language Model | Code Available | 1 | 5 |
| DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Code4Struct: Code Generation for Few-Shot Event Structure Prediction | Oct 23, 2022 | Code Generation, Event Argument Extraction | Code Available | 1 | 5 |
| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 1 | 5 |
| Matching Patients to Clinical Trials with Large Language Models | Jul 27, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 | 5 |
| DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | May 29, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces | Sep 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Dynamic Updates for Language Adaptation in Visual-Language Tracking | Mar 9, 2025 | Large Language Model | Code Available | 1 | 5 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems | May 23, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Collaborative Large Language Model for Recommender Systems | Nov 2, 2023 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems | Feb 19, 2025 | Collaborative Filtering, Conversational Recommendation | Code Available | 1 | 5 |
| Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | May 1, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| Dissecting Human and LLM Preferences | Feb 17, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections | Nov 17, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| ClusterLLM: Large Language Models as a Guide for Text Clustering | May 24, 2023 | Clustering, Language Modelling | Code Available | 1 | 5 |
| Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Oct 22, 2024 | GSM8K, Language Modeling | Code Available | 1 | 5 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image Retrieval, Language Modeling | Code Available | 1 | 5 |
| CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal Control | Mar 14, 2025 | Computational Efficiency, Language Modeling | Code Available | 1 | 5 |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Jan 19, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation | Nov 10, 2023 | Benchmarking, Cloud Computing | Code Available | 1 | 5 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | Benchmarking, Language Modeling | Code Available | 1 | 5 |
| DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model | Mar 31, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Feb 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Jun 24, 2024 | Chinese Spell Checking, Language Modeling | Code Available | 1 | 5 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 1 | 5 |
| Common Sense Enhanced Knowledge-based Recommendation with Large Language Model | Mar 27, 2024 | Common Sense Reasoning, Knowledge Graphs | Code Available | 1 | 5 |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Oct 1, 2024 | Continual Learning, Language Modeling | Code Available | 1 | 5 |
| Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models | Oct 1, 2023 | Decision Making, Language Modelling | Code Available | 1 | 5 |
| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Can Large Language Models Understand Molecules? | Jan 5, 2024 | Drug Discovery, Language Modelling | Code Available | 1 | 5 |
| M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Jan 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data | Jun 14, 2024 | Benchmarking, Decision Making | Code Available | 1 | 5 |