| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Harnessing Multimodal Large Language Models for Multimodal Sequential Recommendation | Aug 19, 2024 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Oct 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 |
| PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling | Feb 13, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Hallucinations in Large Multilingual Translation Models | Mar 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging | Apr 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model | Aug 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos | Aug 26, 2024 | FormLanguage Modelling | CodeCode Available | 1 |
| Grounding Language Models for Visual Entity Recognition | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ChatEDA: A Large Language Model Powered Autonomous Agent for EDA | Aug 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Automatic Evaluation of Attribution by Large Language Models | May 10, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Jul 30, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation | Feb 18, 2025 | Collaborative FilteringExplainable Recommendation | CodeCode Available | 1 |
| Hallucination Augmented Contrastive Learning for Multimodal Large Language Model | Dec 12, 2023 | Contrastive LearningHallucination | CodeCode Available | 1 |
| IDEA-Bench: How Far are Generative Models from Professional Designing? | Dec 16, 2024 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks | Jun 20, 2024 | General KnowledgeHuman Dynamics | CodeCode Available | 1 |
| AllSpark: A Multimodal Spatio-Temporal General Intelligence Model with Ten Modalities via Language as a Reference Framework | Dec 31, 2023 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings | Jan 28, 2025 | DenoisingDomain Generalization | CodeCode Available | 1 |
| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Adaptive KalmanNet: Data-Driven Kalman Filter with Fast Adaptation | Sep 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enabling LLM Knowledge Analysis via Extensive Materialization | Nov 7, 2024 | Knowledge Base ConstructionLarge Language Model | CodeCode Available | 1 |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Feb 4, 2025 | Collaborative InferenceLanguage Modeling | CodeCode Available | 1 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Jan 20, 2025 | CPULanguage Modeling | CodeCode Available | 1 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Jul 12, 2024 | Collaborative InferenceLanguage Modelling | CodeCode Available | 1 |
| Automated Spinal MRI Labelling from Reports Using a Large Language Model | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text | Aug 14, 2023 | Drug DiscoveryImage Captioning | CodeCode Available | 1 |
| GeoSAM: Fine-tuning SAM with Multi-Modal Prompts for Mobility Infrastructure Segmentation | Nov 19, 2023 | Image SegmentationLarge Language Model | CodeCode Available | 1 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8kAvg | CodeCode Available | 1 |
| Citekit: A Modular Toolkit for Large Language Model Citation Generation | Aug 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Jun 12, 2025 | Large Language ModelStarcraft | CodeCode Available | 1 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language ModelNavigate | CodeCode Available | 1 |
| GraphLLM: Boosting Graph Reasoning Ability of Large Language Model | Oct 9, 2023 | Graph LearningLanguage Modeling | CodeCode Available | 1 |
| Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Generator-Retriever-Generator Approach for Open-Domain Question Answering | Jul 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Generative News Recommendation | Mar 6, 2024 | ArticlesLanguage Modelling | CodeCode Available | 1 |
| Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding | Feb 19, 2024 | HumanEvalLanguage Modeling | CodeCode Available | 1 |
| Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators | Mar 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Generation of Asset Administration Shell with Large Language Model Agents: Toward Semantic Interoperability in Digital Twins in the Context of Industry 4.0 | Mar 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis | Jun 10, 2025 | Domain AdaptationLarge Language Model | CodeCode Available | 1 |
| CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher | Aug 21, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| GIST: Generating Image-Specific Text for Fine-grained Object Classification | Jul 21, 2023 | ClassificationFine-Grained Image Classification | CodeCode Available | 1 |
| Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search | May 24, 2024 | Code GenerationLanguage Modelling | CodeCode Available | 1 |
| GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes | May 25, 2023 | Computed Tomography (CT)Image Generation | CodeCode Available | 1 |
| Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction | Feb 29, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Dec 24, 2024 | Autonomous DrivingDataset Generation | CodeCode Available | 1 |
| Aligning LLM Agents by Learning Latent Preference from User Edits | Apr 23, 2024 | DescriptiveLanguage Modelling | CodeCode Available | 1 |