| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language Model, Navigate | Code Available | 1 | 5 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Jan 20, 2025 | CPU, Language Modeling | Code Available | 1 | 5 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive Learning, Extractive Summarization | Code Available | 1 | 5 |
| DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Jul 12, 2024 | Collaborative Inference, Language Modelling | Code Available | 1 | 5 |
| Ranked List Truncation for Large Language Model-based Re-Ranking | Apr 28, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Jun 21, 2025 | Autonomous Driving, Descriptive | Code Available | 1 | 5 |
| AstroAgents: A Multi-Agent AI for Hypothesis Generation from Mass Spectrometry Data | Mar 29, 2025 | Large Language Model | Code Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 | 5 |
| CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks | Jun 20, 2024 | General Knowledge, Human Dynamics | Code Available | 1 | 5 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image Captioning, Language Modeling | Code Available | 1 | 5 |
| NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Mar 2, 2024 | 16k, CPU | Code Available | 1 | 5 |
| On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning | Dec 15, 2022 | Instruction Following, Language Modeling | Code Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CoS: Enhancing Personalization and Mitigating Bias with Context Steering | May 2, 2024 | Bayesian Inference, Language Modelling | Code Available | 1 | 5 |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Feb 4, 2025 | Collaborative Inference, Language Modeling | Code Available | 1 | 5 |
| Citekit: A Modular Toolkit for Large Language Model Citation Generation | Aug 6, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections | Nov 17, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Explaining Relationships Between Scientific Documents | Feb 2, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher | Aug 21, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | May 1, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| Dissecting Human and LLM Preferences | Feb 17, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | May 22, 2025 | Combinatorial Optimization, Language Modeling | Code Available | 1 | 5 |
| Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models | May 5, 2024 | Descriptive, Language Modeling | Code Available | 1 | 5 |
| ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation | Dec 20, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Feb 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Sep 29, 2023 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| Working Memory Capacity of ChatGPT: An Empirical Study | Apr 30, 2023 | Benchmarking, Language Modeling | Code Available | 1 | 5 |
| DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model | Mar 31, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 | 5 |
| REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction | Jun 27, 2023 | Common Sense Reasoning, Large Language Model | Code Available | 1 | 5 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model | Oct 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CoVR-2: Automatic Data Construction for Composed Video Retrieval | Aug 28, 2023 | Composed Image Retrieval (CoIR), Composed Video Retrieval (CoVR) | Code Available | 1 | 5 |
| Hallucinations in Large Multilingual Translation Models | Mar 28, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease Prediction, Language Modeling | Code Available | 1 | 5 |
| AllSpark: A Multimodal Spatio-Temporal General Intelligence Model with Ten Modalities via Language as a Reference Framework | Dec 31, 2023 | Large Language Model, Multimodal Large Language Model | Code Available | 1 | 5 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Oct 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences | Nov 10, 2023 | Dialogue Generation, Language Modeling | Code Available | 1 | 5 |
| MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Aug 21, 2024 | image-classification, Image Classification | Code Available | 1 | 5 |
| ChemMLLM: Chemical Multimodal Large Language Model | May 22, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| ChemLLM: A Chemical Large Language Model | Feb 10, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning | Mar 2, 2025 | Large Language Model, Multi-Instance Retrieval | Code Available | 1 | 5 |
| MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction | Jun 18, 2024 | Drug Discovery, Graph Neural Network | Code Available | 1 | 5 |
| CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution | May 21, 2025 | Large Language Model, Task Planning | Code Available | 1 | 5 |
| Automatic Evaluation of Attribution by Large Language Models | May 10, 2023 | Fact Checking, Language Modeling | Code Available | 1 | 5 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |