| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 1 | 5 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary Classification, Language Modeling | Code Available | 1 | 5 |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0), Fact Checking | Code Available | 1 | 5 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | Decoder, Language Modeling | Code Available | 1 | 5 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 | 5 |
| AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Sep 11, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Sep 26, 2024 | Autonomous Driving, Language Modeling | Code Available | 1 | 5 |
| A Large Language Model Enhanced Sequential Recommender for Joint Video and Comment Recommendation | Mar 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Jul 22, 2024 | Image Generation, Language Modelling | Code Available | 1 | 5 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | Diagnostic, Large Language Model | Code Available | 1 | 5 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Apr 22, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| Measuring General Intelligence with Generated Games | May 12, 2025 | In-Context Learning, Large Language Model | Code Available | 1 | 5 |
| DrugAssist: A Large Language Model for Molecule Optimization | Dec 28, 2023 | Drug Discovery, Language Modeling | Code Available | 1 | 5 |
| Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation | Oct 22, 2024 | Large Language Model, Multimodal Large Language Model | Code Available | 1 | 5 |
| MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledge | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Jan 24, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DRG-LLaMA : Tuning LLaMA Model to Predict Diagnosis-related Group for Hospitalized Patients | Sep 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| MedFILIP: Medical Fine-grained Language-Image Pre-training | Jan 18, 2025 | Contrastive Learning, Diagnostic | Code Available | 1 | 5 |
| MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception | May 11, 2025 | Emotion Classification, Large Language Model | Code Available | 1 | 5 |
| DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer | Nov 27, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Oct 4, 2023 | Arithmetic Reasoning, Language Modeling | Code Available | 1 | 5 |
| Matching Patients to Clinical Trials with Large Language Models | Jul 27, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Jun 21, 2025 | Autonomous Driving, Descriptive | Code Available | 1 | 5 |
| Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking | May 13, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Making Language Models Better Tool Learners with Execution Feedback | May 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems | May 23, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | Benchmarking, Language Modeling | Code Available | 1 | 5 |
| MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Feb 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| A Survey on Self-Supervised Graph Foundation Models: Knowledge-Based Perspective | Mar 24, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| An In-Context Learning Agent for Formal Theorem-Proving | Oct 6, 2023 | Automated Theorem Proving, In-Context Learning | Code Available | 1 | 5 |
| Dynamic Updates for Language Adaptation in Visual-Language Tracking | Mar 9, 2025 | Large Language Model | Code Available | 1 | 5 |
| Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Oct 22, 2024 | GSM8K, Language Modeling | Code Available | 1 | 5 |
| Lshan-1.0 Technical Report | Mar 10, 2025 | Large Language Model | Code Available | 1 | 5 |
| Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections | Nov 17, 2023 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data | Jun 14, 2024 | Benchmarking, Decision Making | Code Available | 1 | 5 |
| Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding | Apr 10, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| Dissecting Human and LLM Preferences | Feb 17, 2024 | Language Modelling, Large Language Model | Code Available | 1 | 5 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 | 5 |
| LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Mar 25, 2025 | Cross-Modal Retrieval, Hallucination | Code Available | 1 | 5 |
| LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Jun 29, 2023 | Automatic Lyrics Transcription, Language Modeling | Code Available | 1 | 5 |
| DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model | Mar 31, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Mar 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative Editing | Oct 19, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | May 1, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| CoditT5: Pretraining for Source Code and Natural Language Editing | Aug 10, 2022 | Bug fixing, Language Modeling | Code Available | 1 | 5 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |