| AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Sep 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Inverse Constitutional AI: Compressing Preferences into Principles | Jun 2, 2024 | ChatbotLanguage Modelling | CodeCode Available | 1 | 5 |
| LLMDet: A Third Party Large Language Models Generated Text Detection Tool | May 24, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Oct 4, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 | 5 |
| InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Jul 16, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| The Machine Psychology of Cooperation: Can GPT models operationalise prompts for altruism, cooperation, competitiveness and selfishness in economic games? | May 13, 2023 | Experimental DesignLanguage Modelling | CodeCode Available | 1 | 5 |
| LLMBind: A Unified Modality-Task Integration Framework | Feb 22, 2024 | AI AgentAudio Generation | CodeCode Available | 1 | 5 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ChatCounselor: A Large Language Models for Mental Health Support | Sep 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Oct 28, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 1 | 5 |
| ChatEDA: A Large Language Model Powered Autonomous Agent for EDA | Aug 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 | 5 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations | Jan 23, 2024 | counterfactualFact Checking | CodeCode Available | 1 | 5 |
| LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | May 28, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Feb 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs | Jul 1, 2025 | Large Language Model | CodeCode Available | 1 | 5 |
| DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| Automatic Evaluation of Attribution by Large Language Models | May 10, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 | 5 |
| Prompting as Probing: Using Language Models for Knowledge Base Construction | Aug 23, 2022 | Knowledge Base ConstructionLanguage Modeling | CodeCode Available | 1 | 5 |
| DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines | Nov 17, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |