| DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning | Oct 23, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain | Oct 22, 2023 | Dialogue Generation, Dialogue Understanding | Code Available | 2 |
| Making Large Language Models Perform Better in Knowledge Graph Completion | Oct 10, 2023 | In-Context Learning, Knowledge Graph Completion | Code Available | 2 |
| OptiMUS: Optimization Modeling Using MIP Solvers and large language models | Oct 9, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Oct 5, 2023 | Event Argument Extraction, Event Extraction | Code Available | 2 |
| InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists | Sep 30, 2023 | Depth Estimation, Image Generation | Code Available | 2 |
| Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | Sep 29, 2023 | Decision Making, Language Modeling | Code Available | 2 |
| RLLTE: Long-Term Evolution Project of Reinforcement Learning | Sep 28, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| AnglE-optimized Text Embeddings | Sep 22, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Sep 21, 2023 | 3D visual grounding, Language Modeling | Code Available | 2 |
| StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Sep 20, 2023 | Chart Question Answering, Chart Understanding | Code Available | 2 |
| DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services | Sep 20, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| OWL: A Large Language Model for IT Operations | Sep 17, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Sep 15, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Unified Human-Scene Interaction via Prompted Chain-of-Contacts | Sep 14, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization | Sep 9, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| Automated Bioinformatics Analysis via AutoBA | Sep 6, 2023 | AI Agent, Language Modeling | Code Available | 2 |
| Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following | Sep 1, 2023 | 3D Generation, 3D Question Answering (3D-QA) | Code Available | 2 |
| OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models | Aug 25, 2023 | Common Sense Reasoning, Computational Efficiency | Code Available | 2 |
| Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning | Aug 22, 2023 | Caption Generation, Large Language Model | Code Available | 2 |
| SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding | Aug 21, 2023 | Entity Typing, Event Extraction | Code Available | 2 |
| Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes | Aug 17, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Aug 17, 2023 | Decision Making, Hallucination | Code Available | 2 |
| EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce | Aug 14, 2023 | Diversity, Instruction Following | Code Available | 2 |
| AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Aug 8, 2023 | Language Model Evaluation, Language Modeling | Code Available | 2 |
| SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool | Aug 8, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue | Aug 7, 2023 | Instruction Following, Language Modeling | Code Available | 2 |
| LP-MusicCaps: LLM-Based Pseudo Music Captioning | Jul 31, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer | Jul 27, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Planting a SEED of Vision in Large Language Model | Jul 16, 2023 | Image Generation, Image to text | Code Available | 2 |
| Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph | Jul 15, 2023 | Hallucination, Knowledge Graphs | Code Available | 2 |
| Drive Like a Human: Rethinking Autonomous Driving with Large Language Models | Jul 14, 2023 | Autonomous Driving, Common Sense Reasoning | Code Available | 2 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | Attribute, Common Sense Reasoning | Code Available | 2 |
| HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution | Jun 27, 2023 | 4k, In-Context Learning | Code Available | 2 |
| MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models | Jun 23, 2023 | Benchmarking, Language Modeling | Code Available | 2 |
| CMMLU: Measuring massive multitask language understanding in Chinese | Jun 15, 2023 | Large Language Model | Code Available | 2 |
| XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models | Jun 13, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Valley: Video Assistant with Large Language model Enhanced abilitY | Jun 12, 2023 | Action Recognition, Instruction Following | Code Available | 2 |
| PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization | Jun 8, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance | Jun 8, 2023 | Conversational Question Answering, Language Modeling | Code Available | 2 |
| RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Jun 8, 2023 | Answer Generation, Fact Checking | Code Available | 2 |
| Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks | Jun 7, 2023 | Cross-Modal Retrieval, Language Modelling | Code Available | 2 |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Jun 5, 2023 | GPU, Language Modelling | Code Available | 2 |
| User Behavior Simulation with Large Language Model based Agents | Jun 5, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction | May 30, 2023 | Image Generation, Instruction Following | Code Available | 2 |
| VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset | May 29, 2023 | Audio captioning, Audio-Visual Captioning | Code Available | 2 |
| KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application | May 28, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| ExpertPrompting: Instructing Large Language Models to be Distinguished Experts | May 24, 2023 | In-Context Learning, Instruction Following | Code Available | 2 |
| LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models | May 23, 2023 | Common Sense Reasoning, Image Generation | Code Available | 2 |