| XMainframe: A Large Language Model for Mainframe Modernization | Aug 5, 2024 | Code Summarization, Language Modeling | Code Available | 2 |
| DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Aug 1, 2024 | Articles, Hallucination | Code Available | 2 |
| T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation | Jul 19, 2024 | Attribute, Language Modeling | Code Available | 2 |
| RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering | Jul 19, 2024 | Domain Generalization, Form | Code Available | 2 |
| Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale | Jul 17, 2024 | GPU, LAMBADA | Code Available | 2 |
| UrbanWorld: An Urban World Model for 3D City Generation | Jul 16, 2024 | Decision Making, Language Modelling | Code Available | 2 |
| DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems | Jul 15, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation | Jul 15, 2024 | Information Retrieval, Knowledge Graphs | Code Available | 2 |
| FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Jul 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 |