| AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Aug 8, 2023 | Language Model Evaluation, Language Modeling | Code Available | 2 | 5 |
| Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions | Sep 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 | 5 |
| Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement | May 13, 2025 | Benchmarking, Language Modeling | Code Available | 2 | 5 |
| Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| biorecap: an R package for summarizing bioRxiv preprints with a local LLM | Aug 21, 2024 | Language Modelling, Large Language Model | Code Available | 2 | 5 |
| Language Models can Solve Computer Tasks | Mar 30, 2023 | Language Modelling, Large Language Model | Code Available | 2 | 5 |
| AgentReview: Exploring Peer Review Dynamics with LLM Agents | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction | Mar 12, 2024 | Code Generation, Language Modelling | Code Available | 2 | 5 |
| Control Industrial Automation System with Large Language Model Agents | Sep 26, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application | May 28, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |