| Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models | Apr 7, 2025 | Dialogue EvaluationFairness | CodeCode Available | 2 |
| Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey | Feb 8, 2025 | FairnessRAG | CodeCode Available | 2 |
| AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Dec 19, 2024 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Multi-Agent Large Language Models for Conversational Task-Solving | Oct 30, 2024 | FairnessQuestion Answering | CodeCode Available | 2 |
| On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Oct 11, 2024 | EthicsFairness | CodeCode Available | 2 |
| COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act | Oct 10, 2024 | BenchmarkingFairness | CodeCode Available | 2 |
| LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models | Sep 30, 2024 | Fairness | CodeCode Available | 2 |
| LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorch | Sep 4, 2024 | Evolutionary AlgorithmsFairness | CodeCode Available | 2 |
| Towards AI-Powered Video Assistant Referee System (VARS) for Association Football | Jul 17, 2024 | Fairness | CodeCode Available | 2 |