| Benchmarking state-of-the-art gradient boosting algorithms for classification | May 26, 2023 | Bayesian Optimization, Benchmarking | Unverified | 0 |
| CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset | May 25, 2023 | Benchmarking, Text to SQL | Code Available | 0 |
| Investigation of UAV Detection in Images with Complex Backgrounds and Rainy Artifacts | May 25, 2023 | Benchmarking, object-detection | Code Available | 0 |
| Analysis of modular CMA-ES on strict box-constrained problems in the SBOX-COST benchmarking suite | May 24, 2023 | Benchmarking | Unverified | 0 |
| GPT4Graph: Can Large Language Models Understand Graph Structured Data ? An Empirical Evaluation and Benchmarking | May 24, 2023 | Benchmarking, Graph Mining | Code Available | 0 |
| BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer | May 24, 2023 | Benchmarking, Cross-Lingual Transfer | Unverified | 0 |
| LAraBench: Benchmarking Arabic AI with Large Language Models | May 24, 2023 | Benchmarking, Few-Shot Learning | Unverified | 0 |
| Barkour: Benchmarking Animal-level Agility with Quadruped Robots | May 24, 2023 | Benchmarking, Navigate | Unverified | 0 |
| R2H: Building Multimodal Navigation Helpers that Respond to Help Requests | May 23, 2023 | Benchmarking, Language Modeling | Unverified | 0 |
| When the Music Stops: Tip-of-the-Tongue Retrieval for Music | May 23, 2023 | Benchmarking, Language Modeling | Code Available | 0 |