| CLLMate: A Multimodal Benchmark for Weather and Climate Events Forecasting | Sep 27, 2024 | Articles, Benchmarking | —Unverified | 0 |
| MCUBench: A Benchmark of Tiny Object Detectors on MCUs | Sep 27, 2024 | Benchmarking, Model Selection | —Unverified | 0 |
| Data Analysis in the Era of Generative AI | Sep 27, 2024 | Benchmarking | —Unverified | 0 |
| Constructing Confidence Intervals for 'the' Generalization Error -- a Comprehensive Benchmark Study | Sep 27, 2024 | Benchmarking, tabular-regression | Code Available | 0 |
| ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning | Sep 27, 2024 | AutoML, Benchmarking | Code Available | 1 |
| The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection Benchmark | Sep 26, 2024 | Anomaly Detection, Benchmarking | Code Available | 3 |
| Conformal Prediction: A Theoretical Note and Benchmarking Transductive Node Classification in Graphs | Sep 26, 2024 | Benchmarking, Conformal Prediction | Code Available | 0 |
| MALPOLON: A Framework for Deep Species Distribution Modeling | Sep 26, 2024 | Benchmarking, GPU | Code Available | 1 |
| Omnibenchmark (alpha) for continuous and open benchmarking in bioinformatics | Sep 25, 2024 | Benchmarking | —Unverified | 0 |
| Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning | Sep 25, 2024 | Benchmarking, Formal Logic | —Unverified | 0 |