| Arena 4.0: A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation | Sep 19, 2024 | Benchmarking, Social Navigation | Unverified | 0 |
| MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines | Sep 19, 2024 | Benchmarking | Unverified | 0 |
| Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards | Sep 19, 2024 | Benchmarking | Code Available | 0 |
| ASR Benchmarking: Need for a More Representative Conversational Dataset | Sep 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Efficacy of Synthetic Data as a Benchmark | Sep 18, 2024 | Benchmarking, Few-Shot Learning | Unverified | 0 |
| Hard-Label Cryptanalytic Extraction of Neural Network Models | Sep 18, 2024 | Benchmarking | Code Available | 0 |
| PARAPHRASUS : A Comprehensive Benchmark for Evaluating Paraphrase Detection Models | Sep 18, 2024 | Benchmarking, Model Selection | Code Available | 0 |
| Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part II | Sep 17, 2024 | Benchmarking, Descriptive | Code Available | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection | Sep 17, 2024 | Benchmarking, Event Detection | Code Available | 0 |