| UDTIRI: An Online Open-Source Intelligent Road Inspection Benchmark Suite | Apr 18, 2023 | Benchmarking, Instance Segmentation | Unverified | 0 |
| OOD-CV-v2: An extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images | Apr 17, 2023 | 3D Pose Estimation, Benchmarking | Unverified | 0 |
| Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis | Apr 17, 2023 | Benchmarking, Drift Detection | Code Available | 0 |
| Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy | Apr 14, 2023 | Benchmarking | Unverified | 0 |
| Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation | Apr 11, 2023 | Benchmarking, Conversational Recommendation | Unverified | 0 |
| Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection | Apr 11, 2023 | Adversarial Attack, Adversarial Robustness | Unverified | 0 |
| OpenAGI: When LLM Meets Domain Experts | Apr 10, 2023 | Benchmarking, Natural Language Queries | Code Available | 4 |
| NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems | Apr 10, 2023 | Benchmarking | Code Available | 1 |
| Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable Confidence | Apr 10, 2023 | Benchmarking, speech-recognition | Code Available | 0 |
| On Evaluation of Bangla Word Analogies | Apr 10, 2023 | Benchmarking, Word Embeddings | Unverified | 0 |
| ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit | Apr 10, 2023 | Benchmarking, Simultaneous Speech-to-Text Translation | Unverified | 0 |
| RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning | Apr 9, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 2 |
| ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis | Apr 9, 2023 | Benchmarking, Deep Learning | Unverified | 0 |
| Benchmarking the Robustness of Quantized Models | Apr 8, 2023 | Benchmarking, Quantization | Unverified | 0 |
| SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data | Apr 8, 2023 | Benchmarking, Data Augmentation | Code Available | 0 |
| Probing Conceptual Understanding of Large Visual-Language Models | Apr 7, 2023 | Benchmarking | Code Available | 0 |
| Interpretable statistical representations of neural population dynamics and geometry | Apr 6, 2023 | Benchmarking, Decision Making | Code Available | 1 |
| Benchmarking Robustness to Text-Guided Corruptions | Apr 6, 2023 | Benchmarking, Data Augmentation | Code Available | 0 |
| DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images | Apr 5, 2023 | Benchmarking, Data Augmentation | Unverified | 0 |
| MMVC: Learned Multi-Mode Video Compression with Block-based Prediction Mode Selection and Density-Adaptive Entropy Coding | Apr 5, 2023 | Benchmarking, MS-SSIM | Code Available | 1 |
| LogoNet: a fine-grained network for instance-level logo sketch retrieval | Apr 5, 2023 | 2k, Benchmarking | Code Available | 0 |
| IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical Systems | Apr 5, 2023 | Benchmarking | Code Available | 0 |
| The Saudi Privacy Policy Dataset | Apr 5, 2023 | Benchmarking | Code Available | 0 |
| OpenContrails: Benchmarking Contrail Detection on GOES-16 ABI | Apr 4, 2023 | Benchmarking | Unverified | 0 |
| SLPerf: a Unified Framework for Benchmarking Split Learning | Apr 4, 2023 | Benchmarking, Diversity | Code Available | 1 |