| SATBench: Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networks | Jun 16, 2022 | BenchmarkingDynamic neural networks | CodeCode Available | 0 |
| Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability | Jun 16, 2022 | BenchmarkingFeature Importance | —Unverified | 0 |
| Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models | Jun 16, 2022 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents | Jun 13, 2022 | Benchmarking | —Unverified | 0 |
| EmProx: Neural Network Performance Estimation For Neural Architecture Search | Jun 13, 2022 | BenchmarkingDecoder | CodeCode Available | 0 |
| CodeS: Towards Code Model Generalization Under Distribution Shift | Jun 11, 2022 | BenchmarkingCode Classification | CodeCode Available | 0 |
| SAIBench: Benchmarking AI for Science | Jun 11, 2022 | BenchmarkingFriction | —Unverified | 0 |
| Functional Code Building Genetic Programming | Jun 9, 2022 | BenchmarkingProgram Synthesis | —Unverified | 0 |
| FedHPO-B: A Benchmark Suite for Federated Hyperparameter Optimization | Jun 8, 2022 | BenchmarkingFederated Learning | —Unverified | 0 |
| Benchmarking Bayesian neural networks and evaluation metrics for regression tasks | Jun 8, 2022 | BenchmarkingOpen-Ended Question Answering | —Unverified | 0 |