| VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning | Jul 7, 2022 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems | Jul 7, 2022 | Benchmarking | —Unverified | 0 |
| Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Jul 4, 2022 | BenchmarkingDocument Ranking | CodeCode Available | 2 |
| Identifying the Context Shift between Test Benchmarks and Production Data | Jul 3, 2022 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk | Jul 2, 2022 | BenchmarkingMachine Translation | CodeCode Available | 1 |
| Less Is More: A Comparison of Active Learning Strategies for 3D Medical Image Segmentation | Jul 2, 2022 | Active LearningBenchmarking | CodeCode Available | 1 |
| HATE-ITA: New Baselines for Hate Speech Detection in Italian | Jul 1, 2022 | BenchmarkingHate Speech Detection | CodeCode Available | 0 |
| SentSpace: Large-Scale Benchmarking and Evaluation of Text using Cognitively Motivated Lexical, Syntactic, and Semantic Features | Jul 1, 2022 | BenchmarkingSentence | —Unverified | 0 |
| Towards Toxic Positivity Detection | Jul 1, 2022 | BenchmarkingClassification | —Unverified | 0 |
| Benchmarking Intersectional Biases in NLP | Jul 1, 2022 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 |
| Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding | Jul 1, 2022 | Benchmarking | —Unverified | 0 |
| DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles | Jul 1, 2022 | Abstractive Text SummarizationArticles | —Unverified | 0 |
| Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking | Jul 1, 2022 | BenchmarkingNatural Language Understanding | —Unverified | 0 |
| Benchmarking Language-agnostic Intent Classification for Virtual Assistant Platforms | Jul 1, 2022 | BenchmarkingClassification | CodeCode Available | 0 |
| Local manifold learning and its link to domain-based physics knowledge | Jul 1, 2022 | BenchmarkingDimensionality Reduction | CodeCode Available | 0 |
| Analyzing the behaviour of D'WAVE quantum annealer: fine-tuning parameterization and tests with restrictive Hamiltonian formulations | Jul 1, 2022 | BenchmarkingCombinatorial Optimization | —Unverified | 0 |
| DFGC 2022: The Second DeepFake Game Competition | Jun 30, 2022 | BenchmarkingFace Swapping | CodeCode Available | 1 |
| Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital Pathology | Jun 30, 2022 | BenchmarkingDiagnostic | CodeCode Available | 1 |
| Computer-aided diagnosis and prediction in brain disorders | Jun 29, 2022 | BenchmarkingDecision Making | —Unverified | 0 |
| An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations | Jun 29, 2022 | Benchmarking | CodeCode Available | 0 |
| Beyond neural scaling laws: beating power law scaling via data pruning | Jun 29, 2022 | Benchmarking | CodeCode Available | 1 |
| Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames | Jun 29, 2022 | BenchmarkingDiversity | CodeCode Available | 1 |
| Toward an ImageNet Library of Functions for Global Optimization Benchmarking | Jun 27, 2022 | Benchmarkingglobal-optimization | —Unverified | 0 |
| Benchopt: Reproducible, efficient and collaborative optimization benchmarks | Jun 27, 2022 | Benchmarkingimage-classification | CodeCode Available | 4 |
| The DEBS 2022 Grand Challenge: Detecting Trading Trends in Financial Tick Data | Jun 23, 2022 | Benchmarking | CodeCode Available | 1 |