| Dynabench: Rethinking Benchmarking in NLP | Apr 7, 2021 | Benchmarking | Unverified | 0 |
| Efficient and Accurate In-Database Machine Learning with SQL Code Generation in Python | Apr 7, 2021 | Benchmarking, BIG-bench Machine Learning | Unverified | 0 |
| Robust Semantic Interpretability: Revisiting Concept Activation Vectors | Apr 6, 2021 | Benchmarking, counterfactual | Code Available | 1 |
| CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs | Apr 5, 2021 | Benchmarking, Knowledge Graphs | Code Available | 1 |
| What Will it Take to Fix Benchmarking in Natural Language Understanding? | Apr 5, 2021 | Benchmarking, Natural Language Understanding | Unverified | 0 |
| The Multi-speaker Multi-style Voice Cloning Challenge 2021 | Apr 5, 2021 | Benchmarking, Voice Cloning | Unverified | 0 |
| Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning | Apr 4, 2021 | Benchmarking, Multi Label Text Classification | Code Available | 0 |
| An Empirical Evaluation of Cost-based Federated SPARQL Query Processing Engines | Apr 2, 2021 | Benchmarking | Code Available | 0 |
| Benchmarking Transformer-based Language Models for Arabic Sentiment and Sarcasm Detection | Apr 1, 2021 | Benchmarking, Sarcasm Detection | Unverified | 0 |
| Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared Task | Apr 1, 2021 | Benchmarking, Language Modeling | Code Available | 0 |
| Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada | Apr 1, 2021 | Benchmarking, Language Identification | Unverified | 0 |
| Benchmarking a transformer-FREE model for ad-hoc retrieval | Apr 1, 2021 | Benchmarking, CPU | Code Available | 0 |
| Remote Sensing Image Classification with the SEN12MS Dataset | Apr 1, 2021 | Benchmarking, Classification | Code Available | 1 |
| Generalized Conflict-directed Search for Optimal Ordering Problems | Mar 31, 2021 | Benchmarking, Scheduling | Unverified | 0 |
| Simultaneous Navigation and Construction Benchmarking Environments | Mar 31, 2021 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 |
| Benchmarks for Deep Off-Policy Evaluation | Mar 30, 2021 | Benchmarking, continuous-control | Code Available | 1 |
| Unsupervised Learning of 3D Object Categories from Videos in the Wild | Mar 30, 2021 | Benchmarking, Monocular Reconstruction | Unverified | 0 |
| 3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding | Mar 30, 2021 | Affordance Detection, Benchmarking | Code Available | 1 |
| Benchmarking Representation Learning for Natural World Image Collections | Mar 30, 2021 | Benchmarking, Binary Classification | Code Available | 0 |
| RAN-GNNs: breaking the capacity limits of graph neural networks | Mar 29, 2021 | Attribute, Benchmarking | Unverified | 0 |
| Deep Image Compositing | Mar 29, 2021 | Benchmarking | Unverified | 0 |
| SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events | Mar 29, 2021 | Autonomous Vehicles, Benchmarking | Code Available | 1 |
| Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks | Mar 26, 2021 | Benchmarking | Unverified | 0 |
| Marine Snow Removal Benchmarking Dataset | Mar 26, 2021 | Benchmarking, Sand | Code Available | 1 |
| Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design | Mar 25, 2021 | Benchmarking, Edge-computing | Unverified | 0 |