| A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect | May 7, 2021 | BenchmarkingSpeech-to-Text | —Unverified | 0 |
| PathBench: A Benchmarking Platform for Classical and Learned Path Planning Algorithms | May 4, 2021 | Benchmarking | —Unverified | 0 |
| Event Camera Simulator Design for Modeling Attention-based Inference Architectures | May 3, 2021 | Benchmarking | —Unverified | 0 |
| dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing | Apr 27, 2021 | BenchmarkingRetrieval | CodeCode Available | 1 |
| A Complementarity Analysis of the COCO Benchmark Problems and Artificially Generated Problems | Apr 27, 2021 | Benchmarking | —Unverified | 0 |
| 2.5D Visual Relationship Detection | Apr 26, 2021 | BenchmarkingDepth Estimation | CodeCode Available | 1 |
| OPTION: OPTImization Algorithm Benchmarking ONtology | Apr 24, 2021 | BenchmarkingData Integration | —Unverified | 0 |
| Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages | Apr 23, 2021 | BenchmarkingDeception Detection | —Unverified | 0 |
| Knodle: Modular Weakly Supervised Learning with PyTorch | Apr 23, 2021 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| Measuring what Really Matters: Optimizing Neural Networks for TinyML | Apr 21, 2021 | Benchmarking | CodeCode Available | 0 |
| Model-predictive control and reinforcement learning in multi-energy system case studies | Apr 20, 2021 | BenchmarkingModel Predictive Control | —Unverified | 0 |
| Benchmarking the Benchmark -- Analysis of Synthetic NIDS Datasets | Apr 19, 2021 | BenchmarkingIntrusion Detection | —Unverified | 0 |
| FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks | Apr 18, 2021 | BenchmarkingFederated Learning | CodeCode Available | 0 |
| The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech | Apr 17, 2021 | Benchmarking | —Unverified | 0 |
| BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models | Apr 17, 2021 | Argument RetrievalBenchmarking | CodeCode Available | 2 |
| Towards Standardising Reinforcement Learning Approaches for Production Scheduling Problems | Apr 16, 2021 | Benchmarkingreinforcement-learning | CodeCode Available | 1 |
| Data Generating Process to Evaluate Causal Discovery Techniques for Time Series Data | Apr 16, 2021 | BenchmarkingCausal Discovery | CodeCode Available | 1 |
| Jointly Modeling and Clustering Tensors in High Dimensions | Apr 15, 2021 | BenchmarkingClustering | —Unverified | 0 |
| On the Assessment of Benchmark Suites for Algorithm Comparison | Apr 15, 2021 | Benchmarking | —Unverified | 0 |
| Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability | Apr 14, 2021 | BenchmarkingLink Prediction | CodeCode Available | 1 |
| Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm Optimization | Apr 13, 2021 | BenchmarkingMetaheuristic Optimization | CodeCode Available | 1 |
| StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer | Apr 12, 2021 | BenchmarkingSentence | CodeCode Available | 1 |
| A Probabilistic Framework for Lexicon-based Keyword Spotting in Handwritten Text Images | Apr 9, 2021 | BenchmarkingKeyword Spotting | —Unverified | 0 |
| Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam | Apr 9, 2021 | BenchmarkingScene Text Recognition | —Unverified | 0 |
| BERT-based Chinese Text Classification for Emergency Domain with a Novel Loss Function | Apr 9, 2021 | BenchmarkingGeneral Classification | —Unverified | 0 |
| Dynabench: Rethinking Benchmarking in NLP | Apr 7, 2021 | Benchmarking | —Unverified | 0 |
| Efficient and Accurate In-Database Machine Learning with SQL Code Generation in Python | Apr 7, 2021 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Robust Semantic Interpretability: Revisiting Concept Activation Vectors | Apr 6, 2021 | Benchmarkingcounterfactual | CodeCode Available | 1 |
| CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs | Apr 5, 2021 | BenchmarkingKnowledge Graphs | CodeCode Available | 1 |
| What Will it Take to Fix Benchmarking in Natural Language Understanding? | Apr 5, 2021 | BenchmarkingNatural Language Understanding | —Unverified | 0 |
| The Multi-speaker Multi-style Voice Cloning Challenge 2021 | Apr 5, 2021 | BenchmarkingVoice Cloning | —Unverified | 0 |
| Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning | Apr 4, 2021 | BenchmarkingMulti Label Text Classification | CodeCode Available | 0 |
| An Empirical Evaluation of Cost-based Federated SPARQL Query Processing Engines | Apr 2, 2021 | Benchmarking | CodeCode Available | 0 |
| Benchmarking Transformer-based Language Models for Arabic Sentiment and Sarcasm Detection | Apr 1, 2021 | BenchmarkingSarcasm Detection | —Unverified | 0 |
| Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared Task | Apr 1, 2021 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada | Apr 1, 2021 | BenchmarkingLanguage Identification | —Unverified | 0 |
| Benchmarking a transformer-FREE model for ad-hoc retrieval | Apr 1, 2021 | BenchmarkingCPU | CodeCode Available | 0 |
| Remote Sensing Image Classification with the SEN12MS Dataset | Apr 1, 2021 | BenchmarkingClassification | CodeCode Available | 1 |
| Generalized Conflict-directed Search for Optimal Ordering Problems | Mar 31, 2021 | BenchmarkingScheduling | —Unverified | 0 |
| Simultaneous Navigation and Construction Benchmarking Environments | Mar 31, 2021 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| Benchmarks for Deep Off-Policy Evaluation | Mar 30, 2021 | Benchmarkingcontinuous-control | CodeCode Available | 1 |
| Unsupervised Learning of 3D Object Categories from Videos in the Wild | Mar 30, 2021 | BenchmarkingMonocular Reconstruction | —Unverified | 0 |
| 3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding | Mar 30, 2021 | Affordance DetectionBenchmarking | CodeCode Available | 1 |
| Benchmarking Representation Learning for Natural World Image Collections | Mar 30, 2021 | BenchmarkingBinary Classification | CodeCode Available | 0 |
| RAN-GNNs: breaking the capacity limits of graph neural networks | Mar 29, 2021 | AttributeBenchmarking | —Unverified | 0 |
| Deep Image Compositing | Mar 29, 2021 | Benchmarking | —Unverified | 0 |
| SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events | Mar 29, 2021 | Autonomous VehiclesBenchmarking | CodeCode Available | 1 |
| Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks | Mar 26, 2021 | Benchmarking | —Unverified | 0 |
| Marine Snow Removal Benchmarking Dataset | Mar 26, 2021 | BenchmarkingSand | CodeCode Available | 1 |
| Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design | Mar 25, 2021 | BenchmarkingEdge-computing | —Unverified | 0 |