| Performance Evaluation of Deep Transfer Learning on Multiclass Identification of Common Weed Species in Cotton Production Systems | Oct 11, 2021 | BenchmarkingManagement | CodeCode Available | 1 |
| SERAB: A multi-lingual benchmark for speech emotion recognition | Oct 7, 2021 | BenchmarkingEmotion Recognition | CodeCode Available | 1 |
| EntQA: Entity Linking as Question Answering | Oct 5, 2021 | BenchmarkingEntity Linking | CodeCode Available | 1 |
| Revisiting Self-Training for Few-Shot Learning of Language Model | Oct 4, 2021 | BenchmarkingFew-Shot Learning | CodeCode Available | 1 |
| Machine Learning with Knowledge Constraints for Process Optimization of Open-Air Perovskite Solar Cell Manufacturing | Oct 1, 2021 | Bayesian OptimizationBenchmarking | CodeCode Available | 1 |
| Phonetic Word Embeddings | Sep 30, 2021 | BenchmarkingWord Embeddings | CodeCode Available | 1 |
| MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation | Sep 29, 2021 | BenchmarkingPhilosophy | CodeCode Available | 1 |
| Benchmarking Graph Neural Networks on Dynamic Link Prediction | Sep 29, 2021 | BenchmarkingDynamic Link Prediction | CodeCode Available | 1 |
| "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations | Sep 28, 2021 | BenchmarkingDialogue State Tracking | CodeCode Available | 1 |
| FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding | Sep 27, 2021 | BenchmarkingNatural Language Understanding | CodeCode Available | 1 |
| PASS: An ImageNet replacement for self-supervised pretraining without humans | Sep 27, 2021 | BenchmarkingEthics | CodeCode Available | 1 |
| Disentangled Feature Representation for Few-shot Image Classification | Sep 26, 2021 | BenchmarkingClassification | CodeCode Available | 1 |
| Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System | Sep 23, 2021 | BenchmarkingResponse Generation | CodeCode Available | 1 |
| SubseasonalClimateUSA: A Dataset for Subseasonal Forecasting and Benchmarking | Sep 21, 2021 | Benchmarking | CodeCode Available | 1 |
| Benchmarking the Combinatorial Generalizability of Complex Query Answering on Knowledge Graphs | Sep 18, 2021 | BenchmarkingComplex Query Answering | CodeCode Available | 1 |
| AI Accelerator Survey and Trends | Sep 18, 2021 | BenchmarkingComputational Efficiency | CodeCode Available | 1 |
| Benchmarking Commonsense Knowledge Base Population with an Effective Evaluation Dataset | Sep 16, 2021 | BenchmarkingKnowledge Base Population | CodeCode Available | 1 |
| OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication | Sep 16, 2021 | 3D Object DetectionBenchmarking | CodeCode Available | 1 |
| Benchmarking the Spectrum of Agent Capabilities | Sep 14, 2021 | Benchmarking | CodeCode Available | 1 |
| RobustART: Benchmarking Robustness on Architecture Design and Training Techniques | Sep 11, 2021 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| Scikit-dimension: a Python package for intrinsic dimension estimation | Sep 6, 2021 | Benchmarking | CodeCode Available | 1 |
| Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica | Sep 6, 2021 | Benchmarking | CodeCode Available | 1 |
| Biomedical Data-to-Text Generation via Fine-Tuning Transformers | Sep 3, 2021 | BenchmarkingData-to-Text Generation | CodeCode Available | 1 |
| ReMeDi: Resources for Multi-domain, Multi-service, Medical Dialogues | Sep 1, 2021 | BenchmarkingContrastive Learning | CodeCode Available | 1 |
| Tune It or Don't Use It: Benchmarking Data-Efficient Image Classification | Aug 30, 2021 | Benchmarkingimage-classification | CodeCode Available | 1 |