| Deep Learning-Based Synchronization for Uplink NB-IoT | May 22, 2022 | BenchmarkingDeep Learning | CodeCode Available | 1 |
| Oracle-MNIST: a Realistic Image Dataset for Benchmarking Machine Learning Algorithms | May 19, 2022 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| The VoicePrivacy 2020 Challenge Evaluation Plan | May 14, 2022 | Benchmarking | CodeCode Available | 1 |
| Federated Learning Under Intermittent Client Availability and Time-Varying Communication Constraints | May 13, 2022 | BenchmarkingFederated Learning | CodeCode Available | 1 |
| Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks | May 11, 2022 | BenchmarkingExplanation Generation | CodeCode Available | 1 |
| Clinical Prompt Learning with Frozen Language Models | May 11, 2022 | BenchmarkingGPU | CodeCode Available | 1 |
| GenISP: Neural ISP for Low-Light Machine Cognition | May 7, 2022 | BenchmarkingImage Restoration | CodeCode Available | 1 |
| BiCo-Net: Regress Globally, Match Locally for Robust 6D Pose Estimation | May 7, 2022 | 6D Pose EstimationBenchmarking | CodeCode Available | 1 |
| Benchmarking Econometric and Machine Learning Methodologies in Nowcasting | May 6, 2022 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| Creating a Forensic Database of Shoeprints from Online Shoe Tread Photos | May 4, 2022 | BenchmarkingDepth Estimation | CodeCode Available | 1 |
| Continual Learning with Foundation Models: An Empirical Study of Latent Replay | Apr 30, 2022 | BenchmarkingContinual Learning | CodeCode Available | 1 |
| A global analysis of metrics used for measuring performance in natural language processing | Apr 25, 2022 | BenchmarkingMachine Translation | CodeCode Available | 1 |
| NICO++: Towards Better Benchmarking for Domain Generalization | Apr 17, 2022 | BenchmarkingDomain Generalization | CodeCode Available | 1 |
| Stress-Testing Point Cloud Registration on Automotive LiDAR | Apr 16, 2022 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Deep learning model solves change point detection for multiple change types | Apr 15, 2022 | BenchmarkingChange Point Detection | CodeCode Available | 1 |
| Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization | Apr 13, 2022 | BenchmarkingDeepFake Detection | CodeCode Available | 1 |
| Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet Datasets | Apr 11, 2022 | Action Triplet RecognitionBenchmarking | CodeCode Available | 1 |
| BioRED: A Rich Biomedical Relation Extraction Dataset | Apr 8, 2022 | BenchmarkingBinary Relation Extraction | CodeCode Available | 1 |
| The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems | Apr 6, 2022 | AttributeBenchmarking | CodeCode Available | 1 |
| Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks | Apr 5, 2022 | Benchmarking | CodeCode Available | 1 |
| Coarse-to-Fine Q-attention with Learned Path Ranking | Apr 4, 2022 | Benchmarking | CodeCode Available | 1 |
| Earnings-22: A Practical Benchmark for Accents in the Wild | Mar 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Parameter-efficient Model Adaptation for Vision Transformers | Mar 29, 2022 | BenchmarkingClassification | CodeCode Available | 1 |
| Visual Abductive Reasoning | Mar 26, 2022 | BenchmarkingSentence | CodeCode Available | 1 |
| Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative Comprehension | Mar 26, 2022 | BenchmarkingQuestion Answering | CodeCode Available | 1 |