| Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale | Jun 1, 2022 | Benchmarking | CodeCode Available | 1 |
| Jojajovai: A Parallel Guarani-Spanish Corpus for MT Benchmarking | Jun 1, 2022 | BenchmarkingSentence | CodeCode Available | 1 |
| A Japanese Dataset for Subjective and Objective Sentiment Polarity Classification in Micro Blog Domain | Jun 1, 2022 | BenchmarkingEmotion Recognition | CodeCode Available | 1 |
| Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection | May 30, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions | May 27, 2022 | BenchmarkingFew-Shot Image Classification | CodeCode Available | 1 |
| Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed | May 27, 2022 | BenchmarkingBinary Classification | CodeCode Available | 1 |
| MIMII DG: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization Task | May 27, 2022 | BenchmarkingDomain Generalization | CodeCode Available | 1 |
| GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles | May 25, 2022 | BenchmarkingEvent Argument Extraction | CodeCode Available | 1 |
| Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis | May 24, 2022 | BenchmarkingFederated Learning | CodeCode Available | 1 |
| PyRelationAL: a python library for active learning research and development | May 23, 2022 | Active LearningBenchmarking | CodeCode Available | 1 |