| The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation | Jan 6, 2023 | BenchmarkingCrop Classification | CodeCode Available | 1 |
| Trace Encoding in Process Mining: a survey and benchmarking | Jan 5, 2023 | BenchmarkingPredictive Process Monitoring | CodeCode Available | 1 |
| Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation | Jan 3, 2023 | BenchmarkingFew-shot Instance Segmentation | CodeCode Available | 1 |
| MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs | Jan 1, 2023 | BenchmarkingGPU | CodeCode Available | 1 |
| SQAD: Automatic Smartphone Camera Quality Assessment and Benchmarking | Jan 1, 2023 | Benchmarking | CodeCode Available | 1 |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions | Jan 1, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Benchmarking Spatial Relationships in Text-to-Image Generation | Dec 20, 2022 | BenchmarkingImage Generation | CodeCode Available | 1 |
| A Comprehensive Study of the Robustness for LiDAR-based 3D Object Detectors against Adversarial Attacks | Dec 20, 2022 | 3D Object DetectionBenchmarking | CodeCode Available | 1 |
| Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift | Dec 15, 2022 | BenchmarkingImage Captioning | CodeCode Available | 1 |
| Benchmarking Large Language Models for Automated Verilog RTL Code Generation | Dec 13, 2022 | BenchmarkingCode Generation | CodeCode Available | 1 |
| On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline | Dec 12, 2022 | BenchmarkingData Augmentation | CodeCode Available | 1 |
| Ego-Body Pose Estimation via Ego-Head Pose Estimation | Dec 9, 2022 | BenchmarkingDisentanglement | CodeCode Available | 1 |
| Benchmarking Self-Supervised Learning on Diverse Pathology Datasets | Dec 9, 2022 | BenchmarkingClassification | CodeCode Available | 1 |
| CODEBench: A Neural Architecture and Hardware Accelerator Co-Design Framework | Dec 7, 2022 | Benchmarking | CodeCode Available | 1 |
| RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning | Dec 4, 2022 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Towards Scene Understanding for Autonomous Operations on Airport Aprons | Dec 4, 2022 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Geoclidean: Few-Shot Generalization in Euclidean Geometry | Nov 30, 2022 | Benchmarking | CodeCode Available | 1 |
| AdsorbML: A Leap in Efficiency for Adsorption Energy Calculations using Generalizable Machine Learning Potentials | Nov 29, 2022 | Benchmarking | CodeCode Available | 1 |
| A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification | Nov 28, 2022 | Benchmarkingimage-classification | CodeCode Available | 1 |
| Multi-Mask Aggregators for Graph Neural Networks | Nov 24, 2022 | BenchmarkingGraph Regression | CodeCode Available | 1 |
| This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish | Nov 23, 2022 | Benchmarking | CodeCode Available | 1 |
| fseval: A Benchmarking Framework for Feature Selection and Feature Ranking Algorithms | Nov 23, 2022 | Automated Feature EngineeringBenchmarking | CodeCode Available | 1 |
| PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation with Deep Reinforcement Learning | Nov 19, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| CryptOpt: Verified Compilation with Randomized Program Search for Cryptographic Primitives (full version) | Nov 19, 2022 | BenchmarkingC++ code | CodeCode Available | 1 |
| Benchmarking Graph Neural Networks for FMRI analysis | Nov 16, 2022 | Benchmarking | CodeCode Available | 1 |
| Hyperparameter optimization in deep multi-target prediction | Nov 8, 2022 | AutoMLBenchmarking | CodeCode Available | 1 |
| EventEA: Benchmarking Entity Alignment for Event-centric Knowledge Graphs | Nov 5, 2022 | AttributeBenchmarking | CodeCode Available | 1 |
| Benchmarking Adversarial Patch Against Aerial Detection | Oct 30, 2022 | Benchmarking | CodeCode Available | 1 |
| Benchmarking Language Models for Code Syntax Understanding | Oct 26, 2022 | Benchmarking | CodeCode Available | 1 |
| A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial Images | Oct 25, 2022 | BenchmarkingFew-Shot Object Detection | CodeCode Available | 1 |
| ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition | Oct 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural Networks | Oct 24, 2022 | Benchmarking | CodeCode Available | 1 |
| A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research Challenges | Oct 21, 2022 | BenchmarkingCommunity Detection | CodeCode Available | 1 |
| RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control | Oct 20, 2022 | BenchmarkingData Augmentation | CodeCode Available | 1 |
| Graphs, Constraints, and Search for the Abstraction and Reasoning Corpus | Oct 18, 2022 | ARCBenchmarking | CodeCode Available | 1 |
| An Open-source Benchmark of Deep Learning Models for Audio-visual Apparent and Self-reported Personality Recognition | Oct 17, 2022 | Benchmarking | CodeCode Available | 1 |
| iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations | Oct 17, 2022 | BenchmarkingText Classification | CodeCode Available | 1 |
| KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents | Oct 17, 2022 | BenchmarkingJoint Entity and Relation Extraction | CodeCode Available | 1 |
| WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments | Oct 14, 2022 | Atari GamesBenchmarking | CodeCode Available | 1 |
| CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Oct 14, 2022 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| A Comprehensive Study on Large-Scale Graph Training: Benchmarking and Rethinking | Oct 14, 2022 | BenchmarkingGPU | CodeCode Available | 1 |
| DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation | Oct 11, 2022 | 6D Pose Estimation6D Pose Estimation using RGB | CodeCode Available | 1 |
| Benchmarking saliency methods for chest X-ray interpretation | Oct 10, 2022 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints | Oct 8, 2022 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Neural Methods for Logical Reasoning Over Knowledge Graphs | Sep 28, 2022 | BenchmarkingKnowledge Graphs | CodeCode Available | 1 |
| Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms | Sep 21, 2022 | 3D human pose and shape estimationBenchmarking | CodeCode Available | 1 |
| Sanity Check for External Clustering Validation Benchmarks using Internal Validation Measures | Sep 20, 2022 | BenchmarkingClustering | CodeCode Available | 1 |
| A framework for benchmarking clustering algorithms | Sep 20, 2022 | BenchmarkingClustering | CodeCode Available | 1 |
| Active-Passive SimStereo -- Benchmarking the Cross-Generalization Capabilities of Deep Learning-based Stereo Methods | Sep 17, 2022 | BenchmarkingStereo Matching | CodeCode Available | 1 |