| A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking | Jun 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale | Jun 1, 2022 | Benchmarking | CodeCode Available | 1 |
| NEWTS: A Corpus for News Topic-Focused Summarization | May 31, 2022 | BenchmarkingText Summarization | —Unverified | 0 |
| Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems | May 31, 2022 | Benchmarking | —Unverified | 0 |
| AI-enabled Sound Pattern Recognition on Asthma Medication Adherence: Evaluation with the RDA Benchmark Suite | May 30, 2022 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 |
| bsnsing: A decision tree induction method based on recursive optimal boolean rule composition | May 30, 2022 | Benchmarking | CodeCode Available | 0 |
| Benchmarking Unsupervised Anomaly Detection and Localization | May 30, 2022 | Anomaly DetectionBenchmarking | —Unverified | 0 |
| Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection | May 30, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| A Framework for Generating Informative Benchmark Instances | May 29, 2022 | Benchmarking | CodeCode Available | 0 |
| Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions | May 27, 2022 | BenchmarkingFew-Shot Image Classification | CodeCode Available | 1 |
| Bias Reduction via Cooperative Bargaining in Synthetic Graph Dataset Generation | May 27, 2022 | BenchmarkingDataset Generation | CodeCode Available | 0 |
| MIMII DG: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization Task | May 27, 2022 | BenchmarkingDomain Generalization | CodeCode Available | 1 |
| Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed | May 27, 2022 | BenchmarkingBinary Classification | CodeCode Available | 1 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | BenchmarkingEfficient ViTs | CodeCode Available | 2 |
| Benchmarking of Deep Learning models on 2D Laminar Flow behind Cylinder | May 26, 2022 | BenchmarkingDeep Learning | —Unverified | 0 |
| GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles | May 25, 2022 | BenchmarkingEvent Argument Extraction | CodeCode Available | 1 |
| Large Language Models are Few-Shot Clinical Information Extractors | May 25, 2022 | Benchmarkingcoreference-resolution | —Unverified | 0 |
| Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis | May 24, 2022 | BenchmarkingFederated Learning | CodeCode Available | 1 |
| Advanced Manufacturing Configuration by Sample-efficient Batch Bayesian Optimization | May 24, 2022 | Bayesian OptimizationBenchmarking | —Unverified | 0 |
| RCC-GAN: Regularized Compound Conditional GAN for Large-Scale Tabular Data Synthesis | May 24, 2022 | BenchmarkingGenerative Adversarial Network | —Unverified | 0 |
| Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets | May 23, 2022 | Argument MiningBenchmarking | CodeCode Available | 0 |
| Paddy Doctor: A Visual Image Dataset for Automated Paddy Disease Classification and Benchmarking | May 23, 2022 | BenchmarkingClassification | —Unverified | 0 |
| PyRelationAL: a python library for active learning research and development | May 23, 2022 | Active LearningBenchmarking | CodeCode Available | 1 |
| Graph-theoretical approach to robust 3D normal extraction of LiDAR data | May 23, 2022 | Benchmarking | CodeCode Available | 0 |
| Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization | May 23, 2022 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Learning-Based Synchronization for Uplink NB-IoT | May 22, 2022 | BenchmarkingDeep Learning | CodeCode Available | 1 |
| Self-Supervised Speech Representation Learning: A Review | May 21, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Deep Learning vs. Gradient Boosting: Benchmarking state-of-the-art machine learning algorithms for credit scoring | May 21, 2022 | BenchmarkingBinary Classification | —Unverified | 0 |
| Oracle-MNIST: a Realistic Image Dataset for Benchmarking Machine Learning Algorithms | May 19, 2022 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| BARS: Towards Open Benchmarking for Recommender Systems | May 19, 2022 | BenchmarkingClick-Through Rate Prediction | CodeCode Available | 2 |
| SNaC: Coherence Error Detection for Narrative Summarization | May 19, 2022 | BenchmarkingCoherence Evaluation | CodeCode Available | 0 |
| Entity Alignment For Knowledge Graphs: Progress, Challenges, and Empirical Studies | May 18, 2022 | BenchmarkingEntity Alignment | —Unverified | 0 |
| Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data | May 16, 2022 | Accented Speech RecognitionBenchmarking | —Unverified | 0 |
| Uncertainty estimation for Cross-dataset performance in Trajectory prediction | May 15, 2022 | BenchmarkingPrediction | —Unverified | 0 |
| The VoicePrivacy 2020 Challenge Evaluation Plan | May 14, 2022 | Benchmarking | CodeCode Available | 1 |
| Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking | May 13, 2022 | Benchmarkingreinforcement-learning | —Unverified | 0 |
| Federated Learning Under Intermittent Client Availability and Time-Varying Communication Constraints | May 13, 2022 | BenchmarkingFederated Learning | CodeCode Available | 1 |
| Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages | May 12, 2022 | BenchmarkingDiversity | —Unverified | 0 |
| Subspace Learning Machine (SLM): Methodology and Performance | May 11, 2022 | Benchmarking | —Unverified | 0 |
| Individual Fairness Guarantees for Neural Networks | May 11, 2022 | BenchmarkingFairness | CodeCode Available | 0 |
| Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks | May 11, 2022 | BenchmarkingExplanation Generation | CodeCode Available | 1 |
| Clinical Prompt Learning with Frozen Language Models | May 11, 2022 | BenchmarkingGPU | CodeCode Available | 1 |
| Towards Intersectionality in Machine Learning: Including More Identities, Handling Underrepresentation, and Performing Evaluation | May 10, 2022 | AttributeBenchmarking | CodeCode Available | 0 |
| LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents | May 9, 2022 | BenchmarkingGraph Neural Network | —Unverified | 0 |
| Assigning Species Information to Corresponding Genes by a Sequence Labeling Framework | May 8, 2022 | BenchmarkingBinary Classification | CodeCode Available | 0 |
| BiCo-Net: Regress Globally, Match Locally for Robust 6D Pose Estimation | May 7, 2022 | 6D Pose EstimationBenchmarking | CodeCode Available | 1 |
| GenISP: Neural ISP for Low-Light Machine Cognition | May 7, 2022 | BenchmarkingImage Restoration | CodeCode Available | 1 |
| VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | May 6, 2022 | BenchmarkingSpeaker Identification | —Unverified | 0 |
| Benchmarking Econometric and Machine Learning Methodologies in Nowcasting | May 6, 2022 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| Design Target Achievement Index: A Differentiable Metric to Enhance Deep Generative Models in Multi-Objective Inverse Design | May 6, 2022 | Benchmarking | —Unverified | 0 |