Benchmarking

Papers

Showing 3851–3900 of 5548 papers

Title	Date	Tasks	Status	Hype
A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking	Jun 1, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale	Jun 1, 2022	Benchmarking	Code Available	1
NEWTS: A Corpus for News Topic-Focused Summarization	May 31, 2022	Benchmarking, Text Summarization	Unverified	0
Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems	May 31, 2022	Benchmarking	Unverified	0
AI-enabled Sound Pattern Recognition on Asthma Medication Adherence: Evaluation with the RDA Benchmark Suite	May 30, 2022	Benchmarking, BIG-bench Machine Learning	Code Available	0
bsnsing: A decision tree induction method based on recursive optimal boolean rule composition	May 30, 2022	Benchmarking	Code Available	0
Benchmarking Unsupervised Anomaly Detection and Localization	May 30, 2022	Anomaly Detection, Benchmarking	Unverified	0
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection	May 30, 2022	3D Object Detection, Autonomous Driving	Code Available	1
A Framework for Generating Informative Benchmark Instances	May 29, 2022	Benchmarking	Code Available	0
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions	May 27, 2022	Benchmarking, Few-Shot Image Classification	Code Available	1
Bias Reduction via Cooperative Bargaining in Synthetic Graph Dataset Generation	May 27, 2022	Benchmarking, Dataset Generation	Code Available	0
MIMII DG: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization Task	May 27, 2022	Benchmarking, Domain Generalization	Code Available	1
Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed	May 27, 2022	Benchmarking, Binary Classification	Code Available	1
Fast Vision Transformers with HiLo Attention	May 26, 2022	Benchmarking, Efficient ViTs	Code Available	2
Benchmarking of Deep Learning models on 2D Laminar Flow behind Cylinder	May 26, 2022	Benchmarking, Deep Learning	Unverified	0
GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles	May 25, 2022	Benchmarking, Event Argument Extraction	Code Available	1
Large Language Models are Few-Shot Clinical Information Extractors	May 25, 2022	Benchmarking, coreference-resolution	Unverified	0
Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis	May 24, 2022	Benchmarking, Federated Learning	Code Available	1
Advanced Manufacturing Configuration by Sample-efficient Batch Bayesian Optimization	May 24, 2022	Bayesian Optimization, Benchmarking	Unverified	0
RCC-GAN: Regularized Compound Conditional GAN for Large-Scale Tabular Data Synthesis	May 24, 2022	Benchmarking, Generative Adversarial Network	Unverified	0
Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets	May 23, 2022	Argument Mining, Benchmarking	Code Available	0
Paddy Doctor: A Visual Image Dataset for Automated Paddy Disease Classification and Benchmarking	May 23, 2022	Benchmarking, Classification	Unverified	0
PyRelationAL: a python library for active learning research and development	May 23, 2022	Active Learning, Benchmarking	Code Available	1
Graph-theoretical approach to robust 3D normal extraction of LiDAR data	May 23, 2022	Benchmarking	Code Available	0
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization	May 23, 2022	Benchmarking, Deep Reinforcement Learning	Unverified	0
Deep Learning-Based Synchronization for Uplink NB-IoT	May 22, 2022	Benchmarking, Deep Learning	Code Available	1
Self-Supervised Speech Representation Learning: A Review	May 21, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Deep Learning vs. Gradient Boosting: Benchmarking state-of-the-art machine learning algorithms for credit scoring	May 21, 2022	Benchmarking, Binary Classification	Unverified	0
Oracle-MNIST: a Realistic Image Dataset for Benchmarking Machine Learning Algorithms	May 19, 2022	Benchmarking, BIG-bench Machine Learning	Code Available	1
BARS: Towards Open Benchmarking for Recommender Systems	May 19, 2022	Benchmarking, Click-Through Rate Prediction	Code Available	2
SNaC: Coherence Error Detection for Narrative Summarization	May 19, 2022	Benchmarking, Coherence Evaluation	Code Available	0
Entity Alignment For Knowledge Graphs: Progress, Challenges, and Empirical Studies	May 18, 2022	Benchmarking, Entity Alignment	Unverified	0
Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data	May 16, 2022	Accented Speech Recognition, Benchmarking	Unverified	0
Uncertainty estimation for Cross-dataset performance in Trajectory prediction	May 15, 2022	Benchmarking, Prediction	Unverified	0
The VoicePrivacy 2020 Challenge Evaluation Plan	May 14, 2022	Benchmarking	Code Available	1
Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking	May 13, 2022	Benchmarking, reinforcement-learning	Unverified	0
Federated Learning Under Intermittent Client Availability and Time-Varying Communication Constraints	May 13, 2022	Benchmarking, Federated Learning	Code Available	1
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages	May 12, 2022	Benchmarking, Diversity	Unverified	0
Subspace Learning Machine (SLM): Methodology and Performance	May 11, 2022	Benchmarking	Unverified	0
Individual Fairness Guarantees for Neural Networks	May 11, 2022	Benchmarking, Fairness	Code Available	0
Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks	May 11, 2022	Benchmarking, Explanation Generation	Code Available	1
Clinical Prompt Learning with Frozen Language Models	May 11, 2022	Benchmarking, GPU	Code Available	1
Towards Intersectionality in Machine Learning: Including More Identities, Handling Underrepresentation, and Performing Evaluation	May 10, 2022	Attribute, Benchmarking	Code Available	0
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents	May 9, 2022	Benchmarking, Graph Neural Network	Unverified	0
Assigning Species Information to Corresponding Genes by a Sequence Labeling Framework	May 8, 2022	Benchmarking, Binary Classification	Code Available	0
BiCo-Net: Regress Globally, Match Locally for Robust 6D Pose Estimation	May 7, 2022	6D Pose Estimation, Benchmarking	Code Available	1
GenISP: Neural ISP for Low-Light Machine Cognition	May 7, 2022	Benchmarking, Image Restoration	Code Available	1
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution	May 6, 2022	Benchmarking, Speaker Identification	Unverified	0
Benchmarking Econometric and Machine Learning Methodologies in Nowcasting	May 6, 2022	Benchmarking, BIG-bench Machine Learning	Code Available	1
Design Target Achievement Index: A Differentiable Metric to Enhance Deep Generative Models in Multi-Objective Inverse Design	May 6, 2022	Benchmarking	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified