Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1226–1250 of 5548 papers

Title	Date	Tasks	Status	Hype
FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks	Nov 22, 2021	BenchmarkingFederated Learning	CodeCode Available	1
GRecX: An Efficient and Unified Benchmark for GNN-based Recommendation	Nov 19, 2021	BenchmarkingManagement	CodeCode Available	1
Benchmarking and scaling of deep learning models for land cover image classification	Nov 18, 2021	BenchmarkingClassification	CodeCode Available	1
Which priors matter? Benchmarking models for learning latent dynamics	Nov 9, 2021	Autonomous DrivingBenchmarking	CodeCode Available	1
Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning	Nov 8, 2021	Adversarial RobustnessBenchmarking	CodeCode Available	1
IOHexperimenter: Benchmarking Platform for Iterative Optimization Heuristics	Nov 7, 2021	Bayesian OptimizationBenchmarking	CodeCode Available	1
Benchmarking Data-driven Surrogate Simulators for Artificial Electromagnetic Materials	Nov 6, 2021	BenchmarkingNeural Network simulation	CodeCode Available	1
OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic Full Waveform Inversion	Nov 4, 2021	2kBenchmarking	CodeCode Available	1
B-Pref: Benchmarking Preference-Based Reinforcement Learning	Nov 4, 2021	Benchmarkingreinforcement-learning	CodeCode Available	1
AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling	Nov 1, 2021	Benchmarkingobject-detection	CodeCode Available	1
OPF-Learn: An Open-Source Framework for Creating Representative AC Optimal Power Flow Datasets	Nov 1, 2021	Benchmarking	CodeCode Available	1
Don’t be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System	Nov 1, 2021	BenchmarkingResponse Generation	CodeCode Available	1
Benchmarking Meta-embeddings: What Works and What Does Not	Nov 1, 2021	BenchmarkingEmbeddings Evaluation	CodeCode Available	1
FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation	Oct 26, 2021	BenchmarkingScene Segmentation	CodeCode Available	1
Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations	Oct 22, 2021	BenchmarkingLearning with noisy labels	CodeCode Available	1
OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis	Oct 21, 2021	BenchmarkingBIG-bench Machine Learning	CodeCode Available	1
Text-Based Person Search with Limited Data	Oct 20, 2021	BenchmarkingContrastive Learning	CodeCode Available	1
NAS-HPO-Bench-II: A Benchmark Dataset on Joint Optimization of Convolutional Neural Network Architecture and Training Hyperparameters	Oct 19, 2021	4kBenchmarking	CodeCode Available	1
HUMAN4D: A Human-Centric Multimodal Dataset for Motions and Immersive Media	Oct 14, 2021	3D Pose EstimationBenchmarking	CodeCode Available	1
Benchmarking the Robustness of Spatial-Temporal Models Against Corruptions	Oct 13, 2021	BenchmarkingComputational Efficiency	CodeCode Available	1
Codabench: Flexible, Easy-to-Use and Reproducible Benchmarking Platform	Oct 12, 2021	Benchmarking	CodeCode Available	1
NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks	Oct 12, 2021	Benchmarkingimage-classification	CodeCode Available	1
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations	Oct 12, 2021	BenchmarkingVoice Conversion	CodeCode Available	1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset	Oct 11, 2021	BenchmarkingFace Hallucination	CodeCode Available	1
Performance Evaluation of Deep Transfer Learning on Multiclass Identification of Common Weed Species in Cotton Production Systems	Oct 11, 2021	BenchmarkingManagement	CodeCode Available	1

Show:10 25 50

← PrevPage 50 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified