Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3476–3500 of 5548 papers

Title	Date	Tasks	Status	Hype
Benchmarking Large Language Models for News Summarization	Jan 31, 2023	BenchmarkingNews Summarization	CodeCode Available	1
Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST)	Jan 31, 2023	BenchmarkingModel Predictive Control	—Unverified	0
Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022	Jan 31, 2023	Action DetectionBenchmarking	CodeCode Available	0
Benchmarking Robustness to Adversarial Image Obfuscations	Jan 30, 2023	Benchmarking	CodeCode Available	1
Benchmarking optimality of time series classification methods in distinguishing diffusions	Jan 30, 2023	BenchmarkingGaussian Processes	CodeCode Available	0
Cross-Subject Deep Transfer Models for Evoked Potentials in Brain-Computer Interface	Jan 29, 2023	BenchmarkingBrain Computer Interface	—Unverified	0
Heterogeneous Datasets for Federated Survival Analysis Simulation	Jan 28, 2023	BenchmarkingFederated Learning	CodeCode Available	0
Quality Indicators for Preference-based Evolutionary Multi-objective Optimization Using a Reference Point: A Review and Analysis	Jan 28, 2023	BenchmarkingDecision Making	CodeCode Available	0
TemporAI: Facilitating Machine Learning Innovation in Time Domain Tasks for Medicine	Jan 28, 2023	BenchmarkingCausal Inference	CodeCode Available	1
Task-Agnostic Graph Neural Network Evaluation via Adversarial Collaboration	Jan 27, 2023	BenchmarkingGraph Classification	CodeCode Available	0
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning	Jan 26, 2023	BenchmarkingDeep Reinforcement Learning	CodeCode Available	3
A Systematic Review of Green AI	Jan 26, 2023	Benchmarking	CodeCode Available	0
BiBench: Benchmarking and Analyzing Network Binarization	Jan 26, 2023	BenchmarkingBinarization	CodeCode Available	1
Out of Distribution Performance of State of Art Vision Model	Jan 25, 2023	Benchmarking	—Unverified	0
Towards Robust Metrics for Concept Representation Evaluation	Jan 25, 2023	BenchmarkingDisentanglement	CodeCode Available	0
SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain	Jan 20, 2023	BenchmarkingCell Segmentation	—Unverified	0
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applications	Jan 19, 2023	BenchmarkingGPU	CodeCode Available	0
Job recommendations: benchmarking of collaborative filtering methods for classifieds	Jan 19, 2023	BenchmarkingCollaborative Filtering	—Unverified	0
Vision Learners Meet Web Image-Text Pairs	Jan 17, 2023	BenchmarkingSelf-Supervised Learning	—Unverified	0
Hawk: An Industrial-strength Multi-label Document Classifier	Jan 15, 2023	BenchmarkingDocument Classification	—Unverified	0
Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)	Jan 14, 2023	Benchmarking	CodeCode Available	2
Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces Recognition	Jan 13, 2023	BenchmarkingFace Recognition	CodeCode Available	1
Evaluating the Transferability of Machine-Learned Force Fields for Material Property Modeling	Jan 10, 2023	BenchmarkingGraph Neural Network	CodeCode Available	0
Critical review of conformational B-cell epitope prediction methods	Jan 10, 2023	BenchmarkingDrug Design	CodeCode Available	0
Benchmarking Robustness in Neural Radiance Fields	Jan 10, 2023	BenchmarkingCamera Calibration	—Unverified	0

Show:10 25 50

← PrevPage 140 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified