Benchmarking

Papers


Showing 4226–4250 of 5548 papers

Title	Date	Tasks	Status
Privacy Protection in Street-View Panoramas using Depth and Multi-View Imagery	Mar 27, 2019	Benchmarking, Object	Unverified
Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs	May 10, 2024	Benchmarking, Hyperparameter Optimization	Unverified
Probabilistic Robustness in Deep Learning: A Concise yet Comprehensive Guide	Feb 20, 2025	Adversarial Robustness, Benchmarking	Unverified
ProBench: Benchmarking Large Language Models in Competitive Programming	Feb 28, 2025	Attribute, Benchmarking	Unverified
UCLID-Net: Single View Reconstruction in Object Space	Jun 6, 2020	Benchmarking, Decoder	Unverified
UDTIRI: An Online Open-Source Intelligent Road Inspection Benchmark Suite	Apr 18, 2023	Benchmarking, Instance Segmentation	Unverified
A Comprehensive Multi-Illuminant Dataset for Benchmarking of the Intrinsic Image Algorithms	Dec 1, 2015	Benchmarking, Image Generation	Unverified
Automatic vehicle trajectory data reconstruction at scale	Dec 15, 2022	Benchmarking, vehicle detection	Unverified
Problem-solving benefits of down-sampled lexicase selection	Jun 10, 2021	Benchmarking	Unverified
Automatic Target Recognition on Synthetic Aperture Radar Imagery: A Survey	Jul 4, 2020	Benchmarking, Survey	Unverified
Procedural Content Generation: Better Benchmarks for Transfer Reinforcement Learning	May 31, 2021	Benchmarking, Deep Learning	Unverified
Procedural Generalization by Planning with Self-Supervised World Models	Nov 2, 2021	Benchmarking, Meta-Learning	Unverified
UGSL: A Unified Framework for Benchmarking Graph Structure Learning	Aug 21, 2023	Benchmarking, Graph structure learning	Unverified
ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions	Jul 1, 2024	Benchmarking, Question Generation	Unverified
Profit: Benchmarking Personalization and Robustness Trade-off in Federated Prompt Tuning	Oct 6, 2023	Benchmarking, Federated Learning	Unverified
Progressive Class-level Distillation	May 30, 2025	Benchmarking, Knowledge Distillation	Unverified
Progressive Multi-view Human Mesh Recovery with Self-Supervision	Dec 10, 2022	Benchmarking, Diversity	Unverified
Progressive with Purpose: Guiding Progressive Inpainting DNNs through Context and Structure	Sep 21, 2022	Benchmarking, Image Inpainting	Unverified
Projective simulation applied to the grid-world and the mountain-car problem	May 21, 2014	Benchmarking, reinforcement-learning	Unverified
Project MPG: towards a generalized performance benchmark for LLM capabilities	Oct 28, 2024	Benchmarking, Chatbot	Unverified
Automatic segmenting teeth in X-ray images: Trends, a novel data set, benchmarking and future perspectives	Feb 9, 2018	Benchmarking, Image Segmentation	Unverified
Prompting ChatGPT for Chinese Learning as L2: A CEFR and EBCL Level Study	Jan 25, 2025	Benchmarking	Unverified
Prompting Scientific Names for Zero-Shot Species Recognition	Oct 15, 2023	Benchmarking, Zero-Shot Learning	Unverified
Automatic Microprocessor Performance Bug Detection	Nov 17, 2020	Benchmarking	Unverified
Prompt Sketching for Large Language Models	Nov 8, 2023	Arithmetic Reasoning, Benchmarking	Unverified


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified