Benchmarking

Papers

Showing 3076–3100 of 5548 papers

Title	Date	Tasks	Status	Hype
Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems	Jul 27, 2023	Benchmarking, GPU	Unverified	0
Quantitative Metrics for Benchmarking Human-Aware Robot Navigation	Jul 26, 2023	Benchmarking, Robot Navigation	Code Available	0
YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems	Jul 26, 2023	Benchmarking, CPU	Code Available	0
Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy	Jul 26, 2023	Benchmarking, object-detection	Unverified	0
Foundational Models Defining a New Era in Vision: A Survey and Outlook	Jul 25, 2023	Benchmarking	Code Available	2
Towards Long-Term predictions of Turbulence using Neural Operators	Jul 25, 2023	Benchmarking	Unverified	0
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review	Jul 25, 2023	Benchmarking, Multi-Task Learning	Code Available	0
UPREVE: An End-to-End Causal Discovery Benchmarking System	Jul 25, 2023	Benchmarking, Causal Discovery	Unverified	0
Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor	Jul 25, 2023	Benchmarking, CPU	Unverified	0
Benchmarking and Analyzing Generative Data for Visual Recognition	Jul 25, 2023	Benchmarking, Retrieval	Unverified	0
Towards an AI Accountability Policy	Jul 25, 2023	Benchmarking, Fairness	Unverified	0
The Impact of Genomic Variation on Function (IGVF) Consortium	Jul 24, 2023	Benchmarking	Unverified	0
Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation of rPPG	Jul 24, 2023	Benchmarking	Code Available	2
PLANTAIN: Diffusion-inspired Pose Score Minimization for Fast and Accurate Molecular Docking	Jul 22, 2023	Benchmarking, Molecular Docking	Code Available	1
Selecting the motion ground truth for loose-fitting wearables: benchmarking optical MoCap methods	Jul 21, 2023	Benchmarking	Code Available	0
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning	Jul 21, 2023	Benchmarking, Combinatorial Optimization	Code Available	1
Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory	Jul 20, 2023	Benchmarking, Decision Making	Code Available	1
The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models	Jul 20, 2023	Benchmarking	Unverified	0
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models	Jul 20, 2023	Benchmarking, Language Modeling	Code Available	1
Benchmarking Potential Based Rewards for Learning Humanoid Locomotion	Jul 19, 2023	Benchmarking, Reinforcement Learning (RL)	Code Available	2
On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild	Jul 17, 2023	Benchmarking, Real-Time Semantic Segmentation	Unverified	0
Efficient Prediction of Peptide Self-assembly through Sequential and Graphical Encoding	Jul 17, 2023	Benchmarking, Deep Learning	Code Available	1
Examining the Effects of Degree Distribution and Homophily in Graph Learning Models	Jul 17, 2023	Benchmarking, Graph Clustering	Code Available	1
Towards Heterogeneous Long-tailed Learning: Benchmarking, Metrics, and Toolbox	Jul 17, 2023	Benchmarking	Code Available	1
Approaches for benchmarking single-cell gene regulatory network inference methods	Jul 17, 2023	Benchmarking	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified