SOTAVerified

Benchmarking

Papers

Showing 30013025 of 5548 papers

TitleStatusHype
Benchmarking the Gerchberg-Saxton Algorithm0
Benchmarking the Fidelity and Utility of Synthetic Relational Data0
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web0
ImageNet performance correlates with pose estimation robustness and generalization on out-of-domain data0
ImagePairs: Realistic Super Resolution Dataset via Beam Splitter Camera Rig0
Imagining and building wise machines: The centrality of AI metacognition0
Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans0
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World0
Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking0
Imitation Learning from Pixel Observations for Continuous Control0
Practical Guidelines for Cell Segmentation Models Under Optical Aberrations in Microscopy0
A Functional Analysis Approach to Symbolic Regression0
Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors0
A Framework for Large Scale Synthetic Graph Dataset Generation0
Dataset Properties Shape the Success of Neuroimaging-Based Patient Stratification: A Benchmarking Analysis Across Clustering Algorithms0
A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data0
Impact of spatial transformations on landscape features of CEC2022 basic benchmark problems0
Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor0
Implementing hosting capacity analysis in distribution networks: Practical considerations, advancements and future directions0
Benchmarking the Benchmark -- Analysis of Synthetic NIDS Datasets0
Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities0
Benchmarking the Accuracy and Robustness of Feedback Alignment Algorithms0
Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels0
The Moral Mind(s) of Large Language Models0
Benchmarking Test-Time Unsupervised Deep Neural Network Adaptation on Edge Devices0
Show:102550
← PrevPage 121 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified