SOTAVerified

Benchmarking

Papers

Showing 541550 of 5548 papers

TitleStatusHype
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional DependenciesCode1
Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture BreedingCode1
Comprehensive benchmarking of large language models for RNA secondary structure predictionCode1
Comics Datasets Framework: Mix of Comics datasets for detection benchmarkingCode1
CommonPower: A Framework for Safe Data-Driven Smart Grid ControlCode1
CombiBench: Benchmarking LLM Capability for Combinatorial MathematicsCode1
A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular DataCode1
Combinatorial Optimization with Policy Adaptation using Latent Space SearchCode1
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity QuantificationCode1
Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban IntersectionCode1
Show:102550
← PrevPage 55 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified