SOTAVerified

Benchmarking

Papers

Showing 521-530 of 5548 papers

Title | Status | Hype
Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Shape Reconstruction | Code | 1
CombiBench: Benchmarking LLM Capability for Combinatorial Mathematics | Code | 1
An Empirical Study on Google Research Football Multi-agent Scenarios | Code | 1
Addressing the generalization of 3D registration methods with a featureless baseline and an unbiased benchmark | Code | 1
Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective | Code | 1
Combinatorial Optimization with Policy Adaptation using Latent Space Search | Code | 1
Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New Benchmark | Code | 1
An Empirical Study of GPT-4o Image Generation Capabilities | Code | 1
Benchmarking Econometric and Machine Learning Methodologies in Nowcasting | Code | 1
Benchmarking End-to-End Behavioural Cloning on Video Games | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified