SOTAVerified

Benchmarking

Papers

Showing 4611–4620 of 5548 papers

Title | Status | Hype
Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning | Code | 0
Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning | Code | 0
AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Code | 0
Hybrid Random Features | Code | 0
Beyond Slow Signs in High-fidelity Model Extraction | Code | 0
Hybrid Machine Learning Models of Classifying Residential Requests for Smart Dispatching | Code | 0
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Code | 0
HuSc3D: Human Sculpture dataset for 3D object reconstruction | Code | 0
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Code | 0
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified