SOTAVerified

Benchmarking

Papers

Showing 771780 of 5548 papers

TitleStatusHype
dMelodies: A Music Dataset for Disentanglement LearningCode1
Benchmarking the Spectrum of Agent CapabilitiesCode1
Foundation Model of Electronic Medical Records for Adaptive Risk EstimationCode1
Benchmarking TinyML Systems: Challenges and DirectionCode1
Benchmarking Transcriptomics Foundation Models for Perturbation Analysis : one PCA still rules them allCode1
fseval: A Benchmarking Framework for Feature Selection and Feature Ranking AlgorithmsCode1
Benchmarking tree species classification from proximally-sensed laser scanning data: introducing the FOR-species20K datasetCode1
FullFront: Benchmarking MLLMs Across the Full Front-End Engineering WorkflowCode1
Don’t be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue SystemCode1
Formalizing Multimedia Recommendation through Multimodal Deep LearningCode1
Show:102550
← PrevPage 78 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified