SOTAVerified

Benchmarking

Papers

Showing 951960 of 5548 papers

TitleStatusHype
Machine learning for modelling unstructured grid data in computational physics: a review0
LOB-Bench: Benchmarking Generative AI for Finance -- an Application to Limit Order Book DataCode1
SkyRover: A Modular Simulator for Cross-Domain Pathfinding0
Handwritten Text Recognition: A Survey0
Fino1: On the Transferability of Reasoning Enhanced LLMs to FinanceCode2
One-Shot Federated Learning with Classifier-Free Diffusion Models0
Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors0
The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray GenerationCode0
exHarmony: Authorship and Citations for Benchmarking the Reviewer Assignment ProblemCode0
Foundation Model of Electronic Medical Records for Adaptive Risk EstimationCode1
Show:102550
← PrevPage 96 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified