SOTAVerified

Benchmarking

Papers

Showing 38813890 of 5548 papers

TitleStatusHype
Analyzing Hong Kong's Legal Judgments from a Computational Linguistics point-of-view0
A Simulation-Augmented Benchmarking Framework for Automatic RSO Streak Detection in Single-Frame Space Images0
Benchmarking Automated Machine Learning Methods for Price Forecasting Applications0
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task0
On Pitfalls of RemOve-And-Retrain: Data Processing Inequality PerspectiveCode0
Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency0
CIMLA: Interpretable AI for inference of differential causal networks0
Unsupervised Synthetic Image Refinement via Contrastive Learning and Consistent Semantic-Structural Constraints0
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation OncologyCode0
A Framework for Benchmarking Real-Time Embedded Object Detection0
Show:102550
← PrevPage 389 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified