SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3881–3890 of 5548 papers

Title	Date	Tasks	Status	Hype
Analyzing Hong Kong's Legal Judgments from a Computational Linguistics point-of-view	May 4, 2023	BenchmarkingGraph Generation	—Unverified	0
A Simulation-Augmented Benchmarking Framework for Automatic RSO Streak Detection in Single-Frame Space Images	Apr 30, 2023	Benchmarkingobject-detection	—Unverified	0
Benchmarking Automated Machine Learning Methods for Price Forecasting Applications	Apr 28, 2023	AutoMLBenchmarking	—Unverified	0
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task	Apr 27, 2023	ArticlesBenchmarking	—Unverified	0
On Pitfalls of RemOve-And-Retrain: Data Processing Inequality Perspective	Apr 26, 2023	BenchmarkingFeature Importance	CodeCode Available	0
Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency	Apr 26, 2023	BenchmarkingCloud Computing	—Unverified	0
CIMLA: Interpretable AI for inference of differential causal networks	Apr 25, 2023	Benchmarking	—Unverified	0
Unsupervised Synthetic Image Refinement via Contrastive Learning and Consistent Semantic-Structural Constraints	Apr 25, 2023	BenchmarkingContrastive Learning	—Unverified	0
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology	Apr 24, 2023	BenchmarkingDecision Making	CodeCode Available	0
A Framework for Benchmarking Real-Time Embedded Object Detection	Apr 23, 2023	BenchmarkingObject	—Unverified	0

Show:10 25 50

← PrevPage 389 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified