SOTAVerified

Benchmarking

Papers

Showing 4391-4400 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Large Language Models are Few-Shot Clinical Information Extractors | | 0 |
| Large Language Models as Automated Aligners for benchmarking Vision-Language Models | | 0 |
| Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens | | 0 |
| Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level | | 0 |
| Large Malaysian Language Model Based on Mistral for Enhanced Local Language Understanding | | 0 |
| Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models | | 0 |
| Large-scale Benchmarking of Metaphor-based Optimization Heuristics | | 0 |
| Large-Scale Quantum Separability Through a Reproducible Machine Learning Lens | | 0 |
| Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics | | 0 |
| Latent Variable Models for Visual Question Answering | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |