SOTAVerified

Benchmarking

Papers

Showing 48714880 of 5548 papers

TitleStatusHype
MSAMSum: Towards Benchmarking Multi-lingual Dialogue SummarizationCode0
Alchemy: A Quantum Chemistry Dataset for Benchmarking AI ModelsCode0
FHBench: Towards Efficient and Personalized Federated Learning for Multimodal HealthcareCode0
Benchmarking quantum machine learning kernel training for classification tasksCode0
The Saudi Privacy Policy DatasetCode0
MST: Adaptive Multi-Scale Tokens Guided Interactive SegmentationCode0
ferret: a Framework for Benchmarking Explainers on TransformersCode0
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on TurkishCode0
FEET: A Framework for Evaluating Embedding TechniquesCode0
Benchmarking Probabilistic Deep Learning Methods for License Plate RecognitionCode0
Show:102550
← PrevPage 488 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified