SOTAVerified

Benchmarking

Papers

Showing 26212630 of 5548 papers

TitleStatusHype
FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models0
FlowMind: Automatic Workflow Generation with LLMs0
FastEnsemble: Benchmarking and Accelerating Ensemble-based Uncertainty Estimation for Image-to-Image Translation0
Fast Empirical Scenarios0
Benchmarking Quantum Convolutional Neural Networks for Signal Classification in Simulated Gamma-Ray Burst Detection0
A Survey on Model Compression for Large Language Models0
FastDraft: How to Train Your Draft0
AI-Powered Cow Detection in Complex Farm Environments0
Benchmarking Sample Selection Strategies for Batch Reinforcement Learning0
Benchmarking quantized LLaMa-based models on the Brazilian Secondary School Exam0
Show:102550
← PrevPage 263 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified