SOTAVerified

Benchmarking

Papers

Showing 581590 of 5548 papers

TitleStatusHype
Benchmarking Generated Poses: How Rational is Structure-based Drug Design with Generative Models?Code1
Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable SummarizationCode1
Anabranch Network for Camouflaged Object SegmentationCode1
Benchmarking Large Language Models for News SummarizationCode1
A Closer Look at Mortality Risk Prediction from ElectrocardiogramsCode1
CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language ModelsCode1
COVID-19 event extraction from Twitter via extractive question answering with continuous promptsCode1
Deep Learning-Based Synchronization for Uplink NB-IoTCode1
DocuMint: Docstring Generation for Python using Small Language ModelsCode1
Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls and New BenchmarkingCode1
Show:102550
← PrevPage 59 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified