SOTAVerified

Benchmarking

Papers

Showing 2351-2360 of 5548 papers

Title | Status | Hype
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Code | 0
Graph Convolutional Networks Meet with High Dimensionality Reduction | Code | 0
GiantHunter: Accurate detection of giant virus in metagenomic data using reinforcement-learning and Monte Carlo tree search | Code | 0
Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part II | Code | 0
Global Prediction of COVID-19 Variant Emergence Using Dynamics-Informed Graph Neural Networks | Code | 0
Benchmarking LLM-based Relevance Judgment Methods | Code | 0
Enhancing Treatment Effect Estimation via Active Learning: A Counterfactual Covering Perspective | Code | 0
Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning | Code | 0
Geological Inference from Textual Data using Word Embeddings | Code | 0
DyKgChat: Benchmarking Dialogue Generation Grounding on Dynamic Knowledge Graphs | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified