SOTAVerified

Human Judgment Correlation

A task in which an algorithm assigns judgment scores that should correlate with human judgments.

Benchmark Results

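| # | Model | Metric | Claimed | Verified | Status |
|---|-------|--------|---------|----------|--------|
| 1 | MID | Kendall's Tau-c | 54.9 | | Unverified |
| 2 | SoftSPICE | Kendall's Tau-c | 54.2 | | Unverified |
| 3 | RefCLIP-S | Kendall's Tau-c | 53 | | Unverified |
| 4 | CLIP-S | Kendall's Tau-c | 51.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|-------|--------|---------|----------|--------|
| 1 | MID | Kendall's Tau-b | 37.3 | | Unverified |
| 2 | RefCLIP-S | Kendall's Tau-b | 36.4 | | Unverified |
| 3 | CLIP-S | Kendall's Tau-b | 34.4 | | Unverified |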
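
The two metrics reported above, Kendall's Tau-b and Tau-c, measure how well a model's scores rank items in the same order as human ratings. The following is a minimal pure-Python sketch of the standard textbook definitions, not the evaluation code behind any entry on this page. The function name `kendall_tau` and the sample scores are hypothetical and chosen only for illustration. Because Kendall's tau lies in [-1, 1], the values in the tables are presumably the coefficient multiplied by 100.

```python
from itertools import combinations
from math import sqrt


def kendall_tau(metric_scores, human_scores, variant="b"):
    """Kendall's tau between two equal-length score lists (hypothetical helper).

    variant="b": tau-b, which corrects for ties in either list.
    variant="c": Stuart's tau-c, suited to differing numbers of distinct values.
    """
    if len(metric_scores) != len(human_scores):
        raise ValueError("score lists must have the same length")
    n = len(metric_scores)
    if n < 2:
        raise ValueError("need at least two items")

    concordant = discordant = tied_x_only = tied_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(metric_scores, human_scores), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both lists: contributes to neither count
        if dx == 0:
            tied_x_only += 1
        elif dy == 0:
            tied_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1

    diff = concordant - discordant
    if variant == "b":
        # Pairs not tied in x, and pairs not tied in y.
        untied_x = concordant + discordant + tied_y_only
        untied_y = concordant + discordant + tied_x_only
        denom = sqrt(untied_x * untied_y)
    elif variant == "c":
        # m is the smaller number of distinct values across the two lists.
        m = min(len(set(metric_scores)), len(set(human_scores)))
        denom = n * n * (m - 1) / m / 2
    else:
        raise ValueError("variant must be 'b' or 'c'")

    return diff / denom if denom else float("nan")


if __name__ == "__main__":
    # Made-up scores for five items; human ratings contain ties.
    metric = [0.9, 0.4, 0.7, 0.2, 0.5]
    human = [3, 2, 3, 1, 1]
    print("tau-b:", kendall_tau(metric, human, "b"))
    print("tau-c:", kendall_tau(metric, human, "c"))
```

Tau-b normalizes by the number of pairs that are untied in each list, while Tau-c normalizes by the sample size and the smaller number of distinct values, so the two can differ noticeably when human ratings use a coarse scale.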