SOTAVerified

Benchmarking

Papers

Showing 3501–3510 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| AERF: Adaptive ensemble random fuzzy algorithm for anomaly detection in cloud computing | | 0 |
| Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture | | 0 |
| "It's a Match!" -- A Benchmark of Task Affinity Scores for Joint Learning | | 0 |
| The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation | Code | 1 |
| The Evolutionary Computation Methods No One Should Use | | 0 |
| ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions | Code | 0 |
| Trace Encoding in Process Mining: a survey and benchmarking | Code | 1 |
| HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset | | 0 |
| Improving Sequential Recommendation Models with an Enhanced Loss Function | Code | 0 |
| Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |