SOTAVerified

Benchmarking

Papers

Showing 1661–1670 of 5548 papers

Title | Status | Hype
An Empirical Evaluation of Cost-based Federated SPARQL Query Processing Engines | Code | 0
Benchmarking and optimizing organism wide single-cell RNA alignment methods | Code | 0
An empirical comparison between stochastic and deterministic centroid initialisation for K-Means variations | Code | 0
JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models | Code | 0
JExplore: Design Space Exploration Tool for Nvidia Jetson Boards | Code | 0
A Dataset for Web-Scale Knowledge Base Population | Code | 0
Benchmarking and Improving Text-to-SQL Generation under Ambiguity | Code | 0
An Efficient Two-stage Gradient Boosting Framework for Short-term Traffic State Estimation | Code | 0
DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs | Code | 0
A Benchmark on Extremely Weakly Supervised Text Classification: Reconcile Seed Matching and Prompting Approaches | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified