SOTAVerified

Benchmarking

Papers

Showing 641650 of 5548 papers

TitleStatusHype
Data-Driven Denoising of Stationary Accelerometer SignalsCode1
D2S: Document-to-Slide Generation Via Query-Based Text SummarizationCode1
DACBench: A Benchmark Library for Dynamic Algorithm ConfigurationCode1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series DataCode1
Align and Distill: Unifying and Improving Domain Adaptive Object DetectionCode1
Benchmarking Graph Neural Networks on Dynamic Link PredictionCode1
Curious Hierarchical Actor-Critic Reinforcement LearningCode1
CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language ModelsCode1
DataRec: A Python Library for Standardized and Reproducible Data Management in Recommender SystemsCode1
CRoW: Benchmarking Commonsense Reasoning in Real-World TasksCode1
Show:102550
← PrevPage 65 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified