SOTAVerified

Benchmarking

Papers

Showing 15711580 of 5548 papers

TitleStatusHype
An Optical Control Environment for Benchmarking Reinforcement Learning AlgorithmsCode0
Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question AnsweringCode0
KamNet: An Integrated Spatiotemporal Deep Neural Network for Rare Event Search in KamLAND-ZenCode0
KArSL: Arabic Sign Language DatabaseCode0
Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical InvestigationCode0
An open unified deep graph learning framework for discovering drug leadsCode0
Advancing and Benchmarking Personalized Tool Invocation for LLMsCode0
Joint Multi-Scale Tone Mapping and Denoising for HDR Image EnhancementCode0
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor PerturbationCode0
Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science CommunicatorsCode0
Show:102550
← PrevPage 158 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified