SOTAVerified

Benchmarking

Papers

Showing 12711280 of 5548 papers

TitleStatusHype
Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through LexicaCode1
Scikit-dimension: a Python package for intrinsic dimension estimationCode1
Biomedical Data-to-Text Generation via Fine-Tuning TransformersCode1
ReMeDi: Resources for Multi-domain, Multi-service, Medical DialoguesCode1
Tune It or Don't Use It: Benchmarking Data-Efficient Image ClassificationCode1
Semi-Supervised Exaggeration Detection of Health Science Press ReleasesCode1
Searching for an Effective Defender: Benchmarking Defense against Adversarial Word SubstitutionCode1
KO codes: Inventing Nonlinear Encoding and Decoding for Reliable Wireless Communication via Deep-learningCode1
Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training DebiasingCode1
A Unified Taxonomy and Multimodal Dataset for Events in Invasion GamesCode1
Show:102550
← PrevPage 128 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified