SOTAVerified

Benchmarking

Papers

Showing 541550 of 5548 papers

TitleStatusHype
QuantBench: Benchmarking AI Methods for Quantitative Investment0
From Past to Present: A Survey of Malicious URL Detection Techniques, Datasets and Code RepositoriesCode0
MAYA: Addressing Inconsistencies in Generative Password Guessing through a Unified BenchmarkCode0
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field EnlargementCode1
Enhancing TCR-Peptide Interaction Prediction with Pretrained Language Models and Molecular Representations0
Benchmarking machine learning models for predicting aerofoil performance0
Fluorescence Reference Target Quantitative Analysis LibraryCode0
CLIRudit: Cross-Lingual Information Retrieval of Scientific Documents0
Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V30
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs0
Show:102550
← PrevPage 55 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified