SOTAVerified

Benchmarking

Papers

Showing 551560 of 5548 papers

TitleStatusHype
CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasksCode1
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERTCode1
A Benchmarking Study of Embedding-based Entity Alignment for Knowledge GraphsCode1
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional DependenciesCode1
Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture BreedingCode1
A Dataset for Answering Time-Sensitive QuestionsCode1
Comprehensive benchmarking of large language models for RNA secondary structure predictionCode1
CommonPower: A Framework for Safe Data-Driven Smart Grid ControlCode1
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity QuantificationCode1
Combinatorial Optimization with Policy Adaptation using Latent Space SearchCode1
Show:102550
← PrevPage 56 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified