SOTAVerified

Benchmarking

Papers

Showing 49765000 of 5548 papers

TitleStatusHype
Evaluating Feature Attribution Methods in the Image DomainCode0
NegBio: a high-performance tool for negation and uncertainty detection in radiology reportsCode0
A Comprehensive Comparison of Multi-Dimensional Image Denoising MethodsCode0
NeMig -- A Bilingual News Collection and Knowledge Graph about MigrationCode0
NengoDL: Combining deep learning and neuromorphic modelling methodsCode0
Evaluating AI Recruitment Sourcing Tools by Human PreferenceCode0
EvalAI: Towards Better Evaluation Systems for AI AgentsCode0
Essential guidelines for computational method benchmarkingCode0
Benchmarking of LSTM NetworksCode0
NerveNet: Learning Structured Policy with Graph Neural NetworksCode0
How Fragile is Relation Extraction under Entity Replacements?Code0
Benchmarking Network Embedding Models for Link Prediction: Are We Making Progress?Code0
Sequence-Aware Recommender SystemsCode0
WCEbleedGen: A wireless capsule endoscopy dataset and its benchmarking for automatic bleeding classification, detection, and segmentationCode0
Enterprise Benchmarks for Large Language Model EvaluationCode0
Enriching Social Science Research via Survey Item LinkingCode0
Sequential Large Language Model-Based Hyper-parameter OptimizationCode0
Neural Network Design: Learning from Neural Architecture SearchCode0
Benchmarking of image registration methods for differently stained histological slidesCode0
BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed GraphsCode0
Enhancing Video Summarization with Context AwarenessCode0
Enhancing Treatment Effect Estimation via Active Learning: A Counterfactual Covering PerspectiveCode0
Benchmarking Neural Machine Translation for Southern African LanguagesCode0
Enhancing Hyper-To-Real Space Projections Through Euclidean Norm Meta-Heuristic OptimizationCode0
Enhancing Biomedical Knowledge Discovery for Diseases: An Open-Source Framework Applied on Rett Syndrome and Alzheimer's DiseaseCode0
Show:102550
← PrevPage 200 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified