SOTAVerified

Benchmarking

Papers

Showing 3351-3375 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| BdSLW60: A Word-Level Bangla Sign Language Dataset | Code | 0 |
| Impact of spatial transformations on landscape features of CEC2022 basic benchmark problems | | 0 |
| Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT | | 0 |
| EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages | | 0 |
| Can Tree Based Approaches Surpass Deep Learning in Anomaly Detection? A Benchmarking Study | Code | 0 |
| Estimating the Effect of Crosstalk Error on Circuit Fidelity Using Noisy Intermediate-Scale Quantum Devices | | 0 |
| ProtIR: Iterative Refinement between Retrievers and Predictors for Protein Function Annotation | | 0 |
| Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation | | 0 |
| A Functional Analysis Approach to Symbolic Regression | | 0 |
| LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education | | 0 |
| Efficient Expression Neutrality Estimation with Application to Face Recognition Utility Prediction | | 0 |
| Transparent and Scrutable Recommendations Using Natural Language User Profiles | Code | 0 |
| Benchmarking Large Language Models on Communicative Medical Coaching: a Novel System and Dataset | Code | 0 |
| Towards Biologically Plausible and Private Gene Expression Data Generation | Code | 0 |
| BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception | Code | 0 |
| AttackNet: Enhancing Biometric Security via Tailored Convolutional Neural Network Architectures for Liveness Detection | Code | 0 |
| Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification | | 0 |
| Quantitative Metrics for Benchmarking Medical Image Harmonization | | 0 |
| PowerGraph: A power grid benchmark dataset for graph neural networks | | 0 |
| Architecture Analysis and Benchmarking of 3D U-shaped Deep Learning Models for Thoracic Anatomical Segmentation | Code | 0 |
| Vi(E)va LLM! A Conceptual Stack for Evaluating and Interpreting Generative AI-based Visualizations | Code | 0 |
| Probing Critical Learning Dynamics of PLMs for Hate Speech Detection | Code | 0 |
| Can LLMs perform structured graph reasoning? | Code | 0 |
| Variational Quantum Circuits Enhanced Generative Adversarial Network | | 0 |
| Benchmarking Spiking Neural Network Learning Methods with Varying Locality | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |