SOTAVerified

Benchmarking

Papers

Showing 54015425 of 5548 papers

TitleStatusHype
Probing Acoustic Representations for Phonetic PropertiesCode0
Probing Conceptual Understanding of Large Visual-Language ModelsCode0
Probing Critical Learning Dynamics of PLMs for Hate Speech DetectionCode0
Using Color To Identify Insider ThreatsCode0
An Exploration of Exploration: Measuring the ability of lexicase selection to find obscure pathways to optimalityCode0
Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance AnalysisCode0
Transfer Learning between Motor Imagery Datasets using Deep Learning -- Validation of Framework and Comparison of DatasetsCode0
Synth4bench: a framework for generating synthetic genomics data for the evaluation of tumor-only somatic variant calling algorithmsCode0
Process Extraction from Text: Benchmarking the State of the Art and Paving the Way for Future ChallengesCode0
Transfer Learning for Prosthetics Using Imitation LearningCode0
Benchmarking datasets for Anomaly-based Network Intrusion Detection: KDD CUP 99 alternativesCode0
Synthetic Datasets for Machine Learning on Spatio-Temporal Graphs using PDEsCode0
Synthetic location trajectory generation using categorical diffusion modelsCode0
Synthetic Porous Microstructures: Automatic Design, Simulation, and Permeability AnalysisCode0
Synthetic Time Series Forecasting with Transformer Architectures: Extensive Simulation BenchmarksCode0
An Experimental Study of the Transferability of Spectral Graph NetworksCode0
Comparison Performance of Spectrogram and Scalogram as Input of Acoustic Recognition TaskCode0
Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science CommunicatorsCode0
Benchmarking Data Heterogeneity Evaluation Approaches for Personalized Federated LearningCode0
Comparing Machine Learning Algorithms by Union-Free Generic DepthCode0
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckListsCode0
Transformation-Interaction-Rational Representation for Symbolic RegressionCode0
Towards Enhancing Fault Tolerance in Neural NetworksCode0
Robust Model-Based Optimization for Challenging Fitness LandscapesCode0
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual NavigationCode0
Show:102550
← PrevPage 217 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified