SOTAVerified

Benchmarking

Papers

Showing 46264650 of 5548 papers

TitleStatusHype
Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?Code0
Global Wheat Head Dataset 2021: more diversity to improve the benchmarking of wheat head localization methods0
Quantifying the Impact of Boundary Constraint Handling Methods on Differential Evolution0
Sanity Simulations for Saliency MethodsCode0
Benchmarking down-scaled (not so large) pre-trained language modelsCode0
Towards Benchmarking the Utility of Explanations for Model Debugging0
CREPO: An Open Repository to Benchmark Credal Network AlgorithmsCode0
Examining convolutional feature extraction using Maximum Entropy (ME) and Signal-to-Noise Ratio (SNR) for image classification0
Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior0
MS MARCO: Benchmarking Ranking Models in the Large-Data Regime0
Covariance Matrix Adaptation Evolution Strategy Assisted by Principal Component Analysis0
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect0
Building and benchmarking an Arabic Speech Commands dataset for small-footprint keyword spottingCode0
PathBench: A Benchmarking Platform for Classical and Learned Path Planning Algorithms0
Event Camera Simulator Design for Modeling Attention-based Inference Architectures0
A Complementarity Analysis of the COCO Benchmark Problems and Artificially Generated Problems0
OPTION: OPTImization Algorithm Benchmarking ONtology0
Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages0
Measuring what Really Matters: Optimizing Neural Networks for TinyMLCode0
Model-predictive control and reinforcement learning in multi-energy system case studies0
Benchmarking the Benchmark -- Analysis of Synthetic NIDS Datasets0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing TasksCode0
The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech0
On the Assessment of Benchmark Suites for Algorithm Comparison0
Jointly Modeling and Clustering Tensors in High Dimensions0
Show:102550
← PrevPage 186 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified