SOTAVerified

Benchmarking

Papers

Showing 4601-4650 of 5548 papers

Title | Status | Hype
Application of DEA in International Market Selection for the export of products from Spain | - | 0
ACI-BENCH: a Novel Ambient Clinical Intelligence Dataset for Benchmarking Automatic Visit Note Generation | - | 0
Application Inference using Machine Learning based Side Channel Analysis | - | 0
Score-Based Generative Models for Molecule Generation | - | 0
SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization | - | 0
Application based Evaluation of an Efficient Spike-Encoder, "Spiketrum" | - | 0
Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design | - | 0
Apples to Apples: Learning Semantics of Common Entities Through a Novel Comprehension Task | - | 0
SDFR: Synthetic Data for Face Recognition Competition | - | 0
A Platform for Event Extraction in Hindi | - | 0
Uncertainty in GNN Learning Evaluations: The Importance of a Consistent Benchmark for Community Detection | - | 0
A Pipeline for Post-Crisis Twitter Data Acquisition | - | 0
A Perspective on Neural Capacity Estimation: Viability and Reliability | - | 0
SE Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering | - | 0
SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification | - | 0
SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification | - | 0
SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity | - | 0
A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages | - | 0
AnyTOD: A Programmable Task-Oriented Dialog System | - | 0
SecRepoBench: Benchmarking LLMs for Secure Code Generation in Real-World Repositories | - | 0
Anytime Bi-Objective Optimization with a Hybrid Multi-Objective CMA-ES (HMO-CMA-ES) | - | 0
Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption | - | 0
Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions | - | 0
Anytime Behavior of Inexact TSP Solvers and Perspectives for Automated Algorithm Selection | - | 0
Any Image Restoration via Efficient Spatial-Frequency Degradation Adaptation | - | 0
AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation | - | 0
Ansatz-free Hamiltonian learning with Heisenberg-limited scaling | - | 0
Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset | - | 0
A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation | - | 0
Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction | - | 0
A Novel Hybrid Ordinal Learning Model with Health Care Application | - | 0
Validation of neural spike sorting algorithms without ground-truth information | - | 0
Segmenting Maxillofacial Structures in CBCT Volumes | - | 0
Segment Together: A Versatile Paradigm for Semi-Supervised Medical Image Segmentation | - | 0
SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios | - | 0
Selecting Differential Splicing Methods: Practical Considerations | - | 0
A novel machine learning based framework for detection of Autism Spectrum Disorder (ASD) | - | 0
Selective Shot Learning for Code Explanation | - | 0
Value-at-Risk-Based Portfolio Insurance: Performance Evaluation and Benchmarking Against CPPI in a Markov-Modulated Regime-Switching Market | - | 0
A novel database of Children's Spontaneous Facial Expressions (LIRIS-CSE) | - | 0
Self-supervised Benchmark Lottery on ImageNet: Do Marginal Improvements Translate to Improvements on Similar Datasets? | - | 0
Self-Supervised Speech Representation Learning: A Review | - | 0
A Novel Cluster Detection of COVID-19 Patients and Medical Disease Conditions Using Improved Evolutionary Clustering Algorithm Star | - | 0
A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models | - | 0
Semantic Segmentation using Vision Transformers: A survey | - | 0
SemanticST: Spatially Informed Semantic Graph Learning for Clustering, Integration, and Scalable Analysis of Spatial Transcriptomics | - | 0
Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network | - | 0
Semi-implicit Continuous Newton Method for Power Flow Analysis | - | 0
A Novel Benchmarking Paradigm and a Scale- and Motion-Aware Model for Egocentric Pedestrian Trajectory Prediction | - | 0
Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified