SOTAVerified

Benchmarking

Papers

Showing 3076-3100 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking | | 0 |
| EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages | | 0 |
| Evolutionary Multimodal Optimization: A Short Survey | | 0 |
| Evolving Evolutionary Algorithms using Linear Genetic Programming | | 0 |
| Evolving Hard Maximum Cut Instances for Quantum Approximate Optimization Algorithms | | 0 |
| EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data | | 0 |
| Exact lattice-based stochastic cell culture simulation algorithms incorporating spontaneous and contact-dependent reactions | | 0 |
| Exact Mean Computation in Dynamic Time Warping Spaces | | 0 |
| EXACT: Towards a platform for empirically benchmarking Machine Learning model explanation methods | | 0 |
| Examining convolutional feature extraction using Maximum Entropy (ME) and Signal-to-Noise Ratio (SNR) for image classification | | 0 |
| Experimental Benchmarking of Energy-saving Sub-Optimal Sliding Mode Control | | 0 |
| Experimental robustness benchmark of quantum neural network on a superconducting quantum processor | | 0 |
| Experimenting with robotic intra-logistics domains | | 0 |
| ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists | | 0 |
| Explainable AI using expressive Boolean formulas | | 0 |
| Explainable Rumor Detection using Inter and Intra-feature Attention Networks | | 0 |
| Explaining Unreliable Perception in Automated Driving: A Fuzzy-based Monitoring Approach | | 0 |
| Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization | | 0 |
| Exploitation-Guided Exploration for Semantic Embodied Navigation | | 0 |
| Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks | | 0 |
| Exploiting Database Management Systems and Treewidth for Counting | | 0 |
| Exploration of TPUs for AI Applications | | 0 |
| Exploring and Benchmarking the Planning Capabilities of Large Language Models | | 0 |
| Exploring Capabilities of Time Series Foundation Models in Building Analytics | | 0 |
| Exploring Continual Learning of Diffusion Models | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |