SOTAVerified

Benchmarking

Papers

Showing 53515400 of 5548 papers

TitleStatusHype
Evaluation of simulation methods for tumor subclonal reconstruction0
Evaluation of Three Welsh Language POS Taggers0
TARGO: Benchmarking Target-driven Object Grasping under Occlusions0
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation0
EventAid: Benchmarking Event-aided Image/Video Enhancement Algorithms with Real-captured Hybrid Dataset0
Event-based Continuous Color Video Decompression from Single Frames0
Event-based Feature Extraction Using Adaptive Selection Thresholds0
CausalRivers -- Scaling up benchmarking of causal discovery for real-world time-series0
Event Camera Simulator Design for Modeling Attention-based Inference Architectures0
Causal Reasoning Meets Visual Representation Learning: A Prospective Study0
Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors0
Eventprop training for efficient neuromorphic applications0
EvEntS ReaLM: Event Reasoning of Entity States via Language Models0
Evetac: An Event-based Optical Tactile Sensor for Robotic Manipulation0
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect0
Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking0
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages0
A Large-Scale Evaluation of Speech Foundation Models0
Categorization of 33 computational methods to detect spatially variable genes from spatially resolved transcriptomics data0
Evolutionary Multimodal Optimization: A Short Survey0
Evolving Evolutionary Algorithms using Linear Genetic Programming0
A Large-scale Evaluation of Pretraining Paradigms for the Detection of Defects in Electroluminescence Solar Cell Images0
Evolving Hard Maximum Cut Instances for Quantum Approximate Optimization Algorithms0
EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data0
CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans0
Exact lattice-based stochastic cell culture simulation algorithms incorporating spontaneous and contact-dependent reactions0
Exact Mean Computation in Dynamic Time Warping Spaces0
EXACT: Towards a platform for empirically benchmarking Machine Learning model explanation methods0
Examining convolutional feature extraction using Maximum Entropy (ME) and Signal-to-Noise Ratio (SNR) for image classification0
CATBench: A Compiler Autotuning Benchmarking Suite for Black-box Optimization0
Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection0
Task-oriented Over-the-air Computation for Edge-device Co-inference with Balanced Classification Accuracy0
A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs0
Cash versus Kind: Benchmarking a Child Nutrition Program against Unconditional Cash Transfers in Rwanda0
A Large-scale Benchmark on Geological Fault Delineation Models: Domain Shift, Training Dynamics, Generalizability, Evaluation and Inferential Behavior0
TBD: Benchmarking and Analyzing Deep Neural Network Training0
Experimental Benchmarking of Energy-saving Sub-Optimal Sliding Mode Control0
Experimental robustness benchmark of quantum neural network on a superconducting quantum processor0
Cascaded two-stage feature clustering and selection via separability and consistency in fuzzy decision systems0
Experimenting with robotic intra-logistics domains0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists0
Explainable AI using expressive Boolean formulas0
CardioTabNet: A Novel Hybrid Transformer Model for Heart Disease Prediction using Tabular Medical Data0
Capsule Neural Networks for Graph Classification using Explicit Tensorial Graph Representations0
Explainable Rumor Detection using Inter and Intra-feature Attention Networks0
Explaining Unreliable Perception in Automated Driving: A Fuzzy-based Monitoring Approach0
Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization0
Exploitation-Guided Exploration for Semantic Embodied Navigation0
Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks0
Exploiting Database Management Systems and Treewidth for Counting0
Show:102550
← PrevPage 108 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified