SOTAVerified

Benchmarking

Papers

Showing 29512975 of 5548 papers

TitleStatusHype
Hybrid Precoder and Combiner Designs for Decentralized Parameter Estimation in mmWave MIMO Wireless Sensor Networks0
Hybrid Quantum Computing -- Tabu Search Algorithm for Partitioning Problems: preliminary study on the Traveling Salesman Problem0
The Interactive Effects of Operators and Parameters to GA Performance Under Different Problem Sizes0
Hybrid Transceiver Design for Tera-Hertz MIMO Systems Relying on Bayesian Learning Aided Sparse Channel Estimation0
Hydra: Marker-Free RGB-D Hand-Eye Calibration0
Hydrological time series forecasting using simple combinations: Big data testing and investigations on one-year ahead river flow predictability0
Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions0
Hyperbolic Anomaly Detection0
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning0
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere0
Hypergraph Neural Networks through the Lens of Message Passing: A Common Perspective to Homophily and Architecture Design0
The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine0
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out0
Benchmarking the Sim-to-Real Gap in Cloth Manipulation0
Hyperparameter optimization, quantum-assisted model performance prediction, and benchmarking of AI-based High Energy Physics workloads using HPC0
Hyperspectral Anomaly Detection Methods: A Survey and Comparative Study0
v-SVR Polynomial Kernel for Predicting the Defect Density in New Software Projects0
Benchmarking the Robustness of Semantic Segmentation Models0
The Karp Dataset0
HySpecNet-11k: A Large-Scale Hyperspectral Dataset for Benchmarking Learning-Based Hyperspectral Image Compression Methods0
Benchmarking the Robustness of Quantized Models0
Vulnerability of Face Morphing Attacks: A Case Study on Lookalike and Identical Twins0
Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference0
ICE-ID: A Novel Historical Census Data Benchmark Comparing NARS against LLMs, \& a ML Ensemble on Longitudinal Identity Resolution0
ICON^2: Reliably Benchmarking Predictive Inequity in Object Detection0
Show:102550
← PrevPage 119 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified