SOTAVerified

Benchmarking

Papers

Showing 29513000 of 5548 papers

TitleStatusHype
Hybrid Precoder and Combiner Designs for Decentralized Parameter Estimation in mmWave MIMO Wireless Sensor Networks0
Hybrid Quantum Computing -- Tabu Search Algorithm for Partitioning Problems: preliminary study on the Traveling Salesman Problem0
The Interactive Effects of Operators and Parameters to GA Performance Under Different Problem Sizes0
Hybrid Transceiver Design for Tera-Hertz MIMO Systems Relying on Bayesian Learning Aided Sparse Channel Estimation0
Hydra: Marker-Free RGB-D Hand-Eye Calibration0
Hydrological time series forecasting using simple combinations: Big data testing and investigations on one-year ahead river flow predictability0
Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions0
Hyperbolic Anomaly Detection0
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning0
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere0
Hypergraph Neural Networks through the Lens of Message Passing: A Common Perspective to Homophily and Architecture Design0
The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine0
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out0
Benchmarking the Sim-to-Real Gap in Cloth Manipulation0
Hyperparameter optimization, quantum-assisted model performance prediction, and benchmarking of AI-based High Energy Physics workloads using HPC0
Hyperspectral Anomaly Detection Methods: A Survey and Comparative Study0
v-SVR Polynomial Kernel for Predicting the Defect Density in New Software Projects0
Benchmarking the Robustness of Semantic Segmentation Models0
The Karp Dataset0
HySpecNet-11k: A Large-Scale Hyperspectral Dataset for Benchmarking Learning-Based Hyperspectral Image Compression Methods0
Benchmarking the Robustness of Quantized Models0
Vulnerability of Face Morphing Attacks: A Case Study on Lookalike and Identical Twins0
Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference0
ICE-ID: A Novel Historical Census Data Benchmark Comparing NARS against LLMs, \& a ML Ensemble on Longitudinal Identity Resolution0
ICON^2: Reliably Benchmarking Predictive Inequity in Object Detection0
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving0
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs0
Identifiable Convex-Concave Regression via Sub-gradient Regularised Least Squares0
Identification of vortex in unstructured mesh with graph neural networks0
The Leaderboard Illusion0
XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems0
Identifying patterns and recommendations of and for sustainable open data initiatives: a benchmarking-driven analysis of open government data initiatives among European countries0
Identifying the Context Shift between Test Benchmarks and Production Data0
The Liouville Generator for Producing Integrable Expressions0
Benchmarking the Robustness of Instance Segmentation Models0
Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance0
IEA: Inner Ensemble Average within a convolutional neural network0
Benchmarking the rationality of AI decision making using the transitivity axiom0
A Gap in Time: The Challenge of Processing Heterogeneous IoT Data in Digitalized Buildings0
Exploring the Decentraland Economy: Multifaceted Parcel Attributes, Key Insights, and Benchmarking0
A2Perf: Real-World Autonomous Agents Benchmark0
Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection0
Benchmarking the Neural Linear Model for Regression0
The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies0
From Attack to Protection: Leveraging Watermarking Attack Network for Advanced Add-on Watermarking0
Image2Struct: Benchmarking Structure Extraction for Vision-Language Models0
Image-Based Benchmarking and Visualization for Large-Scale Global Optimization0
Benchmarking the Impact of Noise on Deep Learning-based Classification of Atrial Fibrillation in 12-Lead ECG0
Benchmarking the human brain against computational architectures0
Image Matching: An Application-oriented Benchmark0
Show:102550
← PrevPage 60 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified