SOTAVerified

Benchmarking

Papers

Showing 29763000 of 5548 papers

TitleStatusHype
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving0
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs0
Identifiable Convex-Concave Regression via Sub-gradient Regularised Least Squares0
Identification of vortex in unstructured mesh with graph neural networks0
The Leaderboard Illusion0
XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems0
Identifying patterns and recommendations of and for sustainable open data initiatives: a benchmarking-driven analysis of open government data initiatives among European countries0
Identifying the Context Shift between Test Benchmarks and Production Data0
The Liouville Generator for Producing Integrable Expressions0
Benchmarking the Robustness of Instance Segmentation Models0
Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance0
IEA: Inner Ensemble Average within a convolutional neural network0
Benchmarking the rationality of AI decision making using the transitivity axiom0
A Gap in Time: The Challenge of Processing Heterogeneous IoT Data in Digitalized Buildings0
Exploring the Decentraland Economy: Multifaceted Parcel Attributes, Key Insights, and Benchmarking0
A2Perf: Real-World Autonomous Agents Benchmark0
Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection0
Benchmarking the Neural Linear Model for Regression0
The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies0
From Attack to Protection: Leveraging Watermarking Attack Network for Advanced Add-on Watermarking0
Image2Struct: Benchmarking Structure Extraction for Vision-Language Models0
Image-Based Benchmarking and Visualization for Large-Scale Global Optimization0
Benchmarking the Impact of Noise on Deep Learning-based Classification of Atrial Fibrillation in 12-Lead ECG0
Benchmarking the human brain against computational architectures0
Image Matching: An Application-oriented Benchmark0
Show:102550
← PrevPage 120 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified