SOTAVerified

Benchmarking

Papers

Showing 4126–4150 of 5548 papers

Title | Status | Hype
The Interactive Effects of Operators and Parameters to GA Performance Under Different Problem Sizes | | 0
The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | | 0
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out | | 0
The Karp Dataset | | 0
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs | | 0
The Leaderboard Illusion | | 0
The Liouville Generator for Producing Integrable Expressions | | 0
The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies | | 0
The Moral Mind(s) of Large Language Models | | 0
The Multi-speaker Multi-style Voice Cloning Challenge 2021 | | 0
The Neural Painter: Multi-Turn Image Generation | | 0
The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects | | 0
Theory of Mind in Large Language Models: Examining Performance of 11 State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests | | 0
The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | | 0
The Paradox of Success in Evolutionary and Bioinspired Optimization: Revisiting Critical Issues, Key Studies, and Methodological Pathways | | 0
The ParClusterers Benchmark Suite (PCBS): A Fine-Grained Analysis of Scalable Graph Clustering | | 0
The Partial Response Network: a neural network nomogram | | 0
The Pitfalls of Benchmarking in Algorithm Selection: What We Are Getting Wrong | | 0
The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design | | 0
Thermal Image-based Fault Diagnosis in Induction Machines via Self-Organized Operational Neural Networks | | 0
The Role of Local Intrinsic Dimensionality in Benchmarking Nearest Neighbor Search | | 0
The Russian practice of applying cluster approach in regional development | | 0
The Seeker's Dilemma: Realistic Formulation and Benchmarking for Hardware Trojan Detection | | 0
The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks | | 0
The Trap of Presumed Equivalence: Artificial General Intelligence Should Not Be Assessed on the Scale of Human Intelligence | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified