SOTAVerified

Benchmarking

Papers

Showing 27012725 of 5548 papers

TitleStatusHype
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition0
Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling0
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL0
Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types0
Benchmarking five global optimization approaches for nano-optical shape optimization and parameter reconstruction0
A Platform for Event Extraction in Hindi0
Adversarial Reinforcement Learning Framework for Benchmarking Collision Avoidance Mechanisms in Autonomous Vehicles0
Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework0
Multi-scale data reconstruction of turbulent rotating flows with Gappy POD, Extended POD and Generative Adversarial Networks0
Data needs and challenges for quantum dot devices automation0
Benchmarking federated strategies in Peer-to-Peer Federated learning for biomedical data0
Data-Driven Target Localization: Benchmarking Gradient Descent Using the Cramer-Rao Bound0
Data-driven surrogate modelling and benchmarking for process equipment0
Data-driven Power Flow Linearization: Simulation0
Benchmarking Federated Machine Unlearning methods for Tabular Data0
A Pipeline for Post-Crisis Twitter Data Acquisition0
Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning0
Benchmarking FedAvg and FedCurv for Image Classification Tasks0
Data-driven Approach for Static Hedging of Exchange Traded Options0
Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset0
A Perspective on Neural Capacity Estimation: Viability and Reliability0
Data Augmentation for Traffic Classification0
Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory0
Benchmarking features from different radiomics toolkits / toolboxes using Image Biomarkers Standardization Initiative0
Data and its (dis)contents: A survey of dataset development and use in machine learning research0
Show:102550
← PrevPage 109 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified