SOTAVerified

Benchmarking

Papers

Showing 15011550 of 5548 papers

TitleStatusHype
Benchmarking GPUs on SVBRDF Extractor Model0
Benchmarking GPU and TPU Performance with Graph Neural Networks0
Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset0
Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies0
Approaches for benchmarking single-cell gene regulatory network inference methods0
Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models0
Benchmarking GNNs Using Lightning Network Data0
Benchmarking global optimization techniques for unmanned aerial vehicle path planning0
Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data0
Data-driven Approach for Static Hedging of Exchange Traded Options0
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming0
Applications in CityLearn Gym Environment for Multi-Objective Control Benchmarking in Grid-Interactive Buildings and Districts0
AEON: Adaptive Estimation of Instance-Dependent In-Distribution and Out-of-Distribution Label Noise for Robust Learning0
Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory0
Benchmarking Generative AI for Scoring Medical Student Interviews in Objective Structured Clinical Examinations (OSCEs)0
Application of Machine Learning for Online Reputation Systems0
Benchmarking General-Purpose In-Context Learning0
Application of DEA in International Market Selection for the export of products from Spain0
Data Augmentation for Traffic Classification0
Application Inference using Machine Learning based Side Channel Analysis0
DarkBench: Benchmarking Dark Patterns in Large Language Models0
Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech0
Application based Evaluation of an Efficient Spike-Encoder, "Spiketrum"0
DASB -- Discrete Audio and Speech Benchmark0
Benchmarking Foundation Models with Language-Model-as-an-Examiner0
Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design0
Apples to Apples: Learning Semantics of Common Entities Through a Novel Comprehension Task0
Benchmarking Foundation Models for Zero-Shot Biometric Tasks0
Benchmarking foundation models as feature extractors for weakly-supervised computational pathology0
Advocating Character Error Rate for Multilingual ASR Evaluation0
Data Analysis in the Era of Generative AI0
Benchmarking for Public Health Surveillance tasks on Social Media with a Domain-Specific Pretrained Language Model0
Benchmarking for Metaheuristic Black-Box Optimization: Perspectives and Open Challenges0
Adversarial Reinforcement Learning Framework for Benchmarking Collision Avoidance Mechanisms in Autonomous Vehicles0
Benchmarking for Bayesian Reinforcement Learning0
Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling0
A Platform for Event Extraction in Hindi0
Ensuring Reliability of Curated EHR-Derived Data: The Validation of Accuracy for LLM/ML-Extracted Information and Data (VALID) Framework0
Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types0
Benchmarking five global optimization approaches for nano-optical shape optimization and parameter reconstruction0
DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes0
Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization0
Data and its (dis)contents: A survey of dataset development and use in machine learning research0
Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning0
Benchmarking federated strategies in Peer-to-Peer Federated learning for biomedical data0
Benchmarking Federated Machine Unlearning methods for Tabular Data0
A Pipeline for Post-Crisis Twitter Data Acquisition0
Benchmarking FedAvg and FedCurv for Image Classification Tasks0
A Perspective on Neural Capacity Estimation: Viability and Reliability0
Accelerating the discovery of steady-states of planetary interior dynamics with machine learning0
Show:102550
← PrevPage 31 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified