SOTAVerified

Benchmarking

Papers

Showing 21262150 of 5548 papers

TitleStatusHype
Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering0
Benchmarking Adversarial Robustness of Image Shadow Removal with Shadow-adaptive Attacks0
Analyzing Hong Kong's Legal Judgments from a Computational Linguistics point-of-view0
Benchmarking Adversarial Robustness of Compressed Deep Learning Models0
Business as Rulesual: A Benchmark and Framework for Business Rule Flow Modeling with LLMs0
A Benchmarking Protocol for Pansharpening: Dataset, Preprocessing, and Quality Assessment0
Benchmarking Adversarial Robustness0
Experimenting with robotic intra-logistics domains0
Building benchmarking frameworks for supporting replicability and reproducibility: spatial and textual analysis as an example0
Experimental robustness benchmark of quantum neural network on a superconducting quantum processor0
Benchmarking Adversarially Robust Quantum Machine Learning at Scale0
Analysis of modular CMA-ES on strict box-constrained problems in the SBOX-COST benchmarking suite0
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists0
Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (NLP)0
Benchmarking adversarial attacks and defenses for time-series data0
Analysis of different disparity estimation techniques on aerial stereo image datasets0
Building a De-identification System for Real Swedish Clinical Text Using Pseudonymised Clinical Text0
Building a continuous benchmarking ecosystem in bioinformatics0
Benchmarking Advanced Text Anonymisation Methods: A Comparative Study on Novel and Traditional Approaches0
Benchmarking Adaptive Intelligence and Computer Vision on Human-Robot Collaboration0
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer0
AT-Drone: Benchmarking Adaptive Teaming in Multi-Drone Pursuit0
BuckTales : A multi-UAV dataset for multi-object tracking and re-identification of wild antelopes0
Benchmarking Adaptative Variational Quantum Algorithms on QUBO Instances0
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark0
Show:102550
← PrevPage 86 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified