SOTAVerified

Benchmarking

Papers

Showing 3551–3600 of 5548 papers

Title | Status | Hype
Benchmarking AutoML algorithms on a collection of synthetic classification problems | Code | 0
INCLUSIFY: A benchmark and a model for gender-inclusive German | - | 0
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation | - | 0
DFEE: Interactive DataFlow Execution and Evaluation Kit | Code | 0
Towards Scene Understanding for Autonomous Operations on Airport Aprons | Code | 1
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning | Code | 1
Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery | - | 0
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking | - | 0
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search | Code | 0
Geoclidean: Few-Shot Generalization in Euclidean Geometry | Code | 1
BBOB Instance Analysis: Landscape Properties and Algorithm Performance across Problem Instances | - | 0
Device Modeling Bias in ReRAM-based Neural Network Simulations | - | 0
AdsorbML: A Leap in Efficiency for Adsorption Energy Calculations using Generalizable Machine Learning Potentials | Code | 1
Why do tree-based models still outperform deep learning on typical tabular data? | Code | 2
Predicting Football Match Outcomes with eXplainable Machine Learning and the Kelly Index | - | 0
A Boosting Approach to Constructing an Ensemble Stack | - | 0
Benchmarking simulated and physical quantum processing units using quantum and hybrid algorithms | - | 0
A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification | Code | 1
Tackling Visual Control via Multi-View Exploration Maximization | - | 0
Efficient Demand Response Location Targeting for Price Spike Mitigation by Exploiting Price-demand Relationship | - | 0
Multi-Mask Aggregators for Graph Neural Networks | Code | 1
ScanNeRF: a Scalable Benchmark for Neural Radiance Fields | - | 0
Immersive Neural Graphics Primitives | Code | 2
SnCQA: A hardware-efficient equivariant quantum convolutional circuit architecture | - | 0
Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks | - | 0
fseval: A Benchmarking Framework for Feature Selection and Feature Ranking Algorithms | Code | 1
Benchmarking Adversarially Robust Quantum Machine Learning at Scale | - | 0
FAIRification of MLC data | - | 0
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish | Code | 1
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition | - | 0
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields | - | 0
Challenges and perspectives in computational deconvolution of genomics data | - | 0
Benchmarking Edge Computing Devices for Grape Bunches and Trunks Detection using Accelerated Object Detection Single Shot MultiBox Deep Learning Models | - | 0
L3Cube-MahaSBERT and HindSBERT: Sentence BERT Models and Benchmarking BERT Sentence Representations for Hindi and Marathi | - | 0
OPTION: OPTImization Algorithm Benchmarking ONtology | - | 0
Estimating Task Completion Times for Network Rollouts using Statistical Models within Partitioning-based Regression Methods | - | 0
LidarGait: Benchmarking 3D Gait Recognition with Point Clouds | - | 0
CryptOpt: Verified Compilation with Randomized Program Search for Cryptographic Primitives (full version) | Code | 1
PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation with Deep Reinforcement Learning | Code | 1
SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification | - | 0
DSLOB: A Synthetic Limit Order Book Dataset for Benchmarking Forecasting Algorithms under Distributional Shift | - | 0
Optimal Design of Volt/VAR Control Rules of Inverters using Deep Learning | - | 0
Near-Term Quantum Computing Techniques: Variational Quantum Algorithms, Error Mitigation, Circuit Compilation, Benchmarking and Classical Simulation | - | 0
Benchmarking Graph Neural Networks for FMRI analysis | Code | 1
A Review of Intelligent Music Generation Systems | - | 0
Deep Emotion Recognition in Textual Conversations: A Survey | Code | 0
Joint Linear Precoding and DFT Beamforming Design for Massive MIMO Satellite Communication | - | 0
Harmonization Benchmarking Tool for Neuroimaging Datasets | Code | 0
Perona: Robust Infrastructure Fingerprinting for Resource-Efficient Big Data Analytics | - | 0
Dealing with missing data using attention and latent space regularization | Code | 0
Page 72 of 111

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified