SOTAVerified

Benchmarking

Papers

Showing 45014525 of 5548 papers

TitleStatusHype
Measuring CLEVRness: Black-box Testing of Visual Reasoning Models0
Benchmarking Sample Selection Strategies for Batch Reinforcement Learning0
Benchmarking Algorithms from Machine Learning for Low-Budget Black-Box Optimization0
Stabilized Self-training with Negative Sampling on Few-labeled Graph Data0
Learning to Schedule Learning rate with Graph Neural Networks0
A Systematic Evaluation of Domain Adaptation Algorithms On Time Series Data0
Imitation Learning from Pixel Observations for Continuous Control0
Extensible Logging and Empirical Attainment Function for IOHexperimenter0
Context-guided Triple Matching for Multiple Choice Question Answering0
Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation0
Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning0
Benchmarking Augmentation Methods for Learning Robust Navigation Agents: the Winning Entry of the 2021 iGibson Challenge0
Efficiently solving the thief orienteering problem with a max-min ant colony optimization approachCode0
A Novel Cluster Detection of COVID-19 Patients and Medical Disease Conditions Using Improved Evolutionary Clustering Algorithm Star0
Hybrid Transceiver Design for Tera-Hertz MIMO Systems Relying on Bayesian Learning Aided Sparse Channel Estimation0
WiSoSuper: Benchmarking Super-Resolution Methods on Wind and Solar Data0
Messing Up 3D Virtual Environments: Transferable Adversarial 3D ObjectsCode0
DiS-ReX: A Multilingual Dataset for Distantly Supervised Relation Extraction0
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics0
Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical OptimizationCode0
A Survey on Temporal Sentence Grounding in Videos0
A Continuous Optimisation Benchmark Suite from Neural Network RegressionCode0
Benchmarking Processor Performance by Multi-Threaded Machine Learning Algorithms0
Application of DEA in International Market Selection for the export of products from Spain0
A framework for benchmarking uncertainty in deep regression0
Show:102550
← PrevPage 181 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified