SOTAVerified

Benchmarking

Papers

Showing 4501–4550 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Measuring CLEVRness: Black-box Testing of Visual Reasoning Models | | 0 |
| Benchmarking Sample Selection Strategies for Batch Reinforcement Learning | | 0 |
| Benchmarking Algorithms from Machine Learning for Low-Budget Black-Box Optimization | | 0 |
| Stabilized Self-training with Negative Sampling on Few-labeled Graph Data | | 0 |
| Learning to Schedule Learning rate with Graph Neural Networks | | 0 |
| A Systematic Evaluation of Domain Adaptation Algorithms On Time Series Data | | 0 |
| Imitation Learning from Pixel Observations for Continuous Control | | 0 |
| Extensible Logging and Empirical Attainment Function for IOHexperimenter | | 0 |
| Context-guided Triple Matching for Multiple Choice Question Answering | | 0 |
| Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation | | 0 |
| Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning | | 0 |
| Benchmarking Augmentation Methods for Learning Robust Navigation Agents: the Winning Entry of the 2021 iGibson Challenge | | 0 |
| Efficiently solving the thief orienteering problem with a max-min ant colony optimization approach | Code | 0 |
| A Novel Cluster Detection of COVID-19 Patients and Medical Disease Conditions Using Improved Evolutionary Clustering Algorithm Star | | 0 |
| Hybrid Transceiver Design for Tera-Hertz MIMO Systems Relying on Bayesian Learning Aided Sparse Channel Estimation | | 0 |
| WiSoSuper: Benchmarking Super-Resolution Methods on Wind and Solar Data | | 0 |
| Messing Up 3D Virtual Environments: Transferable Adversarial 3D Objects | Code | 0 |
| DiS-ReX: A Multilingual Dataset for Distantly Supervised Relation Extraction | | 0 |
| Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics | | 0 |
| Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical Optimization | Code | 0 |
| A Survey on Temporal Sentence Grounding in Videos | | 0 |
| A Continuous Optimisation Benchmark Suite from Neural Network Regression | Code | 0 |
| Benchmarking Processor Performance by Multi-Threaded Machine Learning Algorithms | | 0 |
| Application of DEA in International Market Selection for the export of products from Spain | | 0 |
| A framework for benchmarking uncertainty in deep regression | | 0 |
| Characterization of Constrained Continuous Multiobjective Optimization Problems: A Feature Space Perspective | | 0 |
| CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization | | 0 |
| Towards Efficient Synchronous Federated Training: A Survey on System Optimization Strategies | Code | 0 |
| Resistive Neural Hardware Accelerators | | 0 |
| Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene | Code | 0 |
| Benchmarking the Robustness of Instance Segmentation Models | | 0 |
| Towards Sentiment Analysis of Tobacco Products’ Usage in Social Media | | 0 |
| Benchmarking down-scaled (not so large) pre-trained language models | Code | 0 |
| Cross-Lingual Text Classification of Transliterated Hindi and Malayalam | Code | 0 |
| Benchmarking the Accuracy and Robustness of Feedback Alignment Algorithms | | 0 |
| Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization | | 0 |
| BioFors: A Large Biomedical Image Forensics Dataset | Code | 0 |
| Technological Approaches to Detecting Online Disinformation and Manipulation | | 0 |
| Benchmarking high-fidelity pedestrian tracking systems for research, real-time monitoring and crowd control | | 0 |
| A Benchmark for Spray from Nearby Cutting Vehicles | | 0 |
| DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices | | 0 |
| Evolving Evolutionary Algorithms using Linear Genetic Programming | | 0 |
| AutoLay: Benchmarking amodal layout estimation for autonomous driving | | 0 |
| Discriminating modelling approaches for Point in Time Economic Scenario Generation | | 0 |
| Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks | | 0 |
| SIAM: Chiplet-based Scalable In-Memory Acceleration with Mesh for Deep Neural Networks | | 0 |
| Distributional Depth-Based Estimation of Object Articulation Models | Code | 0 |
| BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search | Code | 0 |
| A Look at the Evaluation Setup of the M5 Forecasting Competition | | 0 |
| Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |