SOTAVerified

Benchmarking

Papers

Showing 43514400 of 5548 papers

TitleStatusHype
Towards IID representation learning and its application on biomedical dataCode0
A predictive analytics approach for stroke prediction using machine learning and neural networksCode0
Prepare for Trouble and Make it Double. Supervised and Unsupervised Stacking for AnomalyBased Intrusion Detection0
Towards Class-agnostic Tracking Using Feature Decorrelation in Point Clouds0
Spatio-Temporal Latent Graph Structure Learning for Traffic Forecasting0
Generalised Gaussian Process Latent Variable Models (GPLVM) with Stochastic Variational Inference0
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models0
SUTD-PRCM Dataset and Neural Architecture Search Approach for Complex Metasurface Design0
Evaluating Feature Attribution Methods in the Image DomainCode0
Benchmarking Generative Latent Variable Models for SpeechCode0
Benchmarking the Linear Algebra Awareness of TensorFlow and PyTorchCode0
How to Manage Tiny Machine Learning at Scale: An Industrial PerspectiveCode0
Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks0
Benchmarking missing-values approaches for predictive models on health databasesCode0
On loss functions and evaluation metrics for music source separation0
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training BenchmarkCode0
Benchmarking Online Sequence-to-Sequence and Character-based Handwriting Recognition from IMU-Enhanced Pens0
Benchmarking Robot Manipulation with the Rubik's Cube0
Dual Task Framework for Improving Persona-grounded Dialogue Dataset0
High Fidelity RF Clutter Modeling and Simulation0
Lightweight Jet Reconstruction and Identification as an Object Detection Task0
Comparative Study Between Distance Measures On Supervised Optimum-Path Forest ClassificationCode0
BIQ2021: A Large-Scale Blind Image Quality Assessment Database0
Evaluation Methods and Measures for Causal Learning Algorithms0
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm ConfigurationCode0
RerrFact: Reduced Evidence Retrieval Representations for Scientific Claim VerificationCode0
Quality Assessment of Low Light Restored Images: A Subjective Study and an Unsupervised Model0
Structured Prediction Problem ArchiveCode0
Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization0
A quantitative method for benchmarking fair income distribution0
Black-box Bayesian inference for economic agent-based models0
AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation0
Benchmarking Conventional Vision Models on Neuromorphic Fall Detection and Action Recognition Dataset0
Benchmarking Resource Usage for Efficient Distributed Deep Learning0
Benchmarking learned non-Cartesian k-space trajectories and reconstruction networks0
MeltpoolNet: Melt pool Characteristic Prediction in Metal Additive Manufacturing Using Machine Learning0
A Multi-rater Comparative Study of Automatic Target Localization Methods for Epilepsy Deep Brain Stimulation Procedures0
Jointly Learning Knowledge Embedding and Neighborhood Consensus with Relational Knowledge Distillation for Entity Alignment0
DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise AnnotationsCode0
Out of Distribution Detection on ImageNet-OCode0
Visual Object Tracking on Multi-modal RGB-D Videos: A Review0
Towards Private Learning on Decentralized Graphs with Local Differential Privacy0
Individual Treatment Effect Estimation Through Controlled Neural Network Training in Two Stages0
High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing0
Benchmarking Subset Selection from Large Candidate Solution Sets in Evolutionary Multi-objective OptimizationCode0
A Simple Evolutionary Algorithm for Multi-modal Multi-objective Optimization0
A Comparative study of Hyper-Parameter Optimization Tools0
Feasibility of BERT Embeddings For Domain-Specific Knowledge Mining0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks0
Context-guided Triple Matching for Multiple Choice Question Answering0
Show:102550
← PrevPage 88 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified