SOTAVerified

Two-sample testing

In statistical hypothesis testing, a two-sample test is a test performed on the data of two random samples, each independently obtained from a different given population. The purpose of the test is to determine whether the difference between these two populations is statistically significant. The statistics used in two-sample tests can be used to solve many machine learning problems, such as domain adaptation, covariate shift and generative adversarial networks.

Papers

Showing 150 of 338 papers

TitleStatusHype
Learning Deep Kernels for Non-Parametric Two-Sample TestsCode1
Expert-Supervised Reinforcement Learning for Offline Policy Learning and EvaluationCode1
Towards Probabilistic Verification of Machine UnlearningCode1
Confidence Sets and Hypothesis Testing in a Likelihood-Free Inference SettingCode1
Testing Goodness of Fit of Conditional Density Models with KernelsCode1
MMD Aggregated Two-Sample TestCode1
Model Equality Testing: Which Model Is This API Serving?Code1
AutoML Two-Sample TestCode1
AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularityCode1
Decision-Making with Auto-Encoding Variational BayesCode1
Learning Opinion Dynamics From Social TracesCode1
Safe TestingCode1
Addressing Maximization Bias in Reinforcement Learning with Two-Sample TestingCode1
Statistical comparison of classifiers through Bayesian hierarchical modellingCode1
Computer Vision and Metrics Learning for Hypothesis Testing: An Application of Q-Q Plot for Normality Test0
Adaptivity and Computation-Statistics Tradeoffs for Kernel and Distance based High Dimensional Two Sample Testing0
Concept Drift Detection and Adaptation with Hierarchical Hypothesis Testing0
Adaptive learning of density ratios in RKHS0
Adversarially Robust Classification based on GLRT0
Active Sequential Two-Sample Testing0
Conditional Word Embedding and Hypothesis Testing via Bayes-by-Backprop0
A New Approach to Distributed Hypothesis Testing and Non-Bayesian Learning: Improved Learning Rate and Byzantine-Resilience0
A New Framework for Distance and Kernel-based Metrics in High Dimensions0
An explainable deep vision system for animal classification and detection in trail-camera images with automatic post-deployment retraining0
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing0
Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning0
A Flexible Framework for Hypothesis Testing in High-dimensions0
A novel family of non-parametric cumulative based divergences for point processes0
A framework for paired-sample hypothesis testing for high-dimensional data0
A powerful and efficient set test for genetic markers that handles confounders0
Adversarial learning for product recommendation0
Closing the AI Knowledge Gap0
Adaptive Concentration Inequalities for Sequential Decision Problems0
Advanced Tutorial: Label-Efficient Two-Sample Tests0
CleanML: A Study for Evaluating the Impact of Data Cleaning on ML Classification Tasks0
Collaborative non-parametric two-sample testing0
A Structured Review of the Validity of BLEU0
A strong converse bound for multiple hypothesis testing, with applications to high-dimensional estimation0
Asymptotically Optimal One- and Two-Sample Testing with Kernels0
Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms0
A More Powerful Two-Sample Test in High Dimensions using Random Projection0
A tutorial on MDL hypothesis testing for graph analysis0
A New Approach for Distributed Hypothesis Testing with Extensions to Byzantine-Resilience0
A Mean-Field Theory for Kernel Alignment with Random Features in Generative and Discriminative Models0
Bayesian Hypothesis Testing for Block Sparse Signal Recovery0
Bayesian hypothesis testing for one bit compressed sensing with sensing matrix perturbation0
Bayes Test of Precision, Recall, and F1 Measure for Comparison of Two Natural Language Processing Models0
Bootstrapped Edge Count Tests for Nonparametric Two-Sample Inference Under Heterogeneity0
Bottleneck Problems: Information and Estimation-Theoretic View0
A Sparse Linear Model and Significance Test for Individual Consumption Prediction0
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy98.5Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy74.4Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy65.9Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy57.9Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy91Unverified