Two-sample testing

In statistical hypothesis testing, a two-sample test is a test performed on the data of two random samples, each independently obtained from a different given population. The purpose of the test is to determine whether the difference between these two populations is statistically significant. The statistics used in two-sample tests can be used to solve many machine learning problems, such as domain adaptation, covariate shift and generative adversarial networks.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 338 papers

Title	Date	Tasks	Status	Hype
Learning Deep Kernels for Non-Parametric Two-Sample Tests	Feb 21, 2020	Two-sample testingVocal Bursts Valence Prediction	CodeCode Available	1
Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation	Jun 23, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	1
Towards Probabilistic Verification of Machine Unlearning	Mar 9, 2020	backdoor defenseMachine Unlearning	CodeCode Available	1
Confidence Sets and Hypothesis Testing in a Likelihood-Free Inference Setting	Feb 24, 2020	parameter estimationTwo-sample testing	CodeCode Available	1
Testing Goodness of Fit of Conditional Density Models with Kernels	Feb 24, 2020	Two-sample testing	CodeCode Available	1
MMD Aggregated Two-Sample Test	Oct 28, 2021	TranslationTwo-sample testing	CodeCode Available	1
Model Equality Testing: Which Model Is This API Serving?	Oct 26, 2024	modelTwo-sample testing	CodeCode Available	1
AutoML Two-Sample Test	Jun 17, 2022	AutoMLscientific discovery	CodeCode Available	1
AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularity	Jun 18, 2020	regressionSymbolic Regression	CodeCode Available	1
Decision-Making with Auto-Encoding Variational Bayes	Feb 17, 2020	Decision MakingTwo-sample testing	CodeCode Available	1
Learning Opinion Dynamics From Social Traces	Jun 2, 2020	Graph MiningLink Sign Prediction	CodeCode Available	1
Safe Testing	Jun 18, 2019	Two-sample testing	CodeCode Available	1
Addressing Maximization Bias in Reinforcement Learning with Two-Sample Testing	Jan 20, 2022	Q-Learningreinforcement-learning	CodeCode Available	1
Statistical comparison of classifiers through Bayesian hierarchical modelling	Sep 28, 2016	Two-sample testing	CodeCode Available	1
Computer Vision and Metrics Learning for Hypothesis Testing: An Application of Q-Q Plot for Normality Test	Jan 23, 2019	Dimensionality ReductionMetric Learning	—Unverified	0
Adaptivity and Computation-Statistics Tradeoffs for Kernel and Distance based High Dimensional Two Sample Testing	Aug 4, 2015	Two-sample testing	—Unverified	0
Concept Drift Detection and Adaptation with Hierarchical Hypothesis Testing	Jul 25, 2017	Drift DetectionGeneral Classification	—Unverified	0
Adaptive learning of density ratios in RKHS	Jul 30, 2023	Density EstimationDensity Ratio Estimation	—Unverified	0
Adversarially Robust Classification based on GLRT	Nov 16, 2020	ClassificationGeneral Classification	—Unverified	0
Active Sequential Two-Sample Testing	Jan 30, 2023	Two-sample testingvalid	—Unverified	0
Conditional Word Embedding and Hypothesis Testing via Bayes-by-Backprop	Oct 1, 2018	Two-sample testingWord Embeddings	—Unverified	0
A New Approach to Distributed Hypothesis Testing and Non-Bayesian Learning: Improved Learning Rate and Byzantine-Resilience	Jul 5, 2019	MisinformationTwo-sample testing	—Unverified	0
A New Framework for Distance and Kernel-based Metrics in High Dimensions	Sep 30, 2019	Two-sample testing	—Unverified	0
An explainable deep vision system for animal classification and detection in trail-camera images with automatic post-deployment retraining	Oct 22, 2020	General ClassificationSensitivity	—Unverified	0
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing	Oct 17, 2019	FairnessTwo-sample testing	—Unverified	0
Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning	May 26, 2020	Anomaly DetectionDecision Making	—Unverified	0
A Flexible Framework for Hypothesis Testing in High-dimensions	Apr 26, 2017	regressionTwo-sample testing	—Unverified	0
A novel family of non-parametric cumulative based divergences for point processes	Dec 1, 2010	Point ProcessesTwo-sample testing	—Unverified	0
A framework for paired-sample hypothesis testing for high-dimensional data	Sep 28, 2023	Two-sample testing	—Unverified	0
A powerful and efficient set test for genetic markers that handles confounders	May 3, 2012	Two-sample testing	—Unverified	0
Adversarial learning for product recommendation	Jul 7, 2020	Generative Adversarial NetworkProduct Recommendation	—Unverified	0
Closing the AI Knowledge Gap	Mar 20, 2018	FairnessTwo-sample testing	—Unverified	0
Adaptive Concentration Inequalities for Sequential Decision Problems	Dec 1, 2016	Two-sample testing	—Unverified	0
Advanced Tutorial: Label-Efficient Two-Sample Tests	Jan 7, 2025	Active LearningTwo-sample testing	—Unverified	0
CleanML: A Study for Evaluating the Impact of Data Cleaning on ML Classification Tasks	Apr 20, 2019	General ClassificationTwo-sample testing	—Unverified	0
Collaborative non-parametric two-sample testing	Feb 8, 2024	Two-sample testing	—Unverified	0
A Structured Review of the Validity of BLEU	Sep 1, 2018	DiagnosticMachine Translation	—Unverified	0
A strong converse bound for multiple hypothesis testing, with applications to high-dimensional estimation	Jun 14, 2017	Active Learningcompressed sensing	—Unverified	0
Asymptotically Optimal One- and Two-Sample Testing with Kernels	Aug 27, 2019	Change DetectionTwo-sample testing	—Unverified	0
Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms	Feb 24, 2020	Two-sample testing	—Unverified	0
A More Powerful Two-Sample Test in High Dimensions using Random Projection	Aug 11, 2011	Two-sample testing	—Unverified	0
A tutorial on MDL hypothesis testing for graph analysis	Oct 31, 2018	Two-sample testing	—Unverified	0
A New Approach for Distributed Hypothesis Testing with Extensions to Byzantine-Resilience	Mar 14, 2019	Two-sample testing	—Unverified	0
A Mean-Field Theory for Kernel Alignment with Random Features in Generative and Discriminative Models	Sep 25, 2019	Two-sample testing	—Unverified	0
Bayesian Hypothesis Testing for Block Sparse Signal Recovery	Aug 22, 2015	Two-sample testing	—Unverified	0
Bayesian hypothesis testing for one bit compressed sensing with sensing matrix perturbation	Nov 18, 2015	compressed sensingTwo-sample testing	—Unverified	0
Bayes Test of Precision, Recall, and F1 Measure for Comparison of Two Natural Language Processing Models	Jul 1, 2019	ChunkingTwo-sample testing	—Unverified	0
Bootstrapped Edge Count Tests for Nonparametric Two-Sample Inference Under Heterogeneity	Apr 26, 2023	Two-sample testingVocal Bursts Valence Prediction	—Unverified	0
Bottleneck Problems: Information and Estimation-Theoretic View	Nov 12, 2020	LEMMATwo-sample testing	—Unverified	0
A Sparse Linear Model and Significance Test for Individual Consumption Prediction	Nov 5, 2015	PredictionTwo-sample testing	—Unverified	0

Show:10 25 50

← PrevPage 1 of 7Next →

All datasets Blob (9 modes, 40 for each)CIFAR-10 vs CIFAR-10.1 (1000 samples)HDGM (d=10, N=4000)HIGGS Data Set MNIST vs Fake MNIST

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	98.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	74.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	65.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	57.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	91	—	Unverified