Two-sample testing

In statistical hypothesis testing, a two-sample test is a test performed on the data of two random samples, each independently obtained from a different given population. The purpose of the test is to determine whether the difference between these two populations is statistically significant. The statistics used in two-sample tests can be used to solve many machine learning problems, such as domain adaptation, covariate shift and generative adversarial networks.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 338 papers

Title	Date	Tasks	Status	Hype
Model Equality Testing: Which Model Is This API Serving?	Oct 26, 2024	modelTwo-sample testing	CodeCode Available	1
AutoML Two-Sample Test	Jun 17, 2022	AutoMLscientific discovery	CodeCode Available	1
Addressing Maximization Bias in Reinforcement Learning with Two-Sample Testing	Jan 20, 2022	Q-Learningreinforcement-learning	CodeCode Available	1
MMD Aggregated Two-Sample Test	Oct 28, 2021	TranslationTwo-sample testing	CodeCode Available	1
Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation	Jun 23, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	1
AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularity	Jun 18, 2020	regressionSymbolic Regression	CodeCode Available	1
Learning Opinion Dynamics From Social Traces	Jun 2, 2020	Graph MiningLink Sign Prediction	CodeCode Available	1
Towards Probabilistic Verification of Machine Unlearning	Mar 9, 2020	backdoor defenseMachine Unlearning	CodeCode Available	1
Testing Goodness of Fit of Conditional Density Models with Kernels	Feb 24, 2020	Two-sample testing	CodeCode Available	1
Confidence Sets and Hypothesis Testing in a Likelihood-Free Inference Setting	Feb 24, 2020	parameter estimationTwo-sample testing	CodeCode Available	1
Learning Deep Kernels for Non-Parametric Two-Sample Tests	Feb 21, 2020	Two-sample testingVocal Bursts Valence Prediction	CodeCode Available	1
Decision-Making with Auto-Encoding Variational Bayes	Feb 17, 2020	Decision MakingTwo-sample testing	CodeCode Available	1
Safe Testing	Jun 18, 2019	Two-sample testing	CodeCode Available	1
Statistical comparison of classifiers through Bayesian hierarchical modelling	Sep 28, 2016	Two-sample testing	CodeCode Available	1
Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework	Jun 19, 2025	Federated LearningTwo-sample testing	—Unverified	0
Signature Maximum Mean Discrepancy Two-Sample Statistical Tests	Jun 2, 2025	Two-sample testing	—Unverified	0
From Two Sample Testing to Singular Gaussian Discrimination	May 7, 2025	Two-sample testing	—Unverified	0
Advanced Tutorial: Label-Efficient Two-Sample Tests	Jan 7, 2025	Active LearningTwo-sample testing	—Unverified	0
Optimal Algorithms for Augmented Testing of Discrete Distributions	Dec 1, 2024	Two-sample testing	—Unverified	0
A Unified Data Representation Learning for Non-parametric Two-sample Testing	Nov 30, 2024	Representation LearningTwo-sample testing	—Unverified	0
Minimax Optimal Two-Sample Testing under Local Differential Privacy	Nov 13, 2024	Two-sample testing	CodeCode Available	0
General Frameworks for Conditional Two-Sample Testing	Oct 22, 2024	Domain AdaptationFairness	CodeCode Available	0
Credal Two-Sample Tests of Epistemic Uncertainty	Oct 16, 2024	Two-sample testing	CodeCode Available	0
Machine Learning for Two-Sample Testing under Right-Censored Data: A Simulation Study	Sep 12, 2024	Two-sample testing	CodeCode Available	0
Computational-Statistical Trade-off in Kernel Two-Sample Testing with Random Fourier Features	Jul 12, 2024	Two-sample testing	CodeCode Available	0

Show:10 25 50

← PrevPage 1 of 14Next →

All datasets Blob (9 modes, 40 for each)CIFAR-10 vs CIFAR-10.1 (1000 samples)HDGM (d=10, N=4000)HIGGS Data Set MNIST vs Fake MNIST

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	98.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	74.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	65.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	57.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MMD-D	Avg accuracy	91	—	Unverified