SOTAVerified

Two-sample testing

In statistical hypothesis testing, a two-sample test is a test performed on the data of two random samples, each independently obtained from a different given population. The purpose of the test is to determine whether the difference between these two populations is statistically significant. The statistics used in two-sample tests can be used to solve many machine learning problems, such as domain adaptation, covariate shift and generative adversarial networks.

Papers

Showing 101150 of 338 papers

TitleStatusHype
Decision-Making with Auto-Encoding Variational BayesCode1
HypoML: Visual Analysis for Hypothesis-based Evaluation of Machine Learning Models0
Limits of Detecting Text Generated by Large-Scale Language Models0
Two-Sample Testing for Event Impacts in Time SeriesCode0
Modelling and Quantifying Membership Information Leakage in Machine Learning0
Tight Regret Bounds for Noisy Optimization of a Brownian Motion0
Fundamental Limits of Testing the Independence of Irrelevant Alternatives in Discrete Choice0
Breaking hypothesis testing for failure ratesCode0
Goodness-of-Fit Tests for Inhomogeneous Random Graphs0
Copy Move Source-Target Disambiguation through Multi-Branch CNNsCode0
Approval policies for modifications to Machine Learning-Based Software as a Medical Device: A study of bio-creepCode0
Statistical Agnostic Mapping: a Framework in Neuroimaging based on Concentration Inequalities0
Event Outlier Detection in Continuous TimeCode0
Statistical Testing on ASR Performance via Blockwise Bootstrap0
The power of synergy in differential privacy: Combining a small curator with local randomizers0
Comparing distributions: _1 geometry improves kernel two-sample testingCode0
Efficient Near-Optimal Testing of Community Changes in Balanced Stochastic Block Models0
Line-based Camera Pose Estimation in Point Cloud of Structured Environments0
Minimax Nonparametric Two-sample Test under Smoothing0
Training Neural Networks for Likelihood/Density Ratio Estimation0
Noiseless Privacy0
Sequential Controlled Sensing for Composite Multihypothesis Testing0
Universally Consistent K-Sample Tests via Dependence Measures0
Identification of Model Uncertainty via Optimal Design of Experiments Applied to a Mechanical Press0
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing0
Generative Learning of Counterfactual for Synthetic Control Applications in Econometrics0
Two-sample Testing Using Deep LearningCode0
The Power of Batching in Multiple Hypothesis Testing0
Ctrl-Z: Recovering from Instability in Reinforcement Learning0
A Test for Shared Patterns in Cross-modal Brain Activation AnalysisCode0
A New Framework for Distance and Kernel-based Metrics in High Dimensions0
Nonzero-sum Adversarial Hypothesis Testing GamesCode0
Classification Logit Two-sample Testing by Neural NetworksCode0
A Mean-Field Theory for Kernel Alignment with Random Features in Generative and Discriminative Models0
Comparing distributions: _1 geometry improves kernel two-sample testingCode0
Weighted Sampling for Combined Model Selection and Hyperparameter Tuning0
Distance Assessment and Hypothesis Testing of High-Dimensional Samples using Variational Autoencoders0
Can A User Anticipate What Her Followers Want?0
Stock Price Forecasting and Hypothesis Testing Using Neural Networks0
Asymptotically Optimal One- and Two-Sample Testing with Kernels0
Minimum Description Length Revisited0
Extracting relations between outcomes and significance levels in Randomized Controlled Trials (RCTs) publications0
Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word CategoriesCode0
Notes on Computational Hardness of Hypothesis Testing: Predictions using the Low-Degree Likelihood Ratio0
Deciphering Dynamical Nonlinearities in Short Time Series Using Recurrent Neural Networks0
Kernel Hypothesis Testing with Set-valued Data0
Conditional Independence Testing using Generative Adversarial NetworksCode0
A New Approach to Distributed Hypothesis Testing and Non-Bayesian Learning: Improved Learning Rate and Byzantine-Resilience0
hyppo: A Multivariate Hypothesis Testing Python PackageCode0
Bayes Test of Precision, Recall, and F1 Measure for Comparison of Two Natural Language Processing Models0
Show:102550
← PrevPage 3 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy98.5Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy74.4Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy65.9Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy57.9Unverified
#ModelMetricClaimedVerifiedStatus
1MMD-DAvg accuracy91Unverified