Tree-SNE: Hierarchical Clustering and Visualization Using t-SNE
Isaac Robinson, Emma Pierce-Hoffman
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/isaacrob/treesneOfficialIn papernone★ 63
Abstract
t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology. Building on recent advances in speeding up t-SNE and obtaining finer-grained structure, we combine the two to create tree-SNE, a hierarchical clustering and visualization algorithm based on stacked one-dimensional t-SNE embeddings. We also introduce alpha-clustering, which recommends the optimal cluster assignment, without foreknowledge of the number of clusters, based off of the cluster stability across multiple scales. We demonstrate the effectiveness of tree-SNE and alpha-clustering on images of handwritten digits, mass cytometry (CyTOF) data from blood cells, and single-cell RNA-sequencing (scRNA-seq) data from retinal cells. Furthermore, to demonstrate the validity of the visualization, we use alpha-clustering to obtain unsupervised clustering results competitive with the state of the art on several image data sets. Software is available at https://github.com/isaacrob/treesne.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| coil-100 | Tree-SNE | NMI | 0.93 | — | Unverified |
| Coil-20 | Tree-SNE | NMI | 0.96 | — | Unverified |
| MNIST-full | Tree-SNE | NMI | 0.86 | — | Unverified |
| USPS | Tree-SNE | NMI | 0.89 | — | Unverified |