From 0f7238dc9431a96b5bd8f6794115edaf4c181fbc Mon Sep 17 00:00:00 2001
From: Andy Coenen
Date: Wed, 4 Dec 2019 09:27:26 -0800
Subject: [PATCH] Update text and readme to reflect points

---
 README.md               | 4 +++-
 src/article/Article.svx | 2 +-
 2 files changed, 4 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 726d49a..a799dd6 100644
--- a/README.md
+++ b/README.md
@@ -29,7 +29,9 @@ yarn dev:toy_comparison
 
 #### Data preprocessing
 
-_Understanding UMAP_ uses a few tricks to make the data payloads for some of the interactive figures small enough to download in a reasonable time. The `mammoth` figures use a 10-bit encoding scheme to compress the 10,000 3D points into a significantly smaller payload. The `hyperparameters` and `toy_comparison` figures precompute UMAP embeddings for all of their different combinations, then use the same 10-bit encoding scheme to compress the data.
+For the mammoth figures, the [raw 3D data](https://github.com/MNoichl/UMAP-examples-mammoth-/blob/master/mammoth_a.csv) was downsampled to 50,000 points before being projected with UMAP / t-SNE. These 50,000 points were then randomly subsampled to 10,000 points in order to minimize the payload size.
+
+_Understanding UMAP_ uses a few tricks to make the data payloads for some of the interactive figures small enough to download in a reasonable time. The `mammoth` figures use a 10-bit encoding scheme to compress the 10,000 data points into a significantly smaller payload. The `hyperparameters` and `toy_comparison` figures precompute UMAP embeddings for all of their different combinations, then use the same 10-bit encoding scheme to compress the data.
 
 ```bash
 yarn preprocess:hyperparameters
diff --git a/src/article/Article.svx b/src/article/Article.svx
index 9c75054..786547b 100644
--- a/src/article/Article.svx
+++ b/src/article/Article.svx
@@ -103,7 +103,7 @@ The following visualization - extended from excellent work by [Max Noichl](https
 Figure 5:
- UMAP projections of a 3D woolly mammoth skeleton (50,000 points) into 2 dimensions, with various settings for the n_neighbors and min_dist parameters.
+ UMAP projections of a 3D woolly mammoth skeleton (50k points, 10k shown) into 2 dimensions, with various settings for the n_neighbors and min_dist parameters.
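
Note on the README text in this patch: it describes randomly subsampling points and compressing them with a 10-bit encoding, but does not show how. The sketch below is one plausible reading, not the repository's actual preprocessing code. It assumes min-max normalization per axis and packing each point's 10-bit coordinates into a single integer. All function names (`subsample`, `encode`, `decode`, and the helpers) are hypothetical.

```python
import random

BITS = 10
LEVELS = (1 << BITS) - 1  # 1023, the largest 10-bit value


def subsample(points, k, seed=0):
    """Randomly pick k points without replacement (seeded for repeatability)."""
    return random.Random(seed).sample(points, k)


def quantize(value, lo, hi):
    """Map value in [lo, hi] to an integer in [0, 1023]."""
    span = (hi - lo) or 1.0  # avoid dividing by zero on a flat axis
    q = round((value - lo) / span * LEVELS)
    return max(0, min(LEVELS, q))


def pack(coords):
    """Pack a sequence of 10-bit integers into one integer, first coord highest."""
    packed = 0
    for c in coords:
        packed = (packed << BITS) | c
    return packed


def unpack(packed, dims):
    """Inverse of pack: recover dims 10-bit integers in their original order."""
    out = []
    for _ in range(dims):
        out.append(packed & LEVELS)
        packed >>= BITS
    return out[::-1]


def encode(points):
    """Quantize every axis to 10 bits; return the codes plus per-axis bounds."""
    dims = len(points[0])
    lows = [min(p[d] for p in points) for d in range(dims)]
    highs = [max(p[d] for p in points) for d in range(dims)]
    codes = [
        pack([quantize(p[d], lows[d], highs[d]) for d in range(dims)])
        for p in points
    ]
    return codes, lows, highs


def decode(codes, lows, highs):
    """Approximate the original points from their codes and per-axis bounds."""
    dims = len(lows)
    points = []
    for code in codes:
        q = unpack(code, dims)
        points.append(tuple(
            lows[d] + q[d] / LEVELS * (highs[d] - lows[d]) for d in range(dims)
        ))
    return points
```

Under these assumptions a 3D point fits in 30 bits, and the round-trip error per axis is at most half of one quantization step, i.e. `(hi - lo) / 1023 / 2`. That is typically well below what a scatterplot at screen resolution can show.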