-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy path3_normalization_scaling.qmd
118 lines (84 loc) · 3.8 KB
/
3_normalization_scaling.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
---
title: "Normalization and scaling"
engine: knitr
---
Loading the required packages:
```{r}
#| output: false
library(Seurat)
library(ggplot2)
library(clustree)
library(patchwork)
library(dplyr)
```
We load the object that was output of the quality control part:
```{r}
seu <- readRDS("output/seu_part2.rds")
```
## Normalization and scaling with SCTransform
Biological heterogeneity in spatial RNA-seq data is often confounded by technical factors including sequencing depth. The number of molecules detected in each spot can vary significantly between spots, even within the same celltype. Note that the variance in molecular counts/spot can be substantial for spatial datasets, particularly if there are
differences in cell density across the tissue.
Therefore, we apply sctransform normalization (Hafemeister and Satija, Genome Biology 2019), which builds regularized
negative binomial models of gene expression in order to account for technical artifacts while preserving biological variance. During the normalization, we also remove confounding sources of variation (here we take mitochondrial mapping percentage).
We need to apply `SCTransform` on each individual slice. Therefore, we split the object back into a list (with `SplitObject`). Next, we run into a small issue that both slice images are maintained in the split object, so we have keep only the image corresponding to the count table. Then, we apply `SCTransform` on the individual slices and merge the objects back together with `merge`:
```{r}
#| message: FALSE
#| warning: FALSE
seu_list <- SplitObject(seu, split.by = "orig.ident")
# preparing both objects for SCTransform
for(slice in names(seu_list)) {
# images aren't split with SplitObject. Resetting the images.
seu_list[[slice]]@images <- setNames(
list(seu_list[[slice]]@images[[slice]]),
slice)
# bugfix based on https://github.com/satijalab/seurat/issues/8216
seu_list[[slice]][["RNA"]] <- seu_list[[slice]][["Spatial"]]
DefaultAssay(seu_list[[slice]]) <- "RNA"
}
seu_list <- lapply(X = seu_list, FUN = SCTransform, assay = "RNA",
vars.to.regress = "percent_mt")
```
::: {.callout-important}
## Exercise
After running the code, to do the SCT transformation, which assays have been added to the seurat object? Note that you can get assay data with the function `Assays`.
:::
::: {.callout-tip collapse="true"}
## Answer
Just by typing the object name (`seu`) we see which layers are in there:
```{r}
Assays(seu_list[[1]])
```
Showing us that an assay called `SCT` has appeared.
:::
Now that we have done the transformation it is also possible to plot gene experssion information in a spatial context, e.g. `Myl4`:
```{r}
SpatialPlot(seu_list$Anterior,
features = "Myl4") +
SpatialPlot(seu_list$Posterior,
features = "Myl4") +
plot_layout(guides = "collect") &
theme(legend.position = "right")
```
::: {.callout-important}
## Exercise
Create the same plot, but now for the gene `Calb2`. In which two parts of the brain is it primarily expressed?
Hint: check out [Allen Brain Atlas](http://atlas.brain-map.org/atlas?atlas=2&plate=100883804#atlas=2&plate=100884129&resolution=19.04&x=7671.818403764205&y=4000&zoom=-4&structure=549) for the names of the different parts of the brain.
:::
::: {.callout-tip collapse="true"}
## Answer
```{r}
gene <- "Calb2"
SpatialPlot(seu_list$Anterior,
features = gene) +
SpatialPlot(seu_list$Posterior,
features = gene) +
plot_layout(guides = "collect") &
theme(legend.position = "right")
```
It's mainly expressed in the olfactory bulb (left of the anterior slice) and cerebellum (right of the posterior slice).
:::
After quality control and transformation, we can save the output as an rds files:
```{r}
saveRDS(seu_list,
paste0("output/seu_part3.rds"))
```