Changed to decoupler for pseudobulking (#141) #153

alitinet · 2023-02-03T14:08:39Z

No description provided.

review-notebook-app · 2023-02-03T14:08:44Z

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

review-notebook-app · 2023-02-03T14:13:21Z

View / edit / reply to this conversation on ReviewNB

Zethson commented on 2023-02-03T14:13:21Z
----------------------------------------------------------------

Uhm, why are they all commented out?

alitinet commented on 2023-02-03T14:28:26Z
----------------------------------------------------------------

Oh thanks, I forgot to remove them; we don't need them any more.

review-notebook-app · 2023-02-03T14:13:22Z

View / edit / reply to this conversation on ReviewNB

Zethson commented on 2023-02-03T14:13:22Z
----------------------------------------------------------------

Line #2.    adata_pb = dc.get_pseudobulk(adata, sample_col='sample', groups_col='cell_type', layer='counts', min_prop=0.2, min_smpls=3)

Quite a long line, I'd add a line break after every parameter.
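For reference, a sketch of the suggested layout with one keyword argument per line. The arguments are copied from the line under review; the wrapper function `make_pseudobulk` is hypothetical and only exists so the snippet stands alone, and the `min_prop` / `min_smpls` keywords assume the decoupler version used in this PR (other versions may differ).

```python
def make_pseudobulk(adata):
    """Hypothetical wrapper around the call under review, reformatted only."""
    # Imported lazily so the sketch can be read without decoupler installed.
    import decoupler as dc

    # Same arguments as the original one-liner, one per line.
    return dc.get_pseudobulk(
        adata,
        sample_col="sample",
        groups_col="cell_type",
        layer="counts",
        min_prop=0.2,
        min_smpls=3,
    )
```

The trailing comma after the last argument keeps later diffs to one line per changed parameter.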

review-notebook-app · 2023-02-03T14:13:23Z

View / edit / reply to this conversation on ReviewNB

Zethson commented on 2023-02-03T14:13:23Z
----------------------------------------------------------------

Line #1.    sc.pp.normalize_total(adata_pb, target_sum=1e4)

What made you do this? I think most people normalize to a million?

alitinet commented on 2023-02-03T14:29:53Z
----------------------------------------------------------------

Following the decoupler tutorial here: https://decoupler-py.readthedocs.io/en/latest/notebooks/pseudobulk.html

Zethson commented on 2023-02-03T14:31:05Z
----------------------------------------------------------------

I'd not change this without discussing it with Soroor

review-notebook-app · 2023-02-03T14:13:24Z

View / edit / reply to this conversation on ReviewNB

Zethson commented on 2023-02-03T14:13:24Z
----------------------------------------------------------------

The dimensions are now very very different.

Before: 16 x 15710

Now: 16 x 2435

Intended? Could you explain this, please?

alitinet commented on 2023-02-03T14:41:34Z
----------------------------------------------------------------

It comes from the `min_prop=0.2` and `min_smpls=3` parameters of `get_pseudobulk` (https://decoupler-py.readthedocs.io/en/latest/generated/decoupler.get_pseudobulk.html#decoupler.get_pseudobulk), which filter out genes that are expressed in <20% of all cells and genes that are expressed in <3 samples.

alitinet commented on 2023-02-03T14:43:41Z
----------------------------------------------------------------

I'm not sure if it'd be better to make these more permissive.
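To make the effect of the two thresholds concrete, here is a minimal pure-Python sketch of this kind of filter. It is not decoupler's implementation: the function `filter_genes` is hypothetical, and the rule it applies (keep a gene when at least `min_smpls` samples have at least a `min_prop` fraction of cells expressing it) is an assumed reading of the two parameters, so check the linked documentation before relying on it.

```python
from collections import defaultdict


def filter_genes(counts, sample_of_cell, min_prop=0.2, min_smpls=3):
    """Return the indices of genes that pass the (assumed) filter.

    counts: list of per-cell rows, each a list of gene counts.
    sample_of_cell: one sample label per cell, same order as counts.
    """
    n_genes = len(counts[0])
    cells_per_sample = defaultdict(int)
    # For each sample, how many of its cells have a nonzero count per gene.
    expressing = defaultdict(lambda: [0] * n_genes)

    for row, sample in zip(counts, sample_of_cell):
        cells_per_sample[sample] += 1
        for gene, count in enumerate(row):
            if count > 0:
                expressing[sample][gene] += 1

    kept = []
    for gene in range(n_genes):
        # Number of samples in which enough cells express this gene.
        n_passing = sum(
            expressing[sample][gene] / cells_per_sample[sample] >= min_prop
            for sample in cells_per_sample
        )
        if n_passing >= min_smpls:
            kept.append(gene)
    return kept


# Toy data: 3 samples with 2 cells each, 2 genes.
toy_counts = [[5, 0], [3, 0], [1, 0], [0, 2], [4, 0], [2, 0]]
toy_samples = ["a", "a", "b", "b", "c", "c"]

strict = filter_genes(toy_counts, toy_samples, min_prop=0.2, min_smpls=3)
permissive = filter_genes(toy_counts, toy_samples, min_prop=0.2, min_smpls=1)
```

In the toy data the second gene is expressed in only one sample, so it is dropped with `min_smpls=3` and retained with `min_smpls=1`, which is the kind of loss seen when going from 15710 to 2435 genes.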

review-notebook-app · 2023-02-03T14:13:25Z

View / edit / reply to this conversation on ReviewNB

Zethson commented on 2023-02-03T14:13:25Z
----------------------------------------------------------------

The new plot looks pretty different from the old one. What happened?

alitinet commented on 2023-02-03T14:41:49Z
----------------------------------------------------------------

Didn't notice, thanks! will fix

alitinet · 2023-02-03T14:55:37Z

Hey @soroorh, we changed to decoupler for pseudobulk creation, which filters out genes that are expressed in <20% of all cells and genes that are expressed in <3 samples. After this step, we are left with 2435 of the original 15710 genes. Do you think it would make sense to make this filtering step more permissive, or is it OK as it is?

Also, for pb normalization, should we use 1e6 (as before) or 1e4 (from decoupler tutorial) as the normalizing factor?

soroorh · 2023-02-03T15:37:48Z

Hey @alitinet. Since you are using filterByExpr from edgeR later in the workflow, I would retain as many genes as possible. So, I'd go with more permissive filtering or no filtering. Also, perhaps explain explicitly in the chapter that anyone following edgeR's workflow for DE does not need to apply any filtering when making pseudo-bulks.

I would go with 1e6, as this is closer to how counts-per-million (CPM) is computed in edgeR.
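The difference between the two normalizing factors can be sketched in a few lines of plain Python. This is only the basic proportional scaling (each count divided by the sample total and multiplied by a target sum); the helper `scale_to_total` is hypothetical, and edgeR's own CPM can additionally account for normalization factors, which this sketch ignores.

```python
def scale_to_total(counts, target_sum):
    """Scale one sample's counts so that they sum to target_sum."""
    total = sum(counts)
    if total == 0:
        # An empty sample cannot be rescaled; return it unchanged.
        return [0.0 for _ in counts]
    return [count * target_sum / total for count in counts]


# One toy pseudobulk sample with three genes.
sample_counts = [10, 30, 60]

cpm = scale_to_total(sample_counts, 1e6)      # counts per million
per_10k = scale_to_total(sample_counts, 1e4)  # factor from the tutorial
```

Both choices preserve the relative abundances within a sample and differ only by a constant factor of 100, so the choice mainly matters for consistency with CPM-based conventions downstream.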

Zethson · 2023-04-12T07:44:22Z

@alitinet this chapter should then also use the pertpy dataloader for the Kang dataset!

https://pertpy.readthedocs.io/en/latest/usage/data/pertpy.data.kang_2018.html#pertpy.data.kang_2018

alitinet added 2 commits February 3, 2023 14:55

changed to decoupler for pseudobulking

72bb5c5

updated the environment

cb30a4b

alitinet requested a review from Zethson February 3, 2023 14:08

github-actions bot added the enhancement New feature or request label Feb 3, 2023

Zethson changed the base branch from development to master April 24, 2023 08:20
