[WIP] Examples dataset for Phenotype BEP #393

Remi-Gau · 2023-08-08T20:35:30Z

No description provided.

ericearl · 2023-08-11T13:10:46Z

rm_subjects.py

@@ -0,0 +1,73 @@
+"""Utility script to remove subjects from the phenotype dataset.


Suggested change

"""Utility script to remove subjects from the phenotype dataset.

"""

Utility script to remove subjects from the phenotype dataset.

ericearl · 2023-08-11T13:11:51Z

rm_subjects.py

+if "sub-ON01016" not in subjects_to_keep:
+    subjects_to_keep.append('sub-ON01016')


Please comment for reasoning?

ericearl · 2023-08-11T13:13:39Z

rm_subjects.py

+            print(f'Removing {subject_dir}')
+            shutil.rmtree(subject_dir)
+
+# remove subject from participants.tsv


Suggested change

# remove subject from participants.tsv

# remove subjects from participants.tsv

Remi-Gau · 2023-08-11T13:19:15Z

by the way I am not sure we will want to keep the script in the end, but I put it in the PR in case we want to regenerate those datasets slightly differently

ericearl · 2023-08-11T13:21:51Z

rm_subjects.py

+    print(participants_df)
+    participants_df.to_csv(participants_tsv, sep='\t', index=False)
+
+# remove subject from all tsv in phenotype folder


Suggested change

# remove subject from all tsv in phenotype folder

# remove subjects from all tsv's in phenotype folder

ericearl · 2023-08-11T13:23:37Z

rm_subjects.py

What a great utility! Thanks.

ericearl · 2023-08-11T13:27:27Z

ds004215-pheno_source/phenotype/ace.json

Maybe delete the unpaired JSONs like this one (meaning there's no TSV to go with the JSON)?

yes I think I did that manually for a couple of things and that would have to be scripted.

Remi-Gau · 2023-08-11T13:30:03Z

another todo in the scrip:

truncate the data.

ericearl · 2023-08-11T13:33:58Z

ds004129-pheno_segregated/sub-ON66199/phenotype/bai.tsv

@@ -0,0 +1,2 @@
+participant_id	bai01_r03, bai1_1	bai02_r03, bai1_2	bai03_r03, bai1_3	bai04_r03, bai1_4	bai05_r03, bai1_5	bai06_r03, bai1_6	bai07_r03, bai2_7	bai08_r03, bai2_8	bai09_r03, bai2_9	bai10_r03, bai2_10	bai11_r03, bai2_11	bai12_r03, bai3_12	bai13_r03, bai3_13	bai14_r03, bai3_14	bai15_r03, bai3_15	bai16_r03, bai3_16	bai17_r03, bai4_1	bai18_r03, bai4_2	bai19_r03, bai4_3	bai20_r03, bai4_4	bai21_r03, bai4_5
+sub-ON66199	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0


I can't remember if we decided since this file is nested under a participant_id folder that we don't need the participant_id column, or if maybe it just always needs to be there. Open to commentary...

ericearl

What a wonderful bunch of examples and utilities! Address those few mini-suggestions and the couple slightly larger comments and I approve!

Remi-Gau added 2 commits August 8, 2023 16:33

add_pheno

b167c35

add pheno

6a6d396

ericearl reviewed Aug 11, 2023

View reviewed changes

rm_subjects.py

Copy link

ericearl Aug 11, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What a great utility! Thanks.

ericearl reviewed Aug 11, 2023

View reviewed changes

ericearl approved these changes Aug 11, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Examples dataset for Phenotype BEP #393

[WIP] Examples dataset for Phenotype BEP #393

Remi-Gau commented Aug 8, 2023

ericearl Aug 11, 2023

ericearl Aug 11, 2023

ericearl Aug 11, 2023

Remi-Gau commented Aug 11, 2023

ericearl Aug 11, 2023

ericearl Aug 11, 2023

ericearl Aug 11, 2023

Remi-Gau Aug 11, 2023

Remi-Gau commented Aug 11, 2023

ericearl Aug 11, 2023

ericearl left a comment

		@@ -0,0 +1,73 @@
		"""Utility script to remove subjects from the phenotype dataset.

	"""Utility script to remove subjects from the phenotype dataset.
	"""
	Utility script to remove subjects from the phenotype dataset.

		if "sub-ON01016" not in subjects_to_keep:
		subjects_to_keep.append('sub-ON01016')

	# remove subject from participants.tsv
	# remove subjects from participants.tsv

	# remove subject from all tsv in phenotype folder
	# remove subjects from all tsv's in phenotype folder

		@@ -0,0 +1,2 @@
		participant_id bai01_r03, bai1_1 bai02_r03, bai1_2 bai03_r03, bai1_3 bai04_r03, bai1_4 bai05_r03, bai1_5 bai06_r03, bai1_6 bai07_r03, bai2_7 bai08_r03, bai2_8 bai09_r03, bai2_9 bai10_r03, bai2_10 bai11_r03, bai2_11 bai12_r03, bai3_12 bai13_r03, bai3_13 bai14_r03, bai3_14 bai15_r03, bai3_15 bai16_r03, bai3_16 bai17_r03, bai4_1 bai18_r03, bai4_2 bai19_r03, bai4_3 bai20_r03, bai4_4 bai21_r03, bai4_5
		sub-ON66199 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

[WIP] Examples dataset for Phenotype BEP #393

Are you sure you want to change the base?

[WIP] Examples dataset for Phenotype BEP #393

Conversation

Remi-Gau commented Aug 8, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Remi-Gau commented Aug 11, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Remi-Gau commented Aug 11, 2023

Choose a reason for hiding this comment

ericearl left a comment

Choose a reason for hiding this comment