Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Keep QIDs in json output #10

Open
Adafede opened this issue Feb 28, 2022 · 0 comments
Open

Keep QIDs in json output #10

Adafede opened this issue Feb 28, 2022 · 0 comments

Comments

@Adafede
Copy link
Contributor

Adafede commented Feb 28, 2022

When using -j the corresponding QID does not appear in the output. This is somehow problematic as labels change, while QIDs don't.

Example:

python3 classify.py -d ./ -m 'InChI=1S/C28H32O15/c1-39-14-7-15-18(12(32)6-13(40-15)10-2-4-11(31)5-3-10)22(35)19(14)26-27(24(37)21(34)16(8-29)41-26)43-28-25(38)23(36)20(33)17(9-30)42-28/h2-7,16-17,20-21,23-31,33-38H,8-9H2,1H3/t16-,17-,20-,21-,23+,24+,25-,26+,27-,28+/m1/s1' -j
{"molecule": "InChI=1S/C28H32O15/c1-39-14-7-15-18(12(32)6-13(40-15)10-2-4-11(31)5-3-10)22(35)19(14)26-27(24(37)21(34)16(8-29)41-26)43-28-25(38)23(36)20(33)17(9-30)42-28/h2-7,16-17,20-21,23-31,33-38H,8-9H2,1H3/t16-,17-,20-,21-,23+,24+,25-,26+,27-,28+/m1/s1", "ikey": "VGGSULWDCMWZPO-ODEMIOGVSA-N", "hits": [{"classification_names": ["flavonoid", "2-phenylchromane flavonoid", "flavone", "6C-substituted flavone", "6C-glycosylated flavone", "spinosin"], "biological_process": ["flavone biosynthetic process", "GO:0051553"]}]}

vs

python3 classify.py -d ./ -m 'InChI=1S/C28H32O15/c1-39-14-7-15-18(12(32)6-13(40-15)10-2-4-11(31)5-3-10)22(35)19(14)26-27(24(37)21(34)16(8-29)41-26)43-28-25(38)23(36)20(33)17(9-30)42-28/h2-7,16-17,20-21,23-31,33-38H,8-9H2,1H3/t16-,17-,20-,21-,23+,24+,25-,26+,27-,28+/m1/s1'   
Namespace(data='./', molecule='InChI=1S/C28H32O15/c1-39-14-7-15-18(12(32)6-13(40-15)10-2-4-11(31)5-3-10)22(35)19(14)26-27(24(37)21(34)16(8-29)41-26)43-28-25(38)23(36)20(33)17(9-30)42-28/h2-7,16-17,20-21,23-31,33-38H,8-9H2,1H3/t16-,17-,20-,21-,23+,24+,25-,26+,27-,28+/m1/s1', rawhits=False, testfile=None, nptest=False, json=False, natural=False, list_nproots=False)
reading biosyn class data
reading biosyn compound data
reading class pattern data
reading superclass data
collecting hits...
purging redundant hits
('Q105285804', 'spinosin', 'VGGSULWDCMWZPO') {'GO:0051553', 'flavone biosynthetic process'}
('Q421916', 'diol', None) {'diol biosynthetic process', 'GO:0034312'}
('Q2832210', 'primary alcohol', '*C(O)([H])[H]') {'primary alcohol biosynthetic process', 'GO:0034309'}
('Q2832211', 'secondary alcohol', None) {'secondary alcohol biosynthetic process', 'GO:1902653'}
('Q9148474', 'aromatic alcohol', None) {'alcohol biosynthetic process', 'GO:0046165'}
('Q67206812', 'methyl ether', None) {'ether biosynthetic process', 'GO:1901503'}
('Q74173050', 'D-glucoside', '[C@@H]1(OC([C@H](O)[C@H]([C@@H]1O)O)O*)CO') {'GO:0016138', 'glycoside biosynthetic process'}
('Q74568405', 'alpha-D-mannoside', 'OC[C@@H]1[C@H]([C@@H]([C@H](O)[C@H](O1)O*)O)O') {'GO:0016138', 'glycoside biosynthetic process'}
('Q74651555', 'phenolic donor', 'C1(=CC=C(C=C1)*)O') {'phenol-containing compound biosynthetic process', 'GO:0046189'}
('Q75078202', 'pyranoside', 'O1C(C(C(C(C1O*)O)O)O)CO') {'GO:0016138', 'glycoside biosynthetic process'}
('Q75082510', 'beta-D-galactoside', 'OC[C@H]1O[C@@H](O[*])[C@H](O)[C@@H](O)[C@H]1O') {'GO:0016138', 'glycoside biosynthetic process'}

Hope this makes sense

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant