Upgrade to Biolink 2.1 (KG2.7.1) - due Aug. 11 #1593

amykglen · 2021-07-28T22:46:27Z

associated code changes should go in the kg2integration branch

first update NodeSynonymizer:

change SRI NodeNormalizer URL to Biolink 2.1 endpoint
specify version=2.1.0 in Biolink Lookup tool URL
adjust NodeSynonymizer conflations
adjust any other mentions of ChemicalSubstance
try doing a synonymizer build

once we have what seems to be a good synonymizer:

update conflations hard-coded in KG2c build (record_kg2c_meta_info.py)
build a new KG2c (put it at http://kg2-7-1c.rtx.ai:7474/browser/) @acevedol
load it into plover (available at http://kg2-7-1cplover.rtx.ai:9990)

rebuild or edit and put copies in /translator/data/orangeboard/databases/KG2.7.1 on arax.ncats.io:

note: As databases are rebuilt, the new copy of config_local.json will need to be updated to point to their new paths.

update ARAX codebase:

make updates to the ARAX codebase:
- change usages of biolink:ChemicalSubstance to biolink:ChemicalEntity (lots of tests use this)
  - Expand/Resultify @amykglen
  - CHP/DTD/COHD Expand @chunyuma
  - Overlay @finnagin ?
  - Other?
- update Expand's hard-coded conflations
- specify version 2.1.0 in every place Expand grabs the Biolink model
test everything together (entire ARAX pytest suite should pass when using the new config_local.json - must locally set force_local=True in ARAX_expander.py to avoid using the old KG2 API)

other things:

update the test triples that go in some NCATS repo @finnagin
update Biolink version (2.1.0) and KG2 version (2.7.1) in openapi yaml @edeutsch ?
update SmartAPI registration for ARAX
after main roll-out is complete, rename config_local.json to config_local.json_FROZEN_DO-NOT-EDIT-FURTHER (any remaining edits to the config file, such as when the DTD build is complete, should be made directly to the master configv2.json on araxconfig.rtx.ai)

The text was updated successfully, but these errors were encountered:

amykglen · 2021-07-29T04:11:34Z

well the synonymizer build appeared to go well - artifacts are uploaded to arax.ncats.io at /data/orangeboard/databases/KG2.7.1/synonymizer.

only 56 problems listed in Problems.tsv (down from 76 for KG2.6.7.1).

going to do some spot-checking tomorrow to make sure things look ok.

edeutsch · 2021-07-29T04:53:51Z

terrific!

amykglen · 2021-07-29T21:32:51Z

things seem good in the synonymizer based on some spot-checking... although wondering about one thing - is it normal for the SRI_normalizer_category and other SRI fields here to be null?

python3 node_synonymizer.py --lookup CHEMBL.COMPOUND:CHEMBL112

...
    "id": {
      "SRI_normalizer_category": null,
      "SRI_normalizer_curie": null,
      "SRI_normalizer_name": null,
      "category": "biolink:SmallMolecule",
      "identifier": "CHEMBL.COMPOUND:CHEMBL112",
      "name": "ACETAMINOPHEN"
    },
...

edeutsch · 2021-07-29T21:47:54Z

It could happen if the concept is not known to the SRI Node Normalizer, but that is not the case here. Looks like a bug. What happens if you run this:

python3 sri_node_normalizer.py -c CHEMBL.COMPOUND:CHEMBL112

in the location where the SRI NN and NodeSyn was built?

amykglen · 2021-07-29T22:04:32Z

it returns this:

==========================================================
Native SRI Node Normalizer results:
{
  "CHEMBL.COMPOUND:CHEMBL112": {
    "equivalent_identifiers": [
      {
        "identifier": "PUBCHEM.COMPOUND:1983",
        "label": "Acetaminophen"
      },
      {
        "identifier": "CHEMBL.COMPOUND:CHEMBL112",
        "label": "ACETAMINOPHEN"
      },
      {
        "identifier": "UNII:362O9ITL9D",
        "label": "ACETAMINOPHEN"
      },
      {
        "identifier": "CHEBI:46195",
        "label": "paracetamol"
      },
      {
        "identifier": "DRUGBANK:DB00316"
      },
      {
        "identifier": "MESH:D000082",
        "label": "Acetaminophen"
      },
      {
        "identifier": "CAS:103-90-2"
      },
      {
        "identifier": "CAS:360769-21-7"
      },
      {
        "identifier": "DrugCentral:52",
        "label": "paracetamol"
      },
      {
        "identifier": "GTOPDB:5239",
        "label": "paracetamol"
      },
      {
        "identifier": "HMDB:HMDB0001859",
        "label": "Acetaminophen"
      },
      {
        "identifier": "KEGG.COMPOUND:C06804",
        "label": "Acetaminophen"
      },
      {
        "identifier": "INCHIKEY:RZVAJINKPMORJF-UHFFFAOYSA-N"
      }
    ],
    "id": {
      "identifier": "PUBCHEM.COMPOUND:1983",
      "label": "Acetaminophen"
    },
    "type": [
      "biolink:SmallMolecule",
      "biolink:MolecularEntity",
      "biolink:ChemicalEntity",
      "biolink:PhysicalEssence",
      "biolink:NamedThing",
      "biolink:Entity",
      "biolink:PhysicalEssenceOrOccurrent"
    ]
  }
}
==========================================================
Local more compact and useful formatting:
{
  "curie": "CHEMBL.COMPOUND:CHEMBL112",
  "equivalent_identifiers": [
    {
      "identifier": "PUBCHEM.COMPOUND:1983",
      "label": "Acetaminophen"
    },
    {
      "identifier": "CHEMBL.COMPOUND:CHEMBL112",
      "label": "ACETAMINOPHEN"
    },
    {
      "identifier": "UNII:362O9ITL9D",
      "label": "ACETAMINOPHEN"
    },
    {
      "identifier": "CHEBI:46195",
      "label": "paracetamol"
    },
    {
      "identifier": "DRUGBANK:DB00316"
    },
    {
      "identifier": "MESH:D000082",
      "label": "Acetaminophen"
    },
    {
      "identifier": "CAS:103-90-2"
    },
    {
      "identifier": "CAS:360769-21-7"
    },
    {
      "identifier": "DrugCentral:52",
      "label": "paracetamol"
    },
    {
      "identifier": "GTOPDB:5239",
      "label": "paracetamol"
    },
    {
      "identifier": "HMDB:HMDB0001859",
      "label": "Acetaminophen"
    },
    {
      "identifier": "KEGG.COMPOUND:C06804",
      "label": "Acetaminophen"
    },
    {
      "identifier": "INCHIKEY:RZVAJINKPMORJF-UHFFFAOYSA-N"
    }
  ],
  "equivalent_names": [
    "Acetaminophen",
    "ACETAMINOPHEN",
    "ACETAMINOPHEN",
    "paracetamol",
    "Acetaminophen",
    "paracetamol",
    "paracetamol",
    "Acetaminophen",
    "Acetaminophen"
  ],
  "preferred_curie": "PUBCHEM.COMPOUND:1983",
  "preferred_curie_name": "Acetaminophen",
  "status": "OK",
  "type": "biolink:SmallMolecule"
}

edeutsch · 2021-07-30T05:10:39Z

hmm, okay, definitely seems like a bug. do you use those fields for anything?

amykglen · 2021-07-30T16:27:24Z

no, I don't use them. so unless it's indicative of a larger issue, not a problem for the KG2c build.

amykglen · 2021-08-04T15:16:17Z

so I think we're at a point now where @chunyuma can start building COHD/DTD databases from KG2.7.1. all necessary files should be on arax.ncats.io at: /data/orangeboard/databases/KG2.7.1

I'm going to work next on loading KG2.7.1c into Plover (may require a little bit of tweaking to remove mixins from the expanded_categories property)

edeutsch · 2021-08-04T16:04:20Z

I suppose this is an important point that we need resolve before we move forward: what do we do about these pesky mixins?

edeutsch · 2021-08-04T16:54:20Z

At the moment the category_manager skips biolink:Entity. But I think it is true that all mixins come after this in the list. So it would be an easy tweak to STOP processing at biolink:Entity. This would exclude biolink:Entity and mixins I think.

Do we want to do that?

edeutsch · 2021-08-04T17:42:23Z

ah, but now Chris has responded that we want mixins in there. From Slack:

Yes, I think we want to include mixins in ancestors / descendents. One of the main reasons to have these mixins is to make querying easy by grouping similar concepts that are not direct ancestors in the model. That relies on applying ancestor logic to mixins....

chunyuma · 2021-08-04T22:43:03Z

Hi @finnagin, @dkoslicki and @jaredroach,

To build the COHD database and DTD model/database for kg2.7.1c based on the Biolink Model 2.1, I plan to use the biolink:Drug and biolink:SmallMolecule to replace the original bilink:Drug , biolink:ChemicalSubstance and biolink:Metabolite used in kg2.6.7.1 as drug. Based on this, I simply compared the total number of drug nodes in kg2.7.1 (which is 2039306) and the total number of drug nodes used in kg2.6.7.1 (which is 2010011 ). Using only those two classes even have more drug nodes. So any objection for only treating biolink:Drug and biolink:SmallMolecule as drug` for COHD and DTD?

amykglen · 2021-08-05T02:10:19Z

well surprisingly it worked to just load KG2c into Plover as is! (with mixins included as labels.) it did increase baseline memory usage by a decent amount, but not enough for it to cause a problem I think.

down the road I may tweak the code a bit to decrease its memory usage, but it seems to be totally fine for now.

edeutsch · 2021-08-05T02:36:15Z

Terrific! It looks like @chunyuma has several to dos in the list, but otherwise, do we just sit tight until you're back and then make the switch?

chunyuma · 2021-08-05T02:40:37Z

I've already started building COHD and DTD now. Both of them should be able to complete next week except for DTD probability precomputed database which might need more time.

amykglen · 2021-08-05T03:31:51Z

yeah, I think that sounds fine to wait to roll it out until I'm back - but in the meantime other codeowners could go in and fix any usages of biolink:ChemicalSubstance in their code/tests (in the kg2integration branch). (I created a couple subtasks for this above.)

and if anyone wants to test their changes, they just need to:

download the config_local.json from arax.ncats.io:/translator/data/orangeboard/databases/KG2.7.1/config_local.json (put it into your local RTX/code/ directory)
locally change this line in Expand to say force_local = True
run the pytest suite (from the kg2integration branch)

…mallmolecule in Expand/Overlay

chunyuma · 2021-08-11T00:52:46Z

Hi @edeutsch and @amykglen, do we have some synonymizer functions or utility functions that can return all children of a category in kg2.7.1? Thanks!

amykglen · 2021-08-11T01:32:01Z

I just messaged Chunyu about this in slack but posting here as well so others are aware it was addressed: the CategoryManager may work to find category ancestors (see this function) or the Biolink Lookup Service could be used to find category descendants (curl -X GET "https://bl-lookup-sri.renci.org/bl/ChemicalEntity/descendants?version=2.1.0" -H "accept: application/json" - although you may want to cache answers in this case).

finnagin · 2021-08-11T01:52:33Z

Do we want to try to implement a local way of getting descendents? I'm asking because we currently we cannot run the DTD tests on Travis because of it's lack of a cache for when it hits the SRI and I'm worried having SRI calls for cohd too might make those tests impossible to run as well.

amykglen · 2021-08-11T02:39:54Z

actually, I just remembered I already have a local method for category descendants in the KPSelector, so you could use that if you want, @chunyuma. you could create a KPSelector object and call this method:

RTX/code/ARAX/ARAXQuery/Expand/kp_selector.py

Line 300 in 8c95253

    
           def _get_category_descendants(self, categories: Optional[List[str]]) -> Set[str]:

chunyuma · 2021-08-11T03:19:24Z

I think all tests associated with dtd/chp/chod pass now in the kg2integration branch

amykglen · 2021-08-11T15:06:03Z

great. for some reason the CHP tests are still failing for me in the kg2integration branch (after pulling):

test_ARAX_expand.py::test_chp_expand_1 FAILED                                                                                           [ 12%]
test_ARAX_expand.py::test_chp_expand_2 FAILED                                                                                           [ 12%]

the errors are about connection reset:

  - 2021-08-11T08:01:57.344012 DEBUG: [] Prefixes CHP supports for ['biolink:Gene', 'biolink:Protein'] are: {'ENSEMBL'}
  - 2021-08-11T08:01:57.346041 DEBUG: [] CHP: Converted n00's 3 curies to a list of 3 curies with prefixes CHP supports
  - 2021-08-11T08:02:07.708723 ERROR: [UncaughtError] An uncaught error was thrown while trying to Expand using CHP. Error was: Traceback (most recent call last):
  File "/Users/amyglen/.pyenv/versions/3.7.8/envs/arax/lib/python3.7/site-packages/urllib3/connectionpool.py", line 677, in urlopen
    chunked=chunked,
  File "/Users/amyglen/.pyenv/versions/3.7.8/envs/arax/lib/python3.7/site-packages/urllib3/connectionpool.py", line 426, in _make_request
    six.raise_from(e, None)
  File "<string>", line 3, in raise_from
  File "/Users/amyglen/.pyenv/versions/3.7.8/envs/arax/lib/python3.7/site-packages/urllib3/connectionpool.py", line 421, in _make_request
    httplib_response = conn.getresponse()
  File "/Users/amyglen/.pyenv/versions/3.7.8/lib/python3.7/http/client.py", line 1354, in getresponse
    response.begin()
  File "/Users/amyglen/.pyenv/versions/3.7.8/lib/python3.7/http/client.py", line 306, in begin
    version, status, reason = self._read_status()
  File "/Users/amyglen/.pyenv/versions/3.7.8/lib/python3.7/http/client.py", line 267, in _read_status
    line = str(self.fp.readline(_MAXLINE + 1), "iso-8859-1")
  File "/Users/amyglen/.pyenv/versions/3.7.8/lib/python3.7/socket.py", line 589, in readinto
    return self._sock.recv_into(b)
ConnectionResetError: [Errno 54] Connection reset by peer

are you seeing this as well, @chunyuma?

chunyuma · 2021-08-11T16:33:48Z

It seems like this is a CHP API problem. I will figure it out today.

chunyuma · 2021-08-11T17:26:36Z

@amykglen, I met a similar problem like you got when I tested test_chp_expand_1 :

Traceback (most recent call last):
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/site-packages/urllib3/connectionpool.py", line 706, in urlopen
    chunked=chunked,
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/site-packages/urllib3/connectionpool.py", line 394, in _make_request
    conn.request(method, url, **httplib_request_kw)
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/site-packages/urllib3/connection.py", line 234, in request
    super(HTTPConnection, self).request(method, url, body=body, headers=headers)
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/http/client.py", line 1252, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/http/client.py", line 1298, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/http/client.py", line 1247, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/http/client.py", line 1026, in _send_output
    self.send(msg)
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/http/client.py", line 966, in send
    self.connect()
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/site-packages/urllib3/connection.py", line 200, in connect
    conn = self._new_conn()
  File "/home/cqm5886/anaconda3/envs/RTX_env/lib/python3.7/site-packages/urllib3/connection.py", line 182, in _new_conn
    self, "Failed to establish a new connection: %s" % e
urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7fc6259aac90>: Failed to establish a new connection: [Errno 110] Connection timed out

I think the problem might come from chp team's server. Right now, when I called their APIs, I met Connection timed out by running r = requests.get('http://chp.thayer.dartmouth.edu/predicates/').

chunyuma · 2021-08-11T21:56:22Z

Hi @yakaboskic, could you please help us take a look if there is something wrong with CHP APIs? Currently, we can't call the chp server by using r = requests.get('http://chp.thayer.dartmouth.edu/predicates/'). Thanks!

yakaboskic · 2021-08-12T01:32:34Z

Hi @chunyuma, so I am currently working on a server rebuild right now, but should be back up tonight. However, we have depreciated the predicates endpoint.

chunyuma · 2021-08-12T02:02:17Z

Thanks for reply @yakaboskic. Please let us know when it is back. Thanks again!

finnagin · 2021-08-12T07:07:31Z

I've created a pull request for the new predicate tuples in the NCATS testing repo. @dkoslicki when you get a chance could you approve and merge it?

yakaboskic · 2021-08-12T12:11:41Z

Hi @chunyuma, CHP server is back up as of around 12AM EST last night.

chunyuma · 2021-08-12T13:10:01Z

Thanks @yakaboskic!

chunyuma · 2021-08-12T23:55:10Z

Hi @yakaboskic, I tried calling the CHP APIs via r = requests.post('http://chp.thayer.dartmouth.edu/query/', json=query) for the following query:

{'message': {'query_graph': {'nodes': {'n0': {'ids': ['MONDO:0007254'],
     'categories': ['biolink:Disease'],
     'constraints': []},
    'n1': {'ids': ['ENSEMBL:ENSG00000162419'],
     'categories': ['biolink:Gene'],
     'constraints': []},
    'n2': {'ids': ['CHEMBL.COMPOUND:CHEMBL88'],
     'categories': ['biolink:Drug'],
     'constraints': []},
    'n3': {'ids': ['EFO:0000714'],
     'categories': ['biolink:PhenotypicFeature'],
     'constraints': []}},
   'edges': {'e0': {'predicates': ['biolink:gene_associated_with_condition'],
     'relation': None,
     'subject': 'n1',
     'object': 'n0',
     'constraints': []},
    'e1': {'predicates': ['biolink:treats'],
     'relation': None,
     'subject': 'n2',
     'object': 'n0',
     'constraints': []},
    'e2': {'predicates': ['biolink:has_phenotype'],
     'relation': None,
     'subject': 'n0',
     'object': 'n3',
     'constraints': [{'name': 'survival_time',
       'id': 'EFO:0000714',
       'operator': '>',
       'value': 500.0,
       'unit_id': None,
       'unit_name': None,
       'not': False}]}}},
  'knowledge_graph': {'nodes': {}, 'edges': {}},
  'results': []},
 'max_results': 10,
 'trapi_version': '1.1',
 'biolink_version': None}

But I got a warning message saying 'Passed category for n2: biolink:Drug, did not match our preferred category biolink:ChemicalSubstance for this curie. Going with our preferred category.' and the returned status is Bad request. See description. with description Problem during interface setup. No CHP core supported queries where found in passed query. Could you please help me see how to solve this issue? I think CHEMBL.COMPOUND:CHEMBL88 belongs to biolink:SmallMolecule based on biolink model 2.1 so it should be assigned to the drug, right?

I generated the standard query by using the following function:

    def _build_standard_query(
            gene=None,
            drug=None,
            outcome=None,
            outcome_name=None,
            outcome_op=None,
            outcome_value=None,
            disease=None,
            trapi_version='1.1',
            ):

        query = "{'message': {'query_graph': {'nodes': {'n0': {'ids': ['" + disease + "'], 'categories': ['biolink:Disease'], 'constraints': []}, 'n1': {'ids': ['" + gene + "'], 'categories': ['biolink:Gene'], 'constraints': []}, 'n2': {'ids': ['" + drug + "'], 'categories': ['biolink:Drug'], 'constraints': []}, 'n3': {'ids': ['" + outcome + "'], 'categories': ['biolink:PhenotypicFeature'], 'constraints': []}}, 'edges': {'e0': {'predicates': ['biolink:gene_associated_with_condition'], 'relation': None, 'subject': 'n1', 'object': 'n0', 'constraints': []}, 'e1': {'predicates': ['biolink:treats'], 'relation': None, 'subject': 'n2', 'object': 'n0', 'constraints': []}, 'e2': {'predicates': ['biolink:has_phenotype'], 'relation': None, 'subject': 'n0', 'object': 'n3', 'constraints': [{'name': '" + outcome_name + "', 'id': '" + outcome + "', 'operator': '" + outcome_op + "', 'value': " + str(outcome_value) + ", 'unit_id': None, 'unit_name': None, 'not': False}]}}}, 'knowledge_graph': {'nodes': {}, 'edges': {}}, 'results': []}, 'max_results': 10, 'trapi_version': '" + trapi_version + "', 'biolink_version': None}"

        return eval(query)

yakaboskic · 2021-08-13T03:24:14Z

Hi @chunyuma! Oh no! So... we actually turned off support for our weird muli-hop query structure and have transitioned everyone (we thought) to a fully one hop query structure in order to hopefully help teams better integrate with us. I am sorry that this was not communicated!!

So in light of that function you used to build those standard queries, I have made an equivalent python script to build equivalent one hop queries that will answer the same question as above.

Basically the change is that we have Gene to Disease, Drug to Disease, Gene to Drug, Drug to Gene, and Gene to Gene edges, and you can specify what we call a predicate proxy (EFO for survival) and predicate context (i.e. more gene or drug curies) but you don't have to specify this if you don't want to (we default). Here is the equivalent build script and I have checked and it works. Here is the code below (and also attached as a txt file, can't upload python here).
chp_onehop_simple_build_script.py.txt

Please let me know if you have any questions/concerns/issues! And many apologies for any unexpected work this may have caused!

import requests
import json

def _build_standard_query(
        gene=None,
        drug=None,
        outcome=None,
        outcome_name=None,
        outcome_op=None,
        outcome_value=None,
        disease=None,
        trapi_version='1.1',
        ):

    """Two options, both are equivalent in terms of CHP analysis.
    """
    # Option 1
    query_1 = {
            'message': { 
                'query_graph': {
                    'nodes': {
                        'n0': {
                            'ids': [gene], 
                            'categories': ['biolink:Gene'],
                            'constraints': []}, 
                        'n1': {
                            'ids': [disease],
                            'categories': ['biolink:Disease'],
                            'constraints': []
                            },
                        }, 
                    'edges': {
                        'e0': {
                            'predicates': ['biolink:gene_associated_with_condition'],
                            'relation': None,
                            'subject': 'n0',
                            'object': 'n1',
                            "constraints": [
                                {
                                    "id": "CHP:PredicateProxy",
                                    "not": False,
                                    "name": "predicate_proxy",
                                    "value": [
                                        outcome
                                    ],
                                    "unit_id": None,
                                    "operator": "==",
                                    "unit_name": None
                                    },
                                {
                                    "id": outcome,
                                    "not": False,
                                    "name": outcome_name,
                                    "value": outcome_value,
                                    "unit_id": None,
                                    "operator": outcome_op,
                                    "unit_name": None
                                    },
                                {
                                    "id": "CHP:PredicateContext",
                                    "not": False,
                                    "name": "predicate_context",
                                    "value": [
                                        "drug"
                                    ],
                                    "unit_id": None,
                                    "operator": "==",
                                    "unit_name": None
                                    },
                                {
                                    "id": "drug",
                                    "not": False,
                                    "name": "drug",
                                    "value": [
                                        drug
                                    ],
                                    "unit_id": None,
                                    "operator": "matches",
                                    "unit_name": None
                                    }
                                ]     
                            }
                        },
                    },
                'knowledge_graph': {
                    'nodes': {},
                    'edges': {}
                    }, 
                'results': []
                }, 
                'max_results': 10, 
                'trapi_version': trapi_version, 
                'biolink_version': None
                }

    # Option 2
    query_2 = {
            'message': { 
                'query_graph': {
                    'nodes': {
                        'n0': {
                            'ids': [drug], 
                            'categories': ['biolink:SmallMolecule'],
                            'constraints': []
                            }, 
                        'n1': {
                            'ids': [disease],
                            'categories': ['biolink:Disease'],
                            'constraints': []
                            },
                        },
                    'edges': {
                        'e0': {
                            'predicates': ['biolink:treats'],
                            'relation': None,
                            'subject': 'n0',
                            'object': 'n1',
                            "constraints": [
                                {
                                    "id": "CHP:PredicateProxy",
                                    "not": False,
                                    "name": "predicate_proxy",
                                    "value": [
                                        outcome
                                    ],
                                    "unit_id": None,
                                    "operator": "==",
                                    "unit_name": None
                                    },
                                {
                                    "id": outcome,
                                    "not": False,
                                    "name": outcome_name,
                                    "value": outcome_value,
                                    "unit_id": None,
                                    "operator": outcome_op,
                                    "unit_name": None
                                    },
                                {
                                    "id": "CHP:PredicateContext",
                                    "not": False,
                                    "name": "predicate_context",
                                    "value": [
                                        "gene"
                                    ],
                                    "unit_id": None,
                                    "operator": "==",
                                    "unit_name": None
                                    },
                                {
                                    "id": "gene",
                                    "not": False,
                                    "name": "gene",
                                    "value": [
                                        gene
                                    ],
                                    "unit_id": None,
                                    "operator": "matches",
                                    "unit_name": None
                                    }
                                ]     
                            }
                        },
                    },
                'knowledge_graph': {
                    'nodes': {},
                    'edges': {}
                    }, 
                'results': []
                }, 
                'max_results': 10, 
                'trapi_version': trapi_version, 
                'biolink_version': None
                }
        
    r1 = requests.post('http://chp.thayer.dartmouth.edu/query/', json=query_1)
    r2 = requests.post('http://chp.thayer.dartmouth.edu/query/', json=query_2)
    return r1, r2

if __name__ == '__main__':
    r1, r2 = _build_standard_query(
            gene='ENSEMBL:ENSG00000162419',
            drug='CHEMBL.COMPOUND:CHEMBL88',
            outcome='EFO:0000714',
            outcome_name='EFO:0000714',
            outcome_op=">",
            outcome_value=500,
            disease='MONDO:0007254',
            trapi_version='1.1',
            )
    print(json.dumps(r1.json(), indent=2))
    print(json.dumps(r2.json(), indent=2))

chunyuma · 2021-08-13T16:19:57Z

Thanks @yakaboskic! I really appreciate your help for figuring out the issue! I will have a try later today based on your code and suggestions.

chunyuma · 2021-08-13T21:55:34Z

@amykglen, thanks to the help of @yakaboskic, both chp tests in test_ARAX_expand.py pass now.

test_ARAX_expand.py::test_chp_expand_1 PASSED                                                                                                        [ 50%]
test_ARAX_expand.py::test_chp_expand_2 PASSED                                                                                                        [100%]

amykglen · 2021-08-13T22:37:29Z

awesome, thanks everyone!

I guess after the full DTD rebuild is done you can go ahead and close this issue, @chunyuma.

…estors

chunyuma · 2021-08-13T22:46:45Z

Also, thanks @amykglen for developing the biolinkHelper module to access the biolink model information, I replaced the original CategoryManger with BiolinkHelper and it works well.

I guess after the full DTD rebuild is done you can go ahead and close this issue, @chunyuma.
I will complete it as soon as possible.

edeutsch added a commit that referenced this issue Jul 28, 2021

update to use SRI dev endpoint for Biolink 2.1 #1593

34c34e4

edeutsch added a commit that referenced this issue Jul 28, 2021

update to BioLink 2.0 #1593

0c2e9eb

amykglen added a commit that referenced this issue Jul 29, 2021

Get rid of ChemicalSubstance conflation in KG2c build #1593

115b984

edeutsch self-assigned this Jul 29, 2021

amykglen assigned amykglen, acevedol, finnagin and chunyuma Jul 29, 2021

amykglen added the kg2 rollout label Aug 2, 2021

amykglen added a commit that referenced this issue Aug 4, 2021

Remove ChemicalSubstance/Drug conflations from Expand #1593

ae1ff37

amykglen added a commit that referenced this issue Aug 4, 2021

Update Biolink version used in Expand #1593

5da6ece

amykglen added a commit that referenced this issue Aug 4, 2021

Update other hard-coded ChemicalSubstances in Expand #1593

651e4ad

amykglen added a commit that referenced this issue Aug 4, 2021

Change ChemicalSubstance -> ChemicalEntity in tests #1593

1734a6d

amykglen added a commit that referenced this issue Aug 5, 2021

Adjust biolink:Drug usages in Expand tests #1593

2e30f28

chunyuma added a commit that referenced this issue Aug 11, 2021

#1593 change all code of category detection of chemicalsubstance to s…

5f561ad

…mallmolecule in Expand/Overlay

chunyuma added a commit that referenced this issue Aug 11, 2021

update scripts to build DTD database for kg2.7.1 #1593

8c95253

chunyuma added a commit that referenced this issue Aug 11, 2021

#1593 modify dtd/chp/chod scripts to comply the categories of kg2.7.1

29df9b6

amykglen added a commit that referenced this issue Aug 11, 2021

Change ChemicalSubstance -> ChemicalEntity (various places) #1593

4d898c3

chunyuma added a commit that referenced this issue Aug 11, 2021

#1593 update scripts of generating slim databases for kg2.7.1

8b97a09

amykglen added a commit that referenced this issue Aug 11, 2021

Merge remote-tracking branch 'origin/kg2integration' #1593

c6cf3f1

finnagin added a commit that referenced this issue Aug 12, 2021

#1593, generate KG2.7.1 predicate tuples

849f63d

chunyuma added a commit that referenced this issue Aug 13, 2021

#1593 fixed bugs in CHP_querier.py

7639e28

chunyuma added a commit that referenced this issue Aug 13, 2021

#1593 replace CategoryManager with BiolinkHelper module to access anc…

5315c9b

…estors

amykglen closed this as completed Sep 2, 2021

Upgrade to Biolink 2.1 (KG2.7.1) - due Aug. 11 #1593

Upgrade to Biolink 2.1 (KG2.7.1) - due Aug. 11 #1593

Comments

amykglen commented Jul 28, 2021 • edited Loading

amykglen commented Jul 29, 2021

edeutsch commented Jul 29, 2021

amykglen commented Jul 29, 2021

edeutsch commented Jul 29, 2021

amykglen commented Jul 29, 2021

edeutsch commented Jul 30, 2021

amykglen commented Jul 30, 2021

amykglen commented Aug 4, 2021

edeutsch commented Aug 4, 2021

edeutsch commented Aug 4, 2021

edeutsch commented Aug 4, 2021

chunyuma commented Aug 4, 2021

amykglen commented Aug 5, 2021 • edited Loading

edeutsch commented Aug 5, 2021

chunyuma commented Aug 5, 2021

amykglen commented Aug 5, 2021 • edited Loading

chunyuma commented Aug 11, 2021

amykglen commented Aug 11, 2021

finnagin commented Aug 11, 2021

amykglen commented Aug 11, 2021

chunyuma commented Aug 11, 2021

amykglen commented Aug 11, 2021

chunyuma commented Aug 11, 2021 • edited Loading

chunyuma commented Aug 11, 2021

chunyuma commented Aug 11, 2021

yakaboskic commented Aug 12, 2021

chunyuma commented Aug 12, 2021

finnagin commented Aug 12, 2021

yakaboskic commented Aug 12, 2021

chunyuma commented Aug 12, 2021

chunyuma commented Aug 12, 2021

yakaboskic commented Aug 13, 2021 • edited Loading

chunyuma commented Aug 13, 2021

chunyuma commented Aug 13, 2021

amykglen commented Aug 13, 2021

chunyuma commented Aug 13, 2021

amykglen commented Jul 28, 2021 •

edited

Loading

amykglen commented Aug 5, 2021 •

edited

Loading

amykglen commented Aug 5, 2021 •

edited

Loading

chunyuma commented Aug 11, 2021 •

edited

Loading

yakaboskic commented Aug 13, 2021 •

edited

Loading