Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DJORNL parser improvements #34

Open
3 of 5 tasks
ialarmedalien opened this issue Sep 11, 2020 · 1 comment
Open
3 of 5 tasks

DJORNL parser improvements #34

ialarmedalien opened this issue Sep 11, 2020 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@ialarmedalien
Copy link
Collaborator

ialarmedalien commented Sep 11, 2020

A few improvements for the parser:

  • cluster data could be parsed and merged directly into the node data, rather than uploading all the node data and then doing the cluster data
  • check for additional (invalid) headers
  • the 'GO description' field is currently not populated correctly (due to problems in the input data); it could either be generated by pulling in the GO term names from the existing RE db (i.e. have a copy of the GO term data in the DJORNL node document), or the search could be to query over the joined DJORNL and GO tables.
  • add a 'validate' function to parse the data and validate it, without uploading it.
  • add a README in the djornl directory with information specific to the dataset.
@ialarmedalien ialarmedalien added the enhancement New feature or request label Sep 11, 2020
@jayrbolton
Copy link
Contributor

Closed by #47 and/or #48 and/or #49 and/or #50?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants