Changes to support CDM data import #1

jeff-cohere · 2024-06-14T16:09:47Z

This PR pulls in changes made to support our ad-hoc ("heroic?") import of data into CDM.

This is a pretty new repo, so I don't have anything CI-ish in place yet. @ialarmedalien , we were talking about getting dtspy "up to spec" with some GitHub Actions, style checks, and what-not. Maybe we can use this PR to get things in shape. Please feel free to invite whatever other reviewer(s) you think would be good representatives of the KBase Python Way.

Right now, we just use a basic pip -r requirements.txt install process. I'd prefer not to wander into the bog of what passes for Python package management tools, since they're all horrible. :-)

Update

I implemented some unit tests, though only for search and metadata features (for now).

.github/workflows/tests.yml

ialarmedalien · 2024-08-13T15:37:31Z

.github/workflows/tests.yml

+      - name: Running tests (${{ matrix.os }})
+        run: coverage run -m unittest discover
+        env:
+          DTS_KBASE_DEV_TOKEN: ${{ secrets.DTS_KBASE_DEV_TOKEN }}


there should already be some kbase dev tokens in the environment if you check in the settings for this repo.

I tried KBASE_CI_TOKEN, but it doesn't work. The GitHub documentation is entering the Enterprise Software Heat Death stage of its lifecycle, so it's hard to figure out the difference between org secrets and repo secrets, or even whether anyone at GitHub knows what the difference is supposed to be. So I'm reverting my change here for the time being. I'll revisit when we have this other stuff sorted.

dts/client.py

ialarmedalien · 2024-08-13T16:08:08Z

dts/client.py

        """
-`client.deleteTransfer(id) -> None
+`client.cancel_transfer(id) -> None


it's probably worth using one of the recognised python documentation formats -- depending on what IDE you use, you may be able to automatically generate documentation skeletons for functions and if you add a linter, you won't get a bunch of errors because the docs don't follow pydocstyle.

Yeah, I've been hesitant to strap myself all the way into Python dev mode, but it's probably a good time to address some of this stuff. I don't use an IDE, though, because I find they make it easier to produce mountains of rubbish with almost no effort.

"one of the recognised python documentation formats" gets at the key issue -- there's 10M ways to do everything in Python. In this it takes after my other "favorite" language, C++, in which everyone strives to prove how clever they are by trotting out esoterically cute ways of doing trivial things that don't pertain to any other language.

FWIW I use the sphinx documentation style, and ruff for python code linting/formatting. I use VSCode for working with python -- there's an extension for generating python documentation for functions and a nice ruff integration, so you can handle all that stuff whilst you're working on the code. I'm sure that there are also emacs / vi / etc. integrations for ruff, too.

ialarmedalien · 2024-08-13T16:10:12Z

test/test_client.py

+import os
+import unittest
+
+class TestClient(unittest.TestCase):


I would switch to using pytest for tests -- it is a lot more flexible that unittest as you can easily parametrise your tests (run the same test on a bunch of different inputs) and the test output reporting is much more helpful.

Ah, yes, one of the 10M testing packages. Thanks for the recommendation!

I made an issue for this.

.github/workflows/tests.yml

dts/client.py

ialarmedalien · 2024-08-14T17:11:00Z

test/test_client.py

+        self.assertTrue(client.uri)
+        self.assertTrue(client.name)
+        self.assertTrue(client.version)


these aren't really worthwhile tests -- maybe they can be improved when you move to pytest, but I would expect the tests to check the values of these attributes, not just whether they are truthy or falsy.

Yeah, I can definitely address these when we move to pytest.

pytest uses assert so these self.assertTrue and self.assertFalse statements would turn into something like:

assert client.uri == "some_client_uri" assert client.name assert isinstance(client.version, str) assert len(client.name) > 15

jeff-cohere added 4 commits June 5, 2024 12:40

Adding database-specific search parameters.

49d030f

Some minor fixes.

535bb06

Changes supporting DTS -> CDM import

22729fc

Removing erroneous extra transfer.

6e2afad

jeff-cohere added enhancement New feature or request help wanted Extra attention is needed testing labels Jun 14, 2024

jeff-cohere requested a review from ialarmedalien June 14, 2024 16:09

jeff-cohere added 8 commits June 20, 2024 09:35

Repairing some broken calls.

8619d45

Added timeout parameter to transfer method.

45f0647

Trimming an unneeded parameter.

844da7d

Added unit tests and a related GitHub Action.

ae0c23d

Added coverage to list of requirements.

9808e70

Trying to work around a GitHub Actions issue.

c9dac93

Replacing os.environ with os.getenv.

aa31122

Putting dev token in place.

2d2d3b8

jeff-cohere mentioned this pull request Aug 6, 2024

Provided file metadata access. #4

Merged

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

.github/workflows/tests.yml Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

.github/workflows/tests.yml Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

Responding to PR feedback.

200641b

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

Reverting CI token, since it doesn't work as is.

fa65fdb

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

Addressing several PR comments.

f02f4a3

jeff-cohere mentioned this pull request Aug 13, 2024

Select a supported Python documentation format and a linter. #7

Open

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

.github/workflows/tests.yml Show resolved Hide resolved

ialarmedalien reviewed Aug 13, 2024

View reviewed changes

.github/workflows/tests.yml Outdated Show resolved Hide resolved

jeff-cohere added 2 commits August 13, 2024 09:35

Treating arguments a bit more carefully.

1b913b8

Some touchups to the GitHub Actions workflow.

e036c5f

jeff-cohere requested a review from ialarmedalien August 13, 2024 16:41

Reinstating pull-request branch logic in workflow.

0cbac24

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

dts/client.py Outdated Show resolved Hide resolved

More fixes from PR feedback.

48615fe

ialarmedalien reviewed Aug 14, 2024

View reviewed changes

Number queries are now converted to strings.

ee92b7f

ialarmedalien approved these changes Aug 14, 2024

View reviewed changes

jeff-cohere merged commit db00daa into main Aug 14, 2024
2 checks passed

jeff-cohere deleted the db-specific-search branch August 14, 2024 17:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changes to support CDM data import #1

Changes to support CDM data import #1

jeff-cohere commented Jun 14, 2024 •

edited

Loading

ialarmedalien Aug 13, 2024

jeff-cohere Aug 13, 2024

ialarmedalien Aug 13, 2024

jeff-cohere Aug 13, 2024

jeff-cohere Aug 13, 2024

ialarmedalien Aug 13, 2024 •

edited

Loading

ialarmedalien Aug 13, 2024

jeff-cohere Aug 13, 2024

jeff-cohere Aug 13, 2024

ialarmedalien Aug 14, 2024

jeff-cohere Aug 14, 2024

ialarmedalien Aug 14, 2024 •

edited

Loading

Changes to support CDM data import #1

Changes to support CDM data import #1

Conversation

jeff-cohere commented Jun 14, 2024 • edited Loading

Update

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ialarmedalien Aug 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ialarmedalien Aug 14, 2024 • edited Loading

Choose a reason for hiding this comment

jeff-cohere commented Jun 14, 2024 •

edited

Loading

ialarmedalien Aug 13, 2024 •

edited

Loading

ialarmedalien Aug 14, 2024 •

edited

Loading