Replace test_data_quality_at_scale.ipynb #208
Updated the dataset (Amazon product reviews replaced with synthetic data) and added a couple of new examples.
Replaced Amazon Reviews with a synthetically generated reviews dataset; added declaration of the SPARK version
updated tutorials to use a new dataset
@@ -6,14 +6,115 @@
"source": [
I don't see a reason the same code won't work with pydeequ 1.4.0/Spark 3.5. I think it might be fine to insert something like "Tested on pydeequ 1.2.0/Spark 3.3. Code should run on all supported pydeequ versions".
@@ -6,14 +6,115 @@
"source": [
Line #2. os.environ["SPARK_VERSION"] = '3.3'
Maybe add a comment that mentions setting this to 3.5 if one uses pydeequ 1.4.0.
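A minimal sketch of how the suggested comment could look in the notebook cell. The version pairings (pydeequ 1.2.0 with Spark 3.3, pydeequ 1.4.0 with Spark 3.5) are taken from the review comments here. The placement before the pydeequ import is an assumption, based on pydeequ being understood to read `SPARK_VERSION` when it is imported.

```python
import os

# Tested on pydeequ 1.2.0 / Spark 3.3 (per the review discussion).
# If you use pydeequ 1.4.0 with Spark 3.5, set this to "3.5" instead.
# Assumption: this must run before `import pydeequ`, since pydeequ is
# understood to look up SPARK_VERSION at import time.
os.environ["SPARK_VERSION"] = "3.3"

# import pydeequ  # import only after SPARK_VERSION has been set
```

Environment variable values are always strings, so the version stays quoted rather than being written as a float.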
Minor comment - I can approve when addressed.
Created a new pull request #230 to address the comments above.
…addressed PR comments awslabs#208 and awslabs#230
issue #207 issue #209
Description of changes:
Two updates:
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.