Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue #1 : Improve documentation for Spark session creation (Databricks etc.) #85

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

rwitzel
Copy link

@rwitzel rwitzel commented Nov 16, 2021

Issue #1

Why?

The many comments in #1 suggest that users overlook the required installation of JARs when installing Deequ.

What?

Now the documentation clarifies the importance of installing the JARs.

…not be created as described in the user's specific Spark environment. Because the comments in awslabs#1 indicate that the current documentation does not emphasises enough the importance of having the JARs installed.
@arpheno
Copy link

arpheno commented Jan 7, 2022

Looks good to me

@bakintunde
Copy link

Perhaps include more description about the type of error obtained which requires this installation of the jar file.
Also, that both pydeequ package from pypi and the jar file from maven central need to be installed.

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants