What does this PR do?
This pull request adds much-needed details to the Spark Operator page.
- `cd ...` commands now use an environment variable called `DOEKS_HOME` that ensures copy and paste always works in every example.
- `sed` commands are provided for replacing the `S3_BUCKET` placeholders in scripts and Spark application YAML manifests.
Some quality of life changes as well.
- Using Docusaurus partials for paragraphs that are repeated throughout examples.
- Move large YAML examples out of the single markdown file into their own partials.
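As a rough illustration of the `DOEKS_HOME` and `sed` changes described above, the pattern could look something like the sketch below. The directory layout, the file name `spark-app.yaml`, the bucket name, and the `<S3_BUCKET>` placeholder syntax are all assumptions made for this example; the actual paths and placeholder format are the ones in the updated docs page.

```shell
# Sketch only: in real use DOEKS_HOME would point at your clone of the repo.
# The temp-dir fallback just keeps this example self-contained.
export DOEKS_HOME="${DOEKS_HOME:-$(mktemp -d)}"
# Hypothetical bucket name; substitute your own.
export S3_BUCKET="${S3_BUCKET:-my-example-bucket}"

# Stand-in manifest containing an assumed placeholder format.
mkdir -p "${DOEKS_HOME}/examples"
cat > "${DOEKS_HOME}/examples/spark-app.yaml" <<'EOF'
mainApplicationFile: "s3a://<S3_BUCKET>/scripts/job.py"
EOF

# Every step starts from an absolute path, so it can be pasted from any directory.
cd "${DOEKS_HOME}/examples"

# Replace the placeholder in place; -i.bak works with both GNU and BSD sed.
sed -i.bak "s|<S3_BUCKET>|${S3_BUCKET}|g" spark-app.yaml
```

Anchoring each `cd` on an absolute path means a step does not depend on where the previous step left the shell, which is what makes copy and paste reliable.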
Motivation
The Spark on EKS examples are currently very terse and assume the user already knows a lot ahead of time. Additionally, most examples were simply incorrect (the taxi script always needs input, and the benchmark files didn't exist).
The changes can be previewed here (note that navigation is wonky in this temporary deployment):
https://d2gd59uo3ya1kt.cloudfront.net/data-on-eks/docs/blueprints/data-analytics/spark-operator-yunikorn.html
More
- `website/docs` or `website/blog` section for this feature
- `pre-commit run -a` with this PR. Link for installing pre-commit locally
For Moderators
Additional Notes