Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation improvement for Version Control Multi-Bucket Pipeline demo #217

Merged
merged 1 commit into from
Jul 23, 2024
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
59 changes: 14 additions & 45 deletions 00_notebooks/version-control-of-multi-buckets-pipelines.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,9 @@
"\n",
"In data engineering pipelines, it is common to have distinct buckets that serve different purposes. These buckets are typically named and categorized based on their respective stages in the data processing pipeline.\n",
"\n",
"When implementing lakeFS, it may be necessary to maintain separate physical buckets for each stage. However, it is important to version control all changes made to each bucket and link between different versions to track the evolution of the data through the pipeline."
"When implementing lakeFS, it may be necessary to maintain separate physical buckets for each stage. However, it is important to version control all changes made to each bucket and link between different versions to track the evolution of the data through the pipeline.\n",
"\n",
"To achieve this, lakeFS enables you to create and manage repositories for each bucket stage, ensuring all changes are version-controlled. In this demo, you will learn how to set up these repositories, merge branches, and link different versions to seamlessly promote data through the pipeline stages. This approach offers a robust version control system that ensures comprehensive data lineage and auditing of transformations across all stages of the pipeline."
]
},
{
Expand Down Expand Up @@ -45,10 +47,7 @@
"execution_count": null,
"id": "835f510a",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand All @@ -70,10 +69,7 @@
"execution_count": null,
"id": "871bdea3",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand Down Expand Up @@ -207,10 +203,7 @@
"execution_count": null,
"id": "7d4a3a20",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand All @@ -236,10 +229,7 @@
"execution_count": null,
"id": "8857019f",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand Down Expand Up @@ -289,10 +279,7 @@
"execution_count": null,
"id": "e6c13379-da71-4eab-9ca0-d3b301ab8249",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand All @@ -312,10 +299,7 @@
"execution_count": null,
"id": "b9c4e5f9-1ba3-44bf-8b17-d53cf8f3b794",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand All @@ -336,10 +320,7 @@
"execution_count": null,
"id": "2402e90f-460b-4e82-9ab4-55999c23011b",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand All @@ -360,10 +341,7 @@
"execution_count": null,
"id": "1d930ba7-e1f8-4400-a7f6-ae14cc208be1",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand Down Expand Up @@ -519,10 +497,7 @@
"execution_count": null,
"id": "765cd354-832e-4b06-9417-405dc7bda4e6",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand Down Expand Up @@ -587,10 +562,7 @@
"execution_count": null,
"id": "92b7103b-c550-442a-822e-589657fd3079",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand Down Expand Up @@ -668,10 +640,7 @@
"execution_count": null,
"id": "f7afa2fd-dd36-4960-8615-bb28bf15b795",
"metadata": {
"tags": [],
"vscode": {
"languageId": "python"
}
"tags": []
},
"outputs": [],
"source": [
Expand Down
Loading