Commit

Merge pull request #16 from massi-ang/fix_data_notebooks
Fix to data notebooks
massi-ang authored Nov 8, 2024
2 parents 1d495af + cd112d1 commit bfe1daf
Showing 5 changed files with 555 additions and 580 deletions.
@@ -84,7 +84,8 @@
"db_secret_arn = db_secret_arn[\"Parameter\"][\"Value\"]\n",
"\n",
"subnet_ids = ssm.get_parameter(Name=subnet_ids_parameter)\n",
-"private_subnets_with_egress_ids = json.loads(subnet_ids[\"Parameter\"][\"Value\"])"
+"private_subnets_with_egress_ids = json.loads(subnet_ids[\"Parameter\"][\"Value\"])\n",
+"script_processor_container_uri = ssm.get_parameter(Name=script_processor_container_parameter)[\"Parameter\"][\"Value\"]"
]
},
{
@@ -175,7 +176,7 @@
"id": "dcb8bed1-bef3-4fd7-84b7-29dd4b418362",
"metadata": {},
"source": [
-"## Attempt the same in a SageMaker processing job with VPC network config"
+"## Attempt the same in a SageMaker processing job with VPC network config"
]
},
{
8 changes: 4 additions & 4 deletions data_pipelines/02-download-raw-pdf-documents.ipynb
@@ -163,15 +163,15 @@
},
"outputs": [],
"source": [
-"ls {raw_base_directory}/Amazon"
+"!ls {raw_base_directory}/Amazon"
]
},
{
"cell_type": "markdown",
"id": "87a9f7bd-81e0-46b2-aa37-abddd1b0f1ec",
"metadata": {},
"source": [
-"## Keep relevant pages\n",
+"## Keep relevant pages\n",
"\n",
"Although you can run the full PDF documents through the solution, to optimize the extraction costs, we suggest that you select the relevant pages from each pdf documents."
]
@@ -354,7 +354,7 @@
},
"outputs": [],
"source": [
-"ls {prepared_base_directory}"
+"!ls {prepared_base_directory}"
]
},
{
@@ -366,7 +366,7 @@
},
"outputs": [],
"source": [
-"cat {prepared_base_directory}/metadata.json | python -m json.tool"
+"!cat {prepared_base_directory}/metadata.json | python -m json.tool"
]
}
],