Skip to content

Commit

Permalink
Update scripts with memory tuning
Browse files Browse the repository at this point in the history
  • Loading branch information
acmiyaguchi committed Dec 4, 2019
1 parent 13fa2c2 commit 5c3dd24
Show file tree
Hide file tree
Showing 3 changed files with 6 additions and 5 deletions.
5 changes: 3 additions & 2 deletions scripts/run-command
Original file line number Diff line number Diff line change
Expand Up @@ -5,11 +5,12 @@ set -x
python setup.py bdist_egg
echo "from wikicast import __main__; __main__.cli()" > runner.py

ARROW_PRE_0_15_IPC_FORMAT=1 \
SPARK_HOME=$(python -c "import pyspark; print(pyspark.__path__[0])") \
spark-submit \
--master 'local[*]' \
--conf spark.driver.memory=8g \
--conf spark.sql.shuffle.partitions=8 \
--driver-memory 24g \
--conf spark.sql.shuffle.partitions=24 \
--packages \
graphframes:graphframes:0.7.0-spark2.4-s_2.11 \
--py-files "dist/*.egg" \
Expand Down
4 changes: 2 additions & 2 deletions scripts/start-jupyter
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ PYSPARK_DRIVER_PYTHON=jupyter \
PYSPARK_DRIVER_PYTHON_OPTS=notebook \
pyspark \
--master 'local[*]' \
--conf spark.driver.memory=8g \
--conf spark.sql.shuffle.partitions=8 \
--conf spark.driver.memory=24g \
--conf spark.sql.shuffle.partitions=16 \
--packages \
graphframes:graphframes:0.7.0-spark2.4-s_2.11
2 changes: 1 addition & 1 deletion scripts/start-jupyter.ps1
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,6 @@ $env:PYSPARK_DRIVER_PYTHON_OPTS = "notebook"
pyspark `
--master 'local[*]' `
--conf spark.driver.memory=8g `
--conf spark.sql.shuffle.partitions=8 `
--conf spark.sql.shuffle.partitions=16 `
--packages `
graphframes:graphframes:0.7.0-spark2.4-s_2.11

0 comments on commit 5c3dd24

Please sign in to comment.