Spark in Docker

Create an Apache Spark cluster in Docker.

Note that this is a proof of concept developed for testing purposes only. See the Sources section for the references I consulted. Some cleanup and enhancements are still required, which I will try to do in my spare time.

Commands

Build images (with base):

DEPLOY_BASE=1 ./build

Build images (reusing base image):

./build

Start cluster:

docker-compose up --scale spark-master=1 --scale spark-worker=6 --scale spark-submit=0
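The worker count in the command above is fixed at 6. As a rough sketch, a small helper could build the same command for a different number of workers. The function name `scale_cmd` is hypothetical and not part of this repository; it only prints the command, it does not run it.

```shell
#!/bin/sh
# Hypothetical helper (an assumption, not shipped with this repository):
# prints the docker-compose command for the requested number of workers.
scale_cmd() {
    workers="${1:-6}"   # default of 6 matches the command above
    # Reject non-numeric input and a literal 0.
    case "$workers" in
        ''|*[!0-9]*|0)
            echo "worker count must be a positive integer" >&2
            return 1
            ;;
    esac
    echo "docker-compose up --scale spark-master=1 --scale spark-worker=${workers} --scale spark-submit=0"
}
```

Calling `scale_cmd 3` prints the command for three workers; wrapping the call as `eval "$(scale_cmd 3)"` would run it.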

Submit task:

docker-compose run spark-submit
./run <script args>

Sources