GitHub - CallumWang/GeoSpark: A Cluster Computing System for Processing Large-Scale Spatial Data


GeoSpark@Twitter || GeoSpark Discussion Board

GeoSpark is a cluster computing system for processing large-scale spatial data. It extends Apache Spark / SparkSQL with a set of out-of-the-box Spatial Resilient Distributed Datasets (SRDDs) / SpatialSQL that efficiently load, process, and analyze large-scale spatial data across machines.

GeoSpark contains three modules:

| Name | API | Spark compatibility | Dependency |
| --- | --- | --- | --- |
| GeoSpark-core | RDD | Spark 2.X/1.X | Spark-core |
| GeoSpark-SQL | SQL/DataFrame | SparkSQL 2.1 and later | Spark-core, Spark-SQL, GeoSpark-core |
| GeoSpark-Viz | RDD | Spark 2.X/1.X | Spark-core, GeoSpark-core |

Core: GeoSpark SpatialRDDs and Query Operators.
SQL: SQL interfaces for GeoSpark core.
Viz: Visualization extension of GeoSpark core.
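
To make the idea of a spatial query operator over partitioned data concrete, here is a minimal, standalone Java sketch of a grid-partitioned range query. It is an illustration only and does not use the GeoSpark API: the class `GridRangeQuerySketch`, the `Envelope` helper, and the methods `partition` and `rangeQuery` are hypothetical names chosen for this example, and a uniform grid is assumed purely for simplicity. GeoSpark's own partitioning and indexing may work differently.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Standalone illustration; none of these names come from GeoSpark.
class GridRangeQuerySketch {

    // Axis-aligned query window (hypothetical helper type).
    static final class Envelope {
        final double minX, minY, maxX, maxY;

        Envelope(double minX, double minY, double maxX, double maxY) {
            this.minX = minX;
            this.minY = minY;
            this.maxX = maxX;
            this.maxY = maxY;
        }

        // Boundary points count as contained.
        boolean contains(double x, double y) {
            return x >= minX && x <= maxX && y >= minY && y <= maxY;
        }
    }

    // Index of the grid cell along one axis for a coordinate.
    static long cellIndex(double coord, double cellSize) {
        return (long) Math.floor(coord / cellSize);
    }

    // Group points {x, y} by grid cell, mimicking one partition per cell.
    static Map<String, List<double[]>> partition(List<double[]> points, double cellSize) {
        Map<String, List<double[]>> cells = new HashMap<>();
        for (double[] p : points) {
            String key = cellIndex(p[0], cellSize) + ":" + cellIndex(p[1], cellSize);
            cells.computeIfAbsent(key, k -> new ArrayList<>()).add(p);
        }
        return cells;
    }

    // Visit only the cells that overlap the window, then filter their points.
    static List<double[]> rangeQuery(Map<String, List<double[]>> cells,
                                     double cellSize, Envelope window) {
        List<double[]> hits = new ArrayList<>();
        long x0 = cellIndex(window.minX, cellSize), x1 = cellIndex(window.maxX, cellSize);
        long y0 = cellIndex(window.minY, cellSize), y1 = cellIndex(window.maxY, cellSize);
        for (long cx = x0; cx <= x1; cx++) {
            for (long cy = y0; cy <= y1; cy++) {
                List<double[]> cell = cells.get(cx + ":" + cy);
                if (cell == null) {
                    continue; // no points fell into this cell
                }
                for (double[] p : cell) {
                    if (window.contains(p[0], p[1])) {
                        hits.add(p);
                    }
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<double[]> points = new ArrayList<>();
        points.add(new double[] {1.0, 1.0});
        points.add(new double[] {5.5, 5.5});
        points.add(new double[] {12.0, 3.0});
        Map<String, List<double[]>> cells = partition(points, 5.0);
        List<double[]> hits = rangeQuery(cells, 5.0, new Envelope(0, 0, 6, 6));
        System.out.println("Points inside window: " + hits.size());
    }
}
```

The point of the sketch is that partitioning by location lets a query skip cells that cannot contain a match, which is the general motivation for combining spatial partitioning with query operators in a distributed setting.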

The GeoSpark development team has published four papers about GeoSpark. Please read Publications.

GeoSpark was evaluated in the PVLDB 2018 paper "How Good Are Modern Spatial Analytics Systems?" by Varun Pandey, Andreas Kipf, Thomas Neumann, and Alfons Kemper (Technical University of Munich), which states:

GeoSpark comes close to a complete spatial analytics system. It also exhibits the best performance in most cases.

Please visit the GeoSpark website for details and documentation.

News!

The full research paper on GeoSpark has been accepted by the GeoInformatica journal. At over 40 pages, it dissects GeoSpark in detail and compares it with other existing systems such as Magellan, Simba, and SpatialHadoop.
GeoSpark 1.1.3 is released. This release contains a critical bug fix for GeoSpark-core RDD API. Release notes || Maven Coordinate.
GeoSpark 1.1.2 is released. This release contains several bug fixes. Thanks for the patch from Lucas C.! Release notes || Maven Coordinate.

