Big Data (BIG)

Big data methods and techniques applied to geospatial data

Title Description

Big Data Geo Technologies

Big Data Geo Technologies builds on Big Data. The term refers to the use of predictive analytics, user behavior analytics, or certain other advanced data analytics methods that extract value from data, and seldom to a particular size of data set - (DSTL).

Cloud and HPC

US NIST defines cloud computing in terms of five essential characteristics: on-demand self-service, broad network access, resource pooling, rapid elasticity or expansion, and measured service. The definition also lists three "service models" (software, platform and infrastructure) and four "deployment models" (private, community, public and hybrid) that together categorize ways to deliver cloud services - (NIST).

Workflow and provenance

Provenance is information about entities, activities, and people involved in producing a piece of data or thing, which can be used to form assessments about its quality, reliability or trustworthiness - (W3C PROV).
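As a rough illustration of the definition above, the sketch below records which activity and agent produced each entity and then walks the chain of sources. The class `ProvenanceRecord`, the function `lineage`, and all file and agent names are hypothetical and chosen for illustration only; this is not the W3C PROV data model or any library's API.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ProvenanceRecord:
    """Hypothetical record linking a data product to how it was made."""
    entity: str      # the piece of data being described
    activity: str    # the process that produced it
    agent: str       # the person or organization responsible
    derived_from: List[str] = field(default_factory=list)  # source entities


def lineage(entity: str, records: List[ProvenanceRecord]) -> List[str]:
    """Return every upstream entity that `entity` was derived from."""
    by_entity = {r.entity: r for r in records}
    seen: List[str] = []
    stack = [entity]
    while stack:
        current = stack.pop()
        record = by_entity.get(current)
        if record is None:
            continue  # raw input with no recorded provenance
        for source in record.derived_from:
            if source not in seen:
                seen.append(source)
                stack.append(source)
    return seen


# Made-up example: a vegetation index map derived from a mosaic of two scenes.
records = [
    ProvenanceRecord("ndvi_map.tif", "compute_ndvi", "analyst_a", ["mosaic.tif"]),
    ProvenanceRecord("mosaic.tif", "mosaic_scenes", "pipeline_bot",
                     ["scene_1.tif", "scene_2.tif"]),
]
upstream = lineage("ndvi_map.tif", records)
```

Knowing the full list of upstream entities is what lets a reader assess the quality or trustworthiness of the final product: if one source scene is later found to be faulty, every entity whose lineage contains it can be flagged.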

Machine Learning

Machine learning is the subfield of computer science that gives computers the ability to learn without being explicitly programmed. Deep learning, a subtype of machine learning that includes Convolutional Neural Networks (CNNs), uses artificial neural networks with multiple hidden layers - (Wikipedia). Deep learning is advancing rapidly, driven by increasing computing capabilities, image databases and trained CNNs.
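To make "multiple hidden layers" concrete, here is a minimal sketch of a forward pass through a small network with two hidden layers. It is a plain fully connected network with random, untrained weights, not a CNN; the layer sizes, the input values, and the helper names `dense`, `relu`, `make_layer` and `forward` are assumptions made for illustration.

```python
import random
from typing import List, Tuple

Layer = Tuple[List[List[float]], List[float]]  # (weights, biases)


def relu(values: List[float]) -> List[float]:
    """Rectified linear unit applied element-wise."""
    return [max(0.0, v) for v in values]


def dense(inputs: List[float], weights: List[List[float]],
          biases: List[float]) -> List[float]:
    """One fully connected layer: weighted sum of the inputs plus a bias."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def make_layer(n_in: int, n_out: int, rng: random.Random) -> Layer:
    """Create a layer with random weights and zero biases (untrained)."""
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def forward(inputs: List[float], layers: List[Layer]) -> List[float]:
    """Pass the inputs through every layer; ReLU on all but the last."""
    activations = inputs
    for index, (weights, biases) in enumerate(layers):
        activations = dense(activations, weights, biases)
        if index < len(layers) - 1:
            activations = relu(activations)
    return activations


rng = random.Random(0)
layer_sizes = [4, 8, 8, 2]  # 4 inputs, two hidden layers of 8 units, 2 outputs
layers = [make_layer(a, b, rng) for a, b in zip(layer_sizes, layer_sizes[1:])]
output = forward([0.2, 0.5, 0.1, 0.9], layers)
```

Training, which is where the "learning" happens, would adjust the weights to reduce an error measure on example data; that step is omitted here.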

Big data analytics

“Big data,” machine learning, and predictive data analytics have been hailed as the fourth paradigm of science, allowing researchers to extract insights from both scientific instruments and computational simulations - (ACM Comm).

Modeling, Simulation and prediction

Simulation modeling is the process of creating and analyzing a digital prototype of a physical model to predict its performance in the real world. Models and simulations can be used both for analysis and for training.
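A very small example of the idea, assuming a made-up scenario: the sketch below simulates an object cooling toward the ambient temperature using Newton's law of cooling, stepped forward with the explicit Euler method. The function name `simulate_cooling` and every parameter value are hypothetical and only serve to show how a digital model can predict behavior over time.

```python
from typing import List


def simulate_cooling(initial_temp: float, ambient_temp: float,
                     cooling_rate: float, time_step: float,
                     steps: int) -> List[float]:
    """Explicit Euler integration of dT/dt = -k * (T - T_ambient)."""
    temperatures = [initial_temp]
    current = initial_temp
    for _ in range(steps):
        # Temperature change over one time step, proportional to the gap
        # between the object and its surroundings.
        current += -cooling_rate * (current - ambient_temp) * time_step
        temperatures.append(current)
    return temperatures


# Assumed values: a 90-degree object in a 20-degree room, simulated for
# 60 steps of one time unit each.
temps = simulate_cooling(initial_temp=90.0, ambient_temp=20.0,
                         cooling_rate=0.1, time_step=1.0, steps=60)
```

The predicted curve could then be compared with measurements of the real object, which is the analysis use mentioned above.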

Extremely Large Databases

An extremely large database (XLDB) is a database that stores and processes enormous amounts of data and associated records and entries. XLDBs are the largest class of database and are created and managed by very few organizations around the world, typically scientific research institutes that have massive data sets at their disposal.