Skip to content

Latest commit

 

History

History
319 lines (258 loc) · 15.9 KB

README.md

File metadata and controls

319 lines (258 loc) · 15.9 KB

toy_weather_forecast

Background

The application is designed to predict the weather of various important cities around Australia. As the weather observations are over a period of time, time-series analysis using ARIMA is used to forecast weather parameters.

Data required for the prediction is sourced from BOM (Bureau of Meteorology, Austraila).

Forecasts are performed for - CANBERRA, SYDNEY, WOLLONGONG, MELBOURNE, BRISBANE, CAIRNS, GOLD COAST, ADELAIDE, PERTH, HOBART, DARWIN.

Building

Building & Test:

Building:

Project uses SBT to build & test the application. To compile, test and build jars:

 sbt assembly

Testing:

All test cases can be executed with the below command.

sbt test

Running:

Command line parms must be specified in pairs in the format --name value. Internally application validates this format & converts to a Map(name -> value)

The application expects, 4 mandatory command line parms viz. dataSource, year, month, outFile. Optional parameter dateRange allows overriding the default date range of (201506 to 201606). This range is chosen is default based on data availability in BOM.

if dataSource is bom, the data is downloaded from bom before forecasting else it expects the data to be in local files in current working directory.

To run the project with fat jar in /path/to/jar folder:

java -jar /path/to/jar/toy-weather-forecast.jar --dataSource bom --year 2016 --month 11

Dependencies:

For smooth compile & run. Esure sbt 0.13.9, scala version above 2.10 is installed and java 1.8.

  1. sparkts (For ARIMA time series analysis.)
  2. spark-mllib (sparkts uses spark-mllib)
  3. joda-time
  4. json4s (Extracting Geo-coordinates from Google Rest API response json)

Project organization

The project is split into two layers by means of packages. All the data handling like downloading, accessing API, parsing of values is organised under the org.weather.model.data and all processing logic required for forecasting is grouped under org.weather.model.forecast. Below diagram shows the Objects & functions under the packages.

							  ------------------------------
							  |    toy_weather_forecast    |
						      ------------------------------
                                  |                   |
                                  |                   |
                                  V                   V
	------------------------------------			---------------------------------
	|	org.weather.model.data         |			|  org.weather.model.forecast   |
	|								   |			|                               |
	------------------------------------ 	        ---------------------------------
	     |												|
	     |     ************************					|     ***********************
	     |---->| CommonData           |                 |---->|InitialProcess       |
	     |     ************************                 |     *********************** 
	     |                                              |
	     |     ************************                 |     *********************** 
	     |---->|FileOperations        |                 |---->|PredictFunctions     | 
	     |     ************************                 |     ***********************
	     |                                              |
	     |     ************************                 |     *********************** 
	     |---->|GetData               |                 |---->|PredictWeather       |
	     	   ************************                       ***********************

Execution Flow

PredictWeather is the entry point into the application which controls the order of execution various steps required to complete forecasting. Flow of steps is represented in the diagram below.

***********************************************
* 	Parsing Arguments & Accuring Data         *										 
***********************************************
	------------------------------------	
    |Parse & Validate Command line args|
    ------------------------------------
    			  |
    			  Vtoy_weather_forecast

============

toy_weather_forecast

Background

The application is designed to predict the weather of various important cities around Australia. As the weather observations are over a period of time, time-series analysis using ARIMA is used to forecast weather parameters.

Data required for the prediction is sourced from BOM (Bureau of Meteorology, Austraila).

Forecasts are performed for - CANBERRA, SYDNEY, WOLLONGONG, MELBOURNE, BRISBANE, CAIRNS, GOLD COAST, ADELAIDE, PERTH, HOBART, DARWIN.

Building

Building & Test:

Building:

Project uses SBT to build & test the application. To compile, test and build jars:

 sbt assembly

Testing:

All test cases can be executed with the below command.

sbt test

Running:

Command line parms must be specified in pairs in the format --name value. Internally application validates this format & converts to a Map(name -> value)

The application expects, 4 mandatory command line parms viz. dataSource, year, month, outFile. Optional parameter dateRange allows overriding the default date range of (201506 to 201606). This range is chosen is default based on data availability in BOM.

if dataSource is bom, the data is downloaded from bom before forecasting else it expects the data to be in local files in current working directory.

To run the project with fat jar in /path/to/jar folder:

java -jar /path/to/jar/toy-weather-forecast.jar --dataSource bom --year 2016 --month 11

Dependencies:

  1. sparkts (For ARIMA time series analysis.)
  2. spark-code & spark-mllib
  3. joda-time
  4. json4s (Extracting Geo-coordinates from Google Rest API response json)

Project organization

The project is split into two layers by means of packages. All the data handling like downloading, accessing API, parsing of values is organised under the org.weather.model.data and all processing logic required for forecasting is grouped under org.weather.model.forecast. Below diagram shows the Objects & functions under the packages.

                              ------------------------------
                              |    toy_weather_forecast    |
                              ------------------------------
                                  |                   |
                                  |                   |
                                  V                   V
    ------------------------------------            ---------------------------------
    |   org.weather.model.data         |            |  org.weather.model.forecast   |
    |                                  |            |                               |
    ------------------------------------            ---------------------------------
         |                                              |
         |     ************************                 |     ***********************
         |---->| CommonData           |                 |---->|InitialProcess       |
         |     ************************                 |     *********************** 
         |                                              |
         |     ************************                 |     *********************** 
         |---->|FileOperations        |                 |---->|PredictFunctions     | 
         |     ************************                 |     ***********************
         |                                              |
         |     ************************                 |     *********************** 
         |---->|GetData               |                 |---->|PredictWeather       |
               ************************                       ***********************

Execution Flow

PredictWeather is the entry point into the application which controls the order of execution various steps required to complete forecasting. Flow of steps is represented in the diagram below.

***********************************************
*   Parsing Arguments & Accuring Data         *                                      
***********************************************
    ------------------------------------    
    |Parse & Validate Command line args|
    ------------------------------------
                  |
                  V
    ------------------------------------
    |Get the cities & mapping to IATA  | 
    |code and BoM file name for each   | 
    |city                              |  
    ------------------------------------
                  |
                  V
    ------------------------------------
    |Validate & Get the date for which |
    |forecast is to be performed. Also |
    |validate.                         |
    ------------------------------------
                  |
                  V 
    -------------------------------------
    |Download BoM data if the dataSource|
    |is 'bom' for all the cities & the  |
    |input date                         |
    -------------------------------------
                  |
                  V
    ***************************************
    * Perform Predictions for all Cities  *
    ***************************************

    ------------------------------------------
    |   Read Observation for the city        | <-----------------------  
    ------------------------------------------                        |
                  |                                                   |
                  V                                                   |
    ------------------------------------------                        | 
    | Calculate the t+1 day for which the    |                        |
    | forecast is being performed            |                        |
    ------------------------------------------                        | 
                  |                                                   |
                  V                                                   |
    ------------------------------------------                        |
    |Predict Average Temperature, Pressures  |                        |
    |and Relative humidity using ARIMA time- |                        |
    |series analysis.                        |                        |
    ------------------------------------------                        |
                  |                                                   |
                  V                                       ------------------------------------
    ------------------------------------------            | Repeat For each City in scope of |
    |Predict cloud cover & Sunshine hours    |            | forecast                         |
    |for outlook forecasting                 |            ------------------------------------
    ------------------------------------------                        |
                  |                                                   |
                  V                                                   |
    ------------------------------------------                        |
    |Uinsg Google API get Geo coordinates &  |                        |
    |elevation                               |                        |
    ------------------------------------------                        |
                  |                                                   |
                  V                                                   |
    ------------------------------------------                        |
    |String IATA Code, Geo coordinates, date |                        |
    |forecasting for(utc), outlook, predicted| ------------------------
    |average temperature, pressure and       |  
    |relative humidity.                      |
    ------------------------------------------
                  |
                  V
    ******************************************
    * Write results into the output file     *  
    ******************************************


    ------------------------------------
    |Get the cities & mapping to IATA  | 
    |code and BoM file name for each   | 
    |city                              |  
    ------------------------------------
                  |
                  V
    ------------------------------------
    |Validate & Get the date for which |
    |forecast is to be performed. Also |
    |validate.                         |
    ------------------------------------
                  |
                  V 
    -------------------------------------
    |Download BoM data if the dataSource|
    |is 'bom' for all the cities & the  |
    |input date                         |
    -------------------------------------
    			  |
    			  V
    ***************************************
    * Perform Predictions for all Cities  *
    ***************************************

    ------------------------------------------
    |   Read Observation for the city        | <-----------------------  
    ------------------------------------------                        |
    			  |													  |
    			  V                                                   |
    ------------------------------------------						  |	
    | Calculate the t+1 day for which the    |                        |
    | forecast is being performed		     |                        |
    ------------------------------------------						  |	
    			  |												      |
    			  V                                                   |
    ------------------------------------------					      |
    |Predict Average Temperature, Pressures  |                        |
    |and Relative humidity using ARIMA time- |                        |
    |series analysis.						 |                        |
    ------------------------------------------                        |
                  |                                                   |
    			  V                                       ------------------------------------
    ------------------------------------------            | Repeat For each City in scope of |
    |Predict cloud cover & Sunshine hours 	 |            | forecast                         |
    |for outlook forecasting				 |            ------------------------------------
    ------------------------------------------                        |
    	          | 												  |
    			  V           										  |
    ------------------------------------------                        |
    |Uinsg Google API get Geo coordinates &  |                        |
    |elevation							     |                        |
    ------------------------------------------                        |
    			  |             								      |
    			  V	      											  |
    ------------------------------------------                        |
    |String IATA Code, Geo coordinates, date |				          |
    |forecasting for(utc), outlook, predicted| ------------------------
    |average temperature, pressure and 		 |	
    |relative humidity. 					 |
    ------------------------------------------
                  |
                  V
    ******************************************
    * Write results into the output file     *  
    ******************************************