Identifying Atmospheric Rivers on the West Coast of the United States with Geostationary Operational Environmental Satellite (GOES) Imagery

Summary

This repository contains all necessary information (rationale, methodology, data, codes and results) used in identifying atmospheric rivers (ARs) in the Western United States. The primary objective of this project is to explore the potential of GOES satellite data in accurately identifying ARs along the Pacific coast of North America. We developed and tested two machine learning models—a U-Net model and a random forest model—to identify ARs from GOES satellite imagery spanning the period from 2018 to 2020. The research questions guiding this investigation include:

Can optical satellite data from GOES, with its unique spatio-temporal resolution, be effectively utilized for the detection of ARs?
How do the performance of a high-fidelity U-Net model compare to those of a lower-fidelity Random Forest model in identifying ARs from GOES imagery?

Methodology

Study Area

Our research focuses on the Pacific West Coast of North America, as illustrated in Figures 1 (a) and 1 (b). The study area encompasses a broad region from the state of Washington in the north to California in the south, extending westward over the Pacific Ocean.

Figure 1: Study Area and GOES Satellite Imagery. (a) Pacific West Coast region including the US, Canada, and Mexico. (b) GOES satellite image capturing atmospheric conditions over the study area (false color composite using bands 2,3,1).

Dataset

GOES Imagery

This study utilizes data from the GOES-R series, specifically employing the Advanced Baseline Imager (ABI) with 16 channels, offering a spatial resolution of 2 km for most channels and a temporal resolution of 15 minutes for full disk scans. Data retrieval for the GOES-17 and GOES-18 satellites from 2016 to present is facilitated by the GOES-2-go Python package (Version 2022.07.15), which accesses data through Amazon Web Services (AWS) as part of NOAA's Open Data Dissemination Program GOES2Go. package in github.
The raw data from the GOES satellites required conversion into a standardized projection format. This step involved transforming the native GOES projection into a rectilinear latitude-longitude projection to ensure consistency with the analysis requirements and facilitate accurate spatial referencing. Additionally, the GOES images were resized from their original resolution of 4500x4996 pixels to 512x512 pixels to facilitate machine learning training and improve computational efficiency.

Labeled data

The dataset consists of continuous labeled data from 1979 to 2022, recorded at 6-hour intervals. The data was sourced from Rhodes. which utilized the advanced TempestExtremes v2.1 algorithm to detect these features. The paper associated can be found here.

ERA5 climate reanalysis product

The retrieved data from ERA5 include wind components (u, v) and specific humidity (q) at multiple pressure levels (300 hPa to 1000 hPa). These data were used to calculate Integrated Water Vapor (IWV) and Integrated Vapor Transport (IVT). We utilized ERA-5 reanalysis data to calculate IVT and IWV and developed a thresholding algorithm to identify AR objects. This algorithm applies specific thresholds for IVT and IWV and evaluates the minimum length and width of potential AR regions to meet typical AR criteria. The algorithm starts by applying threshold values for IVT and IWV to create an initial mask of potential AR regions. Regions meeting these thresholds are marked. Minimum length and width requirements for ARs are converted from kilometers to grid points, ensuring precise identification of contiguous AR regions. Each connected region in the initial mask is labeled and evaluated against these criteria, resulting in a robust final binary mask representing AR regions. This mask is used for further analysis and benchmarking, providing a reliable method for identifying ARs in the ERA-5 reanalysis dataset.

Models

U-Net CNN architecture

Learning rate: initialized to 0.001
Batch size: 16
Optimizer: Adam optimizer
Loss function: Binary Cross-Entropy and Focal Cross-Entropy (combined)
Training: over 100 epochs
Trained for 33 hours

Figure 2: Overall Methodological Framework

RF model

Decision trees: 100
Maximum depth: none
Max_features: sqrt
Trained for 7 hours

Results

ERA-5 Benchmark Dataset

Figure 3: Atmospheric River Identification for December 13, 1997, at 12:00 UTC. (a) Integrated Vapor Transport (IVT), (b) Integrated Water Vapor (IWV), and (c) AR mask indicating identified atmospheric rivers.

Image Segmentation using RF

Figure 4: Random Forest (RF) model predictions compared with labeled data and evaluation results. The top row shows (a) RF model prediction, (b) labeled data, and (c) evaluation of the model which has an IoU of 0.34 and an accuracy of 0.94.

Image Segmentation using CNN

Figure 5: AR identification using our U-Net model trained on GOES imagery, showing (a) brightness temperature from the water vapor bands (Bands 8-10) as a False Color Composite showing complex cloud patterns and atmospheric features, (b) The labeled dataset provided for training, highlighting the atmospheric river region in white, and (c) the prediction made by the U-Net model, indicating the probability of an atmospheric river (AR), where lighter shades represent higher probabilities.

Codes

All codes that we used in the project is inside the folder "jupyter_notebooks"

Report

Report can be found here. (To be included)
Supplemantary material can be found here. (To be included)

References

Reviewed literature is tracked in this excel.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
cdsapi_requests		cdsapi_requests
data		data
jupyter_notebooks		jupyter_notebooks
old_jobfiles		old_jobfiles
scripts		scripts
.gitignore		.gitignore
README.md		README.md
jupyter_notebook.job		jupyter_notebook.job
model_training.job		model_training.job
train_model.py		train_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Identifying Atmospheric Rivers on the West Coast of the United States with Geostationary Operational Environmental Satellite (GOES) Imagery

Summary

Methodology

Study Area

Dataset

GOES Imagery

Labeled data

ERA5 climate reanalysis product

Models

U-Net CNN architecture

RF model

Results

ERA-5 Benchmark Dataset

Image Segmentation using RF

Image Segmentation using CNN

Codes

Report

References

About

Releases

Packages

Contributors 3

Languages

NWC-CUAHSI-Summer-Institute/SI24_GOES-ARs

Folders and files

Latest commit

History

Repository files navigation

Identifying Atmospheric Rivers on the West Coast of the United States with Geostationary Operational Environmental Satellite (GOES) Imagery

Summary

Methodology

Study Area

Dataset

GOES Imagery

Labeled data

ERA5 climate reanalysis product

Models

U-Net CNN architecture

RF model

Results

ERA-5 Benchmark Dataset

Image Segmentation using RF

Image Segmentation using CNN

Codes

Report

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages