A challenge on the SWOT satellite error calibration organised by Datlas, IGE, IMT Altlantique and CLS.
4.2 Evaluation on SWOT swath: Leaderboard
4.3 Evaluation on Nadirs tracks: Leaderboard
4.3 Evaluation on an independant nadir: Leaderboard
The two-dimensional sea level SWOT products are very much expected to be a game changer in many oceanographic applications which will make them an unprecedented L3 product to be distributed. The row SWOT data will however be contaminated by instrumental and geophysical errors (Gaultier et al., 2016; Peral and Esteban-Fernandez, 2018). In order to be able to observe front, mesoscale and sub-mesoscale features, the SWOT data will require specific processing. Also, these errors are expected to strongly pollute the first and second derivatives of the SSH data which are used for the computation of geostrophic currents and vorticity. Hence, being able to remove the SWOT errors will be of significant importance to recover information on 2D surface currents and vertical mixing.
The SWOT errors are expected to generate noises that are both correlated on the swath and spatially uncorrelated. Several past efforts have already investigated methods to remove or reduce the correlated noises from the SWOT data using prior knowledge on the ocean state (e.g. Metref et al., 2019, see Figure 2.4.A), calibration from independent Nadir altimeter data (e.g. Febvre et al., 2021, see Figure 2.4.B) or cross-calibration from SWOT data themselves (on-going CNES-DUACS studies). And other efforts focused on reducing the uncorrelated data (Gomez-Navarro et al., 2018, 2020; Febvre et al., 2021). Yet, so far, no rigorous intercomparison between the recently developed methods has been undertaken and it seems difficult, to this day, to outline the benefits and limitations of favoring one error reduction method from another.
It is important to mention that the SWOT science requirement for uncorrelated noise measurement error specifies that KaRin must resolve SSH on wavelength scales up to 15 km based on the 68th percentile of the global wavenumber distribution.
The goal of this SWOT Calibration data challenge is to provide a platform to investigate the most appropriate calibration procedures to reduce the spatially correlated errors from the SWOT data.
You can find another data challenge proposed by Datlas, CLS and IMT-Atlantique if you are interested in filtering only the instrumental (KaRIn) SWOT noise: here.
In practice, the SWOT error calibration data challenge is in the form of an Observing System Simulation Experiment (OSSE) in the Gulf Stream region considering a realistic ocean model simulation (NATL60) as the true ocean state. The SWOT simulator (Gaultier et al., 2016) was then used to create realistic SWOT data with and without correlated errors. Then, various calibration procedures can be tested and compared to the true ocean state.
This data challenge is partly funded by the CNRS, by the CNES and, as part of the Sea Level Innovations and Collaborative Intercomparisons for the Next-Generation products (SLICING) project, by the Copernicus Marine Service Evolution (21036-COP-INNO SCI).
The data challenge is in the form of an Observing System Simulation Experiment (OSSE) considering a realistic ocean model simulation, the NEMO high resolution North Atlantic simulation NATL60, as the true ocean state in a 10°X10° degree box in the Gulf Stream region. The SWOT simulator (Gaultier et al., 2016) was then used to create realistic SWOT data with and without instrumental noise.
The experiment is performed over one SWOT orbital cycle (cycle 13) which contains 270 passes. All other cycles are available to tune or train the filters.
The SWOT data to calibrate (the inputs: ssh_err) and their equivalent error-free SWOT data for evaluation (the targets: ssh_true) are hosted and available for download on the MEOM opendap server: see Download the data section below. In no way the targets that are available during the evaluation period should be used in the calibration process.
Fig.: Example of one target of the data challenge: SWOT data without errors (left), one input of the data challenge: SWOT data with errors (center) and the difference between the two (right). Here, SWOT is "flying" over the NATL60 model simulation in the Gulf Stream region.The baseline method is a very simple and straightforward method that removes the averaged on the pass of a linear trend across track. This is supposed to remove all the signal that has the same structure as the roll error (if the roll error was constant over the region). This method is flowed since (1) there are other errors, (2) the roll error is not constant over the region and (3) the method also removes signals that happens to have a linear structure across track but is actual SSH. The baseline is performed in the notebook demo_generate_baseline.
There are three ways of evaluating the calibration performance. Since the experiment is an OSSE, we do have the true SSH on the SWOT swath. Hence the first possible evaluation is directly on the SWOT swath where the calibrated SWOT data are compared to the true SWOT data. The second evaluation is performed using OSE-type metrics, i.e. as if the true data on the swath wasn't available. In this case, the calibrated SWOT data are interpolated onto some Nadir tracks (envisat, geosat2, jason1 and topex-poseidon) and then compared to the true SWOT data (also interpolated on the Nadir tracks). The third evaluation is the same as the second one but with an independant satellite (cryosat) that has never been used to help calibrate SWOT.
The first evaluation of the calibration methods is based on the comparison of the SWOT calibrated swaths with the error-free SWOT swaths. It includes two scores, one based on the Root-Mean-Square Error (RMSE), the other a noise-to-signal ratio based on an isotropic 2D Fourier wavenumber spectra. The evaluation notebook demo_evaluate_onswot_baseline implements the computation of these two scores for the baseline as they appear in the first leaderboard.
After running the evaluation notebook for a method, the files stat_'METHOD'.nc (RMSE data), psd_'METHOD'.nc (PSD and SNR data) and ldb_'METHOD'.nc (Leaderboard data) are saved in the chosen foler.
A comparison notebook, compare_evaluations_onswot, is available to plot and compare the statistics and the spectral diagnostics of multiple methods in the same figures. Also, the leaderboard for each method can be printed.
The second evaluation of the calibration methods is based on the comparison of the true SWOT data interpolated on Nadir tracks. It also includes two scores, one based on the Root-Mean-Square Error (RMSE), the other a noise-to-signal ratio based on a one-dimensional (along Nadir tracks crossing the swath) Fourier wavenumber spectra. The evaluation notebook demo_evaluate_onnadirs_baseline implements the computation of these two scores for the baseline as they appear in the second leaderboard.
This third evaluation is exactly the same as the previous one (Nadir tracks) but using an independant nadir track. Here, the cryosat altimeter data has been simulated on the NAtl60 reference simulation and should not have been used in any way to tune the SWOT calibration.
The evaluation notebook demo_evaluation_onindepnadir_baseline implements the computation of these two scores for the baseline as they appear in the third leaderboard.
💻 How to get started ?
Clone the data challenge repo:
git clone https://github.com/ocean-data-challenges/2022c_SWOT_error_calibration_GS.git
create the data challenge conda environment, named env-dc-swot-filtering, by running the following command:
conda env create --file=environment.yml
and activate it with:
conda activate env-dc-swot-calibration-gs
then add it to the available kernels for jupyter to see:
ipython kernel install --name "env-dc-swot-calibration-gs" --user
You're now good to go !
The data are hosted on the opendap: ocean-data-challenges/2022c_SWOT_error_calibration_GS/.
The data needed for the DC are presented with the following directory structure:
.
|-- 2022c_SWOT_error_calibration_GS
| |-- dc_SWOTcalibGS_inputs.tar.gz
| |-- dc_SWOTcalibGS_nadirs.tar.gz
| |-- dc_SWOTcalibGS_maps.tar.gz
To start out download the datasets from the data server.
!wget https://ige-meom-opendap.univ-grenoble-alpes.fr/thredds/fileServer/meomopendap/extract/ocean-data-challenges/2022c_SWOT_error_calibration_GS/dc_SWOTcalibGS_inputs.tar.gz
and then uncompress the files using tar -xvf <file>.tar.gz
. You may also use ftp
, rsync
or curl
to donwload the data.
The inputs are stored in the variable ssh_err and the targets are stored in the variable ssh_true.
!wget https://ige-meom-opendap.univ-grenoble-alpes.fr/thredds/fileServer/meomopendap/extract/ocean-data-challenges/2022c_SWOT_error_calibration_GS/dc_SWOTcalibGS_nadirs.tar.gz
NB: The file dc_SWOTcalibGS_cryosat.nc is to be used only for evaluation !
!wget https://ige-meom-opendap.univ-grenoble-alpes.fr/thredds/fileServer/meomopendap/extract/ocean-data-challenges/2022c_SWOT_error_calibration_GS/dc_SWOTcalibGS_maps.tar.gz
and then uncompress the files using tar -xvf <file>.tar.gz
. You may also use ftp
, rsync
or curl
to donwload the data.
See the Baseline section above.
Notebook available: demo_generate_baseline.ipynb
The CER method is based on Metref et al. (2019). The CER method algorithm is included in src/aux.py and launched in src/swot.py with the function apply_CERmethod_calib
.
Here, the ensemble used for the assimilation is a 1/20° SSH fields created using optimal interpolations with Nadir altimeter data (both the maps and the Nadir data are available on the opendap ; check Download the data section). From these maps, a 10-member ensemble is generated by selecting fields at dates close to the SWOT observation date. The ensemble is projected onto the SWOT swath and both the SWOT data and the projected ensemble are detrended (i.e., the signal with a geometrical structure similar to the expected error structure is removed). Using the projected ensemble, the detrended ensemble and the detrended SWOT data, an assimilation analysis is performed (using an Ensemble Transform Kalman Filter). The ensemble average of this analysis constitutes the calibrated SWOT data.
Notebook available: generate_CERmethod_calibration.ipynb
The projection method is very similar to the CER method. The projection method interpolates the SSH map created with the OI method and 5 nadirs (see CER method) onto the SWOT track. The difference between the SWOT with error data and the interpolated SSH map is then "projected", i.e., using a min square method the signal in this difference that has the same geometry as the expected SWOT errors (linear across track for the roll error, quadratic for the baseline dilation error ...). The residual is then added back onto the interpolated OI SSH map (containing mostly larger scales). A low-pass and high-pass filter is also performed so to leave untouch the mesoscale that is already accurate in the OI SSH map and that can be wrongly projected onto the error geometry.
Notebook available: generate_Projmethod_calibration.ipynb
The CalCNN method is based on Febvre et al. (in prep.). The algorithms are not yet shared by the authors but will be soon.
The CalCNN method uses a trainable architecture that takes as inputs the uncalibrated observations and the nadir based L4 product interpolated on the SWOT swath. The architecture is trained in a supervised manner on the reconstruction of the SWOT SSH. The L4 product and KaRIN observations are first each decomposed into Ns scale components which are concatenated together as channels. This produces an input tensor of shape (2Ns, Nal, Nac) where Nal and Nac are respectively the along track and across track sizes of the input swath section. An input convolution layer is used to change the number of channels. The data is then processed by a series a residual convolutional blocks composed of a convolution layer, a ReLU non-linearity, a skip connection and a mixing layer. Then a last convolution layer is used to compute a point wise correction of the L4 product from the different channels. Finally this correction is added to the input interpolated field to produce the calibrated SWOT data.
No notebook available. The method was generated offline by IMT Atlantique (contact Quentin Febvre).
Method | Field | µ(RMSE) | λ(SNR1) [km] | Reference |
---|---|---|---|---|
No Calib | Sea Surface Height |
3.395 | 116.3 | demo_evaluate_onswot_baseline |
--- | Geos. current |
9.437 | nan | --- |
--- | Relative vorticity |
0.760 | 37.9 | --- |
Baseline | Sea Surface Height |
0.145 | 83.9 | demo_evaluate_onswot_baseline |
--- | Geos. current |
0.288 | 84.9 | --- |
--- | Relative vorticity |
0.529 | 36.5 | --- |
CER-method | Sea Surface Height |
0.074 | 4.2 | evaluate_onswot_CERmethod |
--- | Geos. current |
0.219 | 78.4 | --- |
--- | Relative vorticity |
0.261 | 103.4 | --- |
Projmethod | Sea Surface Height |
0.037 | 4.2 | evaluate_onswot_Projmethod |
--- | Geos. current |
0.115 | 40.2 | --- |
--- | Relative vorticity |
0.271 | 30.6 | --- |
CalCNN | Sea Surface Height |
0.014 | 4.2 | evaluate_onswot_CalCNNmethod |
--- | Geos. current |
0.076 | 30.6 | --- |
--- | Relative vorticity |
0.212 | 37.9 | --- |
with:
µ(RMSE)
: averaged root-mean square error over the tracks crossing the swath,
λ(SNR1)
: spatial wavelength where SNR=0.5.
Method | Field | RMSE | λ(SNR1) [km] | Reference |
---|---|---|---|---|
Ref (True SWOT) | SSH |
0.013 | 15.561 | demo_evaluation_onnadirs_baseline |
No Calib | SSH |
4.041 | NaN | demo_evaluation_onnadirs_baseline |
Baseline | SSH |
0.218 | 30.270 | demo_evaluation_onnadirs_baseline |
CER-method | SSH |
0.074 | 23.800 | evaluation_onnadirs_CERmethod |
Projmethod | SSH |
0.037 | 25.647 | evaluation_onnadirs_Projmethod |
CalCNN | SSH |
0.0134 | 19.772 | evaluation_onnadirs_CalCNN |
with:
RMSE
: root-mean square error over the nadir tracks-SWOT swath crossings,
λ(SNR1)
: spatial wavelength where SNR=0.5.
Method | Field | RMSE | λ(SNR1) [km] | Reference |
---|---|---|---|---|
Ref (True SWOT) | SSH |
0.016 | 22.616 | demo_evaluation_onindepnadir_baseline |
No Calib | SSH |
4.078 | NaN | demo_evaluation_onindepnadir_baseline |
Baseline | SSH |
0.171 | 38.132 | demo_evaluation_onindepnadir_baseline |
CER-method | SSH |
0.076 | 18.656 | evaluation_onnadirs_CERmethod |
Projmethod | SSH |
0.033 | 22.312 | evaluation_onnadirs_Projmethod |
CalCNN | SSH |
? | ? | ? |
with:
RMSE
: root-mean square error over the nadir tracks-SWOT swath crossings,
λ(SNR1)
: spatial wavelength where SNR=0.5.
This data challenge was funded by CNRS, by CNES and by the Service Evolution CMEMS project: SLICING. The challenge was designed by Datlas in collaboration with IGE, CLS and IMT-Atlantique.
The structure of this data challenge was to a large extent inspired by the ocean-data-challenges
created for the BOOST-SWOT ANR project and the SLICING CMEMS project.
Gaultier, L., Ubelmann, C., & Fu, L. L. (2016). The challenge of using future SWOT data for oceanic field reconstruction. Journal of Atmospheric and Oceanic Technology, 33(1), 119-126.
Gómez-Navarro, L., Fablet, R., Mason, E., Pascual, A., Mourre, B., Cosme, E., & Le Sommer, J. (2018). SWOT spatial scales in the western Mediterranean sea derived from pseudo-observations and an Ad Hoc filtering. Remote Sensing, 10(4), 599.
Gomez-Navarro, L., Cosme, E., Le Sommer, J., Papadakis, N., & Pascual, A. (2020). Development of an image de-noising method in preparation for the Surface Water and Ocean Topography satellite mission. Remote Sensing, 12(4).
Metref, S., Cosme, E., Le Sommer, J., Poel, N., Brankart, J. M., Verron, J., & Gómez Navarro, L. (2019). Reduction of spatially structured errors in wide-swath altimetric satellite data using data assimilation. Remote Sensing, 11(11), 1336.
Peral, E., & Esteban-Fernandez, D. (2018, July). SWOT mission performance and error budget. In IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium (pp. 8625-8628). IEEE.