first pass QC of underway data #29

ecrockford · 2021-03-30T02:32:24Z

Establish some simple filters to pass over underway data for rough QC prior to uploading to the API.

Output NA for skipped data.
When looking at max/min values use some sort of rolling average or rolling median?
If 1 parameter is bad, do we mask out all parameters? Do we group TSG together and any others separate?

Possible filters:

minimum salinity value
minimum flow meter value (Armstrong)
maximum fluorometer reading
Temperature.... extreme max and min?

joefutrelle · 2021-04-08T14:20:29Z

Locate column header for flow meter for different cruises.

sbeaulieu · 2021-04-08T16:54:36Z

Column header for salinity: SBE45S. Column header for flow meter: FLOW. A challenge with flow meter is that the average value (in ml/s) differs between cruises (e.g., ~50, ~80, ~130 depending on cruise). I saw zero values in several, and NAN values in some. Let's discuss with Taylor re: thresholds for minimum salinity and flow meter values.

ecrockford · 2021-04-08T18:28:51Z

Yes thanks for pointing that out Stace. A while ago Joe and I discussed this. We thought maybe a running average with significant deviation from that average could work? Something along those lines. I've heard in passing from the ship's techs that the flow meter isn't super great so we should take values with a grain of salt. Basically, looking for deviations from consistency.

sbeaulieu · 2021-04-16T15:44:46Z

Noting that in today's Zoom we discussed providing a quality flag column for each of these: salinity, flow meter, and fluorometer. We also discussed providing a comments column that would be auto-populated to alert the end user that a flag(s) was applied.

joefutrelle · 2021-04-22T14:37:22Z

Endeavor data does not use the column headers specified here, so the API will need to skip adding quality flags for that data.

Once column names are regularized, this will not require per-cruise or per-vessel configurations (although regularization will).

joefutrelle · 2021-04-22T14:39:31Z

Related to #30

sbeaulieu · 2021-05-06T14:27:38Z

develop QA/QC checks using Armstrong data and evaluate with Taylor

joefutrelle · 2023-03-20T19:17:13Z

From the original description:

Establish some simple filters to pass over underway data for rough QC prior to uploading to the API.

If these tools are not part of the REST API then we should consider tracking this work elsewhere. I'm leaving the issue open for now but addressing it may not touch this codebase.

joefutrelle added this to the REST api enhancements milestone Apr 1, 2021

joefutrelle assigned sbeaulieu Apr 1, 2021

sbeaulieu assigned ecrockford and joefutrelle and unassigned sbeaulieu Apr 15, 2021

joefutrelle modified the milestones: REST api enhancements, Year 4 report May 27, 2021

joefutrelle added the discussion issue primarily serves as a discussion forum label Mar 28, 2023

joefutrelle removed this from the enhancements to be completed ahead of mid-term review milestone Apr 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

first pass QC of underway data #29

first pass QC of underway data #29

ecrockford commented Mar 30, 2021

joefutrelle commented Apr 8, 2021

sbeaulieu commented Apr 8, 2021

ecrockford commented Apr 8, 2021

sbeaulieu commented Apr 16, 2021

joefutrelle commented Apr 22, 2021 •

edited

Loading

joefutrelle commented Apr 22, 2021

sbeaulieu commented May 6, 2021

joefutrelle commented Mar 20, 2023

first pass QC of underway data #29

first pass QC of underway data #29

Comments

ecrockford commented Mar 30, 2021

joefutrelle commented Apr 8, 2021

sbeaulieu commented Apr 8, 2021

ecrockford commented Apr 8, 2021

sbeaulieu commented Apr 16, 2021

joefutrelle commented Apr 22, 2021 • edited Loading

joefutrelle commented Apr 22, 2021

sbeaulieu commented May 6, 2021

joefutrelle commented Mar 20, 2023

joefutrelle commented Apr 22, 2021 •

edited

Loading