-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generate fullfield products #398
Comments
we currently have around half of the full-field products generated. this is main issue following #393 |
stereo rgb, flir IR have completed generation @ 80% threshold and need to be evaluted for exhaustiveness. Remaining full field (laser 3D heightmap stitch) on hold pending fix of the heightmap PLY -> BMP conversion #417 |
|
|
On terra-clowder from @max-zilla a dump of the rulemonitor db with all the unique FF scans we have and their counts as of friday 3/16 /home/mburnet2/scans.csv. |
Identified problem with fullfield process due to bin2tif (and likely ir) posting metadata after the files are uploaded. Tentative fix for this in bin2tif is providing more complete, though incomplete still, fullfields. First image generated using this method is 2/7/2018. |
What triggers the run, the metadata or the actual image? We should probably make sure people understand this is not atomic, and if they need both pieces they should make sure both are available, we probably should also have some sort of contract that we do things is the same order (i.e. always upload image then metadata) that way people can trigger of just metadata and not have to trigger of both metadata and image to make sure they know when both are ready. |
Rulechecker is a switchboard for many extractors and triggers from a file being added to a dataset. Several of our extractors upload files before attaching metadata, so rulechecker would trigger before metadata was uploaded and respond by saying md is missing. If md is added as a final step consistently we could trigger on that, but not every extractor uploads metadata. I briefly discussed with Craig whether we just rework things for extractors to manually trigger downstream extractors explicitly without using rulechecker or a dataset file added/metadata added listener. |
Update - RGB fullfields are (generally) being generated for the new season starting 5/1. We are missing fullfields for 5/13 and 5/14, likely due to connection error in globus-monitor (need to monitor this component). We are also missing fullfields for 5/2 and 5/4, likely due to other pipeline failures. After manually triggering, there is an entry in the fullfield extractor log for 5/2:
So the VRT already exists but the TIF hasn't been generated. In the case of 5/4, we have over 99% coverage in rgb-geotiff, but rulechecker reports otherwise:
|
Lookin in detail at the missing files for recent dates 2018-05-12
Raw dataset doesn't exist in Clowder. 2018/5/11
Raw dataset doesn't exist in Clowder. Of the 28 datasets, 27 do not exist in Clowder. This suggests that there is a failure in registering the initial raw data. |
We've determined that this is likely caused by the somewhat rare case that files in a dataset are split across two transfers this is now tracked in #446 |
I just deployed new fullfield with higher threshold. Once we see those, we can close this. |
We are missing fullfields for 5/7, 5/25, and 5/26. I've resubmitted all stereotop datasets to rule checker for these dates and we have near complete coverage on filesystem (rgb_geotiff) but the counts in rulemonitor do not match. See https://docs.google.com/spreadsheets/d/1emUW6xu9i0zMh1jUupgqPmIIz_IVjDv-DZWEKbZNCz8/. I'd like to keep this open until we know why this is happening. |
Causes identified were related to outages -- zero byte files on disk. Created new issue. |
We continue to have problems with the fullfield pipeline. This is a best-effort task to generate as many fullfield products as possible along with trait data.
The text was updated successfully, but these errors were encountered: