Add the "decollage" process for raw microscope output to the package #22

metazool · 2024-08-15T09:26:19Z

See #21 for the context for this and links to the original - moving a rough script from the internal project and refactoring it for use in a future pipeline, as yet unspecified.

Adds test fixtures: a single sample image of the composite output as saved on the FlowCam instrument, and an extract of metadata corresponding to that
Adds minimal tests
Write EXIF headers in each individual image with geographic location and date, using the file naming conventions

To test

Run unit tests

export PYTHONPATH=.
py.test cyto_ml/tests/test_decollage.py

Run from the commandline (stopgap)

python cyto_ml/data/decollage.py fixtures/MicrobialMethane_MESO_Tank10_54.0143_-2.7770_04052023_1 test

The last argument there is an "experiment name" used to name the output files. This is a stop-gap set of changes, I didn't want to go any further as it's still not completely clear how the workflow fits together. #9

What this doesn't cover

One discovery here is there's a lot of metadata for individual images based on segmentation and shape analysis that happens onboard the FlowCam - a lot more detail than I thought we'd have access to.

Given we don't have a really clear use case for it, I haven't attempted to do anything with that here but I can see the output being usefully either dropped into the object store and picked up for use with dask via intake, or indexed in a lightweight database like sqlite/datasette

metazool · 2024-08-15T09:32:28Z

@Kzra your feedback particularly appreciated, if you're still generating new images on a regular basis then this should be directly useful to you now, and faster as we're not re-reading the large TIFF every time we extract a small window

If not, i guess the next step is to add a binary classifier and add a probably-junk flag and maybe a confidence metric to each output image, option of just not sending anything on to the cloud at this stage, and see if we can do that with the new object_store_api

Kzra

great work speeding up the decollage process

albags

Great work adding the decollage process.

metazool added 12 commits August 12, 2024 13:40

move the test fixtures out of the package, add the large image

d803846

Add the smallest possible refactor + failing test

f16a2c1

kludge a metadata .lst for a single test collage

f49daf4

leave a collection of TODOs to pick up

5ad1ba0

fixture file for single collage, correct format

3da79aa

very slow refactoring of the decollage script, tests

0f92387

function for filename metadata, abandoned some changes

146a6d4

update to add standard-enough EXIF tag names

ea39f40

switch back to exiftool dependency

f443c7d

logic and tests for Exif header writing to image samples

28d404d

runnable as script, needs a better interface, slower than i'd like

adcaa0c

run from a cleaner OO interface, stop fiddling

a72b96e

metazool requested a review from Kzra August 15, 2024 09:26

metazool requested a review from albags August 27, 2024 10:33

This was referenced Aug 27, 2024

Layout and minimal changes #23

Merged

Drop scivision #24

Merged

Kzra approved these changes Aug 27, 2024

View reviewed changes

Kzra merged commit 634c800 into main Aug 27, 2024

albags reviewed Aug 27, 2024

View reviewed changes

jmarshrossney mentioned this pull request Aug 29, 2024

Revert "Add the "decollage" process for raw microscope output to the package" #27

Merged

metazool deleted the decollage_images branch October 2, 2024 08:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add the "decollage" process for raw microscope output to the package #22

Add the "decollage" process for raw microscope output to the package #22

metazool commented Aug 15, 2024

metazool commented Aug 15, 2024

Kzra left a comment

albags left a comment

Add the "decollage" process for raw microscope output to the package #22

Add the "decollage" process for raw microscope output to the package #22

Conversation

metazool commented Aug 15, 2024

To test

What this doesn't cover

metazool commented Aug 15, 2024

Kzra left a comment

Choose a reason for hiding this comment

albags left a comment

Choose a reason for hiding this comment