Running an example with data_module #1

anlarro · 2022-07-26T11:39:25Z

Hi, I was trying to run an example with my own data but when running app.py is asking me for a data_module flag. Reading the documentation and the code I couldn't find a reference on this.
'data_module flag must be provided (path to the data preprocessing module)'

Could you please provide more info about this, thank you!

The text was updated successfully, but these errors were encountered:

carlos-gg · 2022-07-27T12:23:49Z

Hi, thanks for your interest in DL4DS. The documentation needs some work and the creation of a tutorial is WIP. My suggestion would be to call dl4ds.SupervisedTrainer or dl4ds.CGANTrainer directly in your script while passing your (preprocessed) data variables. Have you tried this?

Bear in mind that the app.py module is very experimental and is what I used to run my experiments in a cluster with a workflow manager. The data_module is just a python script were you run your pre-processing steps (e.g., slicing data, splitting, normalizing/standardizing) and some variables are declared. These variables are called in app.py, e.g., DATA.data_train or DATA.predictors_train, when feeding the training or inference steps.

anlarro · 2022-07-27T15:19:49Z

Thank you for your quick response. I'm calling dl4ds.SupervisedTrainer using only data_train, data_val, data_test. But when executing trainer.run() I get:
Unexpected result of train_function (Empty logs). Please use Model.compile(..., run_eagerly=True), or tf.config.run_functions_eagerly(True) for more information of where went wrong, or file a issue/bug to tf.keras.

For what I have found, this error may be because of wrong input data shape. My input data are xr.DataArray with shape [time, latitude, longitude, 1].

carlos-gg · 2022-07-29T09:41:49Z

The error doesn't tell me much so I'm not sure it's even related to the data (shape, format). Please provide more information about how you call the trainer and the full error.

anlarro · 2022-09-14T10:35:02Z

Hi Carlos, thank you for taking care of this. Indeed, the error didn't tell much but I figured out that the problem was with the batch size, so by setting a lower batch size I was able to train a model.

I have another doubt, do all the LR data should be at the same resolution? I mean, data_train_lr, predictors_train, and static_vars should be all at the same resolution or can I have different resolutions for train_lr and static_vars for example?

carlos-gg · 2022-09-15T08:09:26Z

Hi Andrés, I'm glad you've found the issue there. batch_size is a tricky hyperparameter to set as it depends on many factors, such as the size of the model, the available GPU/CPU memory, the size/dimensionality of the training samples, etc. So it's very case dependant.

To answer your question: the parameters data_train_lr, data_val_lr and data_test_lr require low/coarse resolution data. predictors_train is for inputing time-varying predictors and they can come in high or intermediate resolution (DL4DS will internally interpolate/resize the arrays when needed). static_vars on the other hand, must be high-resolution variables, such as elevation/topography. So yes, you can have different resolutions data_train_lr and static_vars.

anlarro changed the title ~~Running an example~~ Running an example with data_module Jul 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running an example with data_module #1

Running an example with data_module #1

anlarro commented Jul 26, 2022

carlos-gg commented Jul 27, 2022 •

edited

Loading

anlarro commented Jul 27, 2022

carlos-gg commented Jul 29, 2022

anlarro commented Sep 14, 2022

carlos-gg commented Sep 15, 2022

Running an example with data_module #1

Running an example with data_module #1

Comments

anlarro commented Jul 26, 2022

carlos-gg commented Jul 27, 2022 • edited Loading

anlarro commented Jul 27, 2022

carlos-gg commented Jul 29, 2022

anlarro commented Sep 14, 2022

carlos-gg commented Sep 15, 2022

carlos-gg commented Jul 27, 2022 •

edited

Loading