Feat/Data handler to save data #188

nulinspiratie · 2024-02-22T13:48:20Z

Data handler

This PR is made because a standardized method is needed in Qualibrate to save data.

Introduction

The DataHandler is used to easily save data once a measurement has been performed.
It saves data into an automatically generated folder with folder structure:
{root_data_folder}/%Y-%m-%d/#{idx}_{name}_%H%M%S.

root_data_folder is the root folder for all data, defined once at the start
%Y-%m-%d: All datasets are first ordered by date
{idx}: Datasets are identified by an incrementer (starting at #1).
Whenever a save is performed, the index of the last saved dataset is determined and
increased by 1.
name: Each data folder has a name
%H%M%S: The time is also specified.
This structure can be changed in DataHandler.folder_structure.

Data is generally saved using the command data_handler.save_data("msmt_name", data),
where data is a dictionary.
The data is saved to the json file data.json in the data folder, but nonserialisable
types are saved into separate files. The following nonserialisable types are currently
supported:

Matplotlib figures
Numpy arrays
Xarrays

Basic example

# Assume a measurement has been performed, and all results are collected here
T1_data = {
    "T1": 5e-6,
    "T1_figure": plt.figure(),
    "IQ_array": np.array([[1, 2, 3], [4, 5, 6]])
}

# Initialize the DataHandler
data_handler = DataHandler(root_data_folder="C:/data")

# Save results
data_folder = data_handler.save_data(data=T1_data, name="T1_measurement")
print(data_folder)
# C:/data/2024-02-24/#152_T1_measurement_095214
# This assumes the save was performed at 2024-02-24 at 09:52:14

After calling data_handler.save_data(), three files are created in data_folder:

T1_figure.png
arrays.npz containing all the numpy arrays

data.json which contains:

{
    "T1": 5e-06,
    "T1_figure": "./T1_figure.png",
    "IQ_array": "./arrays.npz#IQ_array"
}

Creating a data folder

A data folder can be created in two ways:

# Method 1: explicitly creating data folder
data_folder_properties = data_handler.create_data_folder(name="new_data_folder")

# Method 2: Create when saving results
data_folder = data_handler.save_data("T1_measurement", data=T1_data)

Note that the methods return different results.
The method DataHandler.save_data simply returns the path to the newly-created data folder, whereas DataHandler.create_data_folder returns a dict with additional information on the data folder such as the idx.
This additional information can also be accessed after calling DataHandler.save_data through the attribute DataHandler.path_properties.

Manually adding additional files to data folder

After a data folder has been created, its path can be accessed from DataHandler.path.
This allows you to add additional files:

data_folder = data_handler.save_data(data)
assert data_folder == data_handler.path  # data_folder is added to data_handler.path

(data_handler.path / "test_file.txt").write_text("I'm adding a file to the data folder")

Auto-saving additional files to data folder

In many cases certain files need to be added every time a data folder is created.
Instead of having to manually add these files each time, they can be specified beforehand:

DataHandler.additional_files = {
    "configuration.py": "configuration.py
}

Each key is a path from the current working directory, and the corresponding value is the target filepath w.r.t. the data folder.
The key does not have to be a relative filepath, it can also be an absolute path.
This can be useful if you want to autosave a specific file on a fixed location somewhere on your hard drive.

github-actions · 2024-02-22T13:49:38Z

Unit Test Results

394 tests 391 ✔️ 26s ⏱️
    1 suites     3 💤
    1 files     0 ❌

Results for commit a9e14ea.

♻️ This comment has been updated with latest results.

nulinspiratie · 2024-02-23T07:56:42Z

The tests are failing because I haven't added xarray as a required package but as an optional package. @yomach @TheoLaudatQM any recommendations? Should I skip tests if xarray isn't installed?

qualang_tools/results/README.md

qualang_tools/results/data_handler/data_handler.py

qualang_tools/results/README.md

yomach · 2024-02-23T09:44:25Z

The tests are failing because I haven't added xarray as a required package but as an optional package. @yomach @TheoLaudatQM any recommendations? Should I skip tests if xarray isn't installed?

You can tell the tests to build with these packages, I'll take a look next week

yomach · 2024-02-23T20:59:18Z

The tests are failing because I haven't added xarray as a required package but as an optional package. @yomach @TheoLaudatQM any recommendations? Should I skip tests if xarray isn't installed?

You can tell the tests to build with these packages, I'll take a look next week

@nulinspiratie
Check out commit 2bc4181, and specifically this change:

This is how you tell poetry to install extra packages for the testing.

yomach · 2024-02-23T21:08:27Z

The tests are failing because I haven't added xarray as a required package but as an optional package. @yomach @TheoLaudatQM any recommendations? Should I skip tests if xarray isn't installed?

You can tell the tests to build with these packages, I'll take a look next week

@nulinspiratie Check out commit 2bc4181, and specifically this change: This is how you tell poetry to install extra packages for the testing.

Wait, I don't see you added xarray as a required package at all?

nulinspiratie · 2024-02-25T11:06:03Z

@yomach yeah just noticed the pyproject.toml file was never committed. I fixed it now, all the tests are working

qualang_tools/results/data_handler/data_handler.py

yomach

Please add it also to the CHANGELOG.md and to the main README.md file (in the root directory)

nulinspiratie · 2024-02-26T13:44:26Z

Please add it also to the CHANGELOG.md and to the main README.md file (in the root directory)

Done @yomach !

nulinspiratie · 2024-02-26T14:27:12Z

@TheoLaudatQM @yonatanrqm any comments? If possible, I'm hoping to have this merged tomorrow

nulinspiratie added 8 commits February 20, 2024 21:20

add functionality to select new data folder based on idx

9093c5d

started adding save_data

cdbd19e

basic data handler

5e5957e

add numpy array processor

36d40d4

add xarray data handler

9b90b0e

working tests, added init

2c61e3a

add DataHandler.path

d4e7be9

docs + small changes

695ca21

nulinspiratie requested review from yomach, TheoLaudatQM and HiroQM February 22, 2024 13:48

HiroQM reviewed Feb 23, 2024

View reviewed changes

nulinspiratie added 2 commits February 23, 2024 11:05

Added initialization name

6e7556a

Proper sorting of data folders

46f3681

yomach requested a review from yonatanrqm February 23, 2024 20:59

nulinspiratie and others added 10 commits February 25, 2024 10:50

add documentation

6cbc66e

add optional xarray

4cad629

lower min xarray version

c452b7a

add xarray as to poetry extras

087388f

modify workflow to allow xarray

798ce63

remove underscore for workflow

5af5273

update lock file

8d52ad9

remove min_size numpy array

84a6b4e

add test xarray skip if not installed

c8ee97a

Reduce performance test duration

7d620b4

nulinspiratie added 3 commits February 25, 2024 13:07

added additional_files

bc07ff1

black formatting

2bbef32

added info on auto using filename as name

99c1853

HiroQM reviewed Feb 25, 2024

View reviewed changes

qualang_tools/results/data_handler/data_handler.py Show resolved Hide resolved

yomach requested changes Feb 26, 2024

View reviewed changes

Update changelog and readme

c5ed718

nulinspiratie requested review from yomach and HiroQM February 26, 2024 14:26

nulinspiratie added 3 commits February 26, 2024 16:35

Fix attempt: windows \ to /

9bd4419

fix: create create_data without creating

47e308a

fix import pathlib

3f22f64

yomach approved these changes Feb 26, 2024

View reviewed changes

Allow multiple saves

a9e14ea

nulinspiratie merged commit 1afde01 into main Feb 27, 2024
2 checks passed

nulinspiratie deleted the data_handler branch February 27, 2024 08:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/Data handler to save data #188

Feat/Data handler to save data #188

nulinspiratie commented Feb 22, 2024 •

edited

Loading

github-actions bot commented Feb 22, 2024 •

edited

Loading

nulinspiratie commented Feb 23, 2024

yomach commented Feb 23, 2024

yomach commented Feb 23, 2024

yomach commented Feb 23, 2024

nulinspiratie commented Feb 25, 2024

yomach left a comment

nulinspiratie commented Feb 26, 2024

nulinspiratie commented Feb 26, 2024

Feat/Data handler to save data #188

Feat/Data handler to save data #188

Conversation

nulinspiratie commented Feb 22, 2024 • edited Loading

Data handler

Introduction

Basic example

Creating a data folder

Manually adding additional files to data folder

Auto-saving additional files to data folder

github-actions bot commented Feb 22, 2024 • edited Loading

Unit Test Results

nulinspiratie commented Feb 23, 2024

yomach commented Feb 23, 2024

yomach commented Feb 23, 2024

yomach commented Feb 23, 2024

nulinspiratie commented Feb 25, 2024

yomach left a comment

Choose a reason for hiding this comment

nulinspiratie commented Feb 26, 2024

nulinspiratie commented Feb 26, 2024

nulinspiratie commented Feb 22, 2024 •

edited

Loading

github-actions bot commented Feb 22, 2024 •

edited

Loading