Skip to content

Commit

Permalink
Adjusting README and files according to JOSS reviewers
Browse files Browse the repository at this point in the history
  • Loading branch information
silkemaes committed Oct 10, 2024
1 parent 754e98c commit 3a610d8
Show file tree
Hide file tree
Showing 7 changed files with 66 additions and 29 deletions.
Binary file added M_rate16.npy
Binary file not shown.
68 changes: 56 additions & 12 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,41 +2,85 @@

Welcome to the MACE repository!

***MACE, a Machine-learning Approach to Chemistry Emulation***, by [Maes et al. (2024)](https://ui.adsabs.harvard.edu/abs/2024arXiv240503274M/abstract), is a surrogate model for chemical kinetics. It is developed in the contexts of circumstellar envelopes (CSEs) of asymptotic giant branch (AGB) stars, i.e. evolved low-mass stars.
***MACE, a Machine-learning Approach to Chemistry Emulation***, by [Maes et al. (2024)](https://ui.adsabs.harvard.edu/abs/2024arXiv240503274M/abstract), is a surrogate model for chemical kinetics. It is developed in the contexts of circumstellar envelopes (CSEs) of asymptotic giant branch (AGB) stars, i.e. evolved low-mass stars.

MACE is implemented in Python and uses [PyTorch](https://pytorch.org/), together with [torchode](https://github.com/martenlienen/torchode) [(Lienen & Gunnemann, 2022)](https://openreview.net/pdf?id=uiKVKTiUYB0), to be trained.
During development, the chemical models of [Maes et al. (2023)](https://ui.adsabs.harvard.edu/abs/2023MNRAS.522.4654M/abstract) are used. In this paper you can also find more details about the astrochemical environment used.

MACE is implemented in Python and is trained using [PyTorch](https://pytorch.org/), together with [torchode](https://github.com/martenlienen/torchode) [(Lienen & Gunnemann, 2022)](https://openreview.net/pdf?id=uiKVKTiUYB0).

---
## Table of content
- [Installation](#inst)
- [What is MACE?](#what)
- [How to use?](#use)
- [Example case](#exmp)
- [Contact](#cont)
- [Acknowledgements](#ackn)

---
## Notes on installation
- MACE is not available on ```pypi```, the package named ```mace``` is not this one.
## Notes on installation <a name="inst"></a>
- MACE is currently not available as a package on ```pypi```. There is a package named ```mace```, but it is not this one.
- To use MACE, please clone the repo and install the required packages, see ```requirements.txt```:
```
git clone https://github.com/silkemaes/MACE.git
```


---
## What?
## What is MACE? <a name="what"></a>

MACE offers a surrogate model that emulates the evolution of chemical abundances over time in a dynamical physical environment. As the name states, it makes use of machine-learning techniques. More specifically, combining an *autoencoder* (blue) and a *trainable ordinary differential equation (ODE)* (red) allows to accurately emulate a chemical kinetics model.

Hence, MACE is a framework, an architecture, that can be trained for specific chemical datasets, but before using, should be made compatible with the dataset, see _[How to use?](#use)_.

The architecture of MACE is schematically given as
![MACE architecture](MACE.png)

MACE offers a surrogate model that emulates the evolution of chemical abundances over time in a dynamical physical environment. As the name states, it makes use of machine learning techniques. More specifically, combining an *autoencoder* (blue) and a *trainable ordinary differential equation (ODE)* (red) allows to accurately emulate a chemical kinetics model.
MACE offers a surrogate model that emulates the evolution of chemical abundances over time in a dynamical physical environment. As the name states, it makes use of machine-learning techniques. More specifically, combining an *autoencoder* (blue) and a *trainable ordinary differential equation (ODE)* (red) allows to accurately emulate a chemical kinetics model.

In formula, MACE is stated as
$${\hat{\boldsymbol{n}}}(t) = \mathcal{D}\Big( G \big( \mathcal{E} ({\boldsymbol{n}}, {\boldsymbol{p}}),t \big) \Big).$$
Here, ${\hat{\boldsymbol{n}}}(t)$ are the predicted chemical abundances at a time $t$ later dan the initial state ${\boldsymbol{n}}$ . $\mathcal{E}$ and $\mathcal{D}$ represent the autoecoder, with the encoder and decoder, respectively. The autoencoder maps the chemical space ${\boldsymbol{n}}$ together with the physical space ${\boldsymbol{p}}$ to a lower dimensional representation $\boldsymbol{z}$, called the latent space. The function $G$ describes the evolution in latent space such that $\boldsymbol{z}(\Delta t) = G(\boldsymbol{z}, \Delta t)=\int_0^{\Delta t} g(\boldsymbol{z}){\rm d}t$.

$$
{\hat{\boldsymbol{n}}}(t) = \mathcal{D}\Big( G \big( \mathcal{E} ({\boldsymbol{n}}, {\boldsymbol{p}}),t \big) \Big).
$$

Here, ${\hat{\boldsymbol{n}}}(t)$ are the predicted chemical abundances at a time $t$ later dan the initial state ${\boldsymbol{n_0}}$. $\mathcal{E}$ and $\mathcal{D}$ represent the autoecoder, with the encoder and decoder, respectively. The autoencoder maps the chemical space ${\boldsymbol{n_0}}$ together with the physical space ${\boldsymbol{p}}$ to a lower dimensional representation $\boldsymbol{z}$, called the latent space. The function $G$ describes the evolution in latent space such that $\boldsymbol{z}(\Delta t) = G(\boldsymbol{z}, \Delta t)=\int_0^{\Delta t} g(\boldsymbol{z}){\rm d}t$.

For more details, check out our paper: [Maes et al. (2024)](https://ui.adsabs.harvard.edu/abs/2024arXiv240503274M/abstract).

---
## How to run?
## How to use? <a name="use"></a>

The script ```routine.py``` gives the flow of training & storing a MACE architecture, and immediately applies to the specified test dataset once training is finished. As such, it returns an averaged error on the MACE model compared to the classical model. More info on the training routine can be found in the [paper](https://ui.adsabs.harvard.edu/abs/2024arXiv240503274M/abstract).

An annotated notebook of the routine can be found in the [documentation](https://mace-code.readthedocs.io/en/latest/example/run.html).

The script ```routine.py``` takes an input file with the needed (hyper)parameter setup. An example of such an input file can be found in input/.
```
python routine.py example
```

***Disclaimer:***

In order to train MACE with a certain chemical dataset, the ```Dataset``` class
should be made compatible with that data. Currently, the script ```src/mace/CSE_0D/dataset.py``` works only for the specific dataset used here, i.e. models from [Maes et al. (2023)](https://ui.adsabs.harvard.edu/abs/2023MNRAS.522.4654M/abstract), using the [Rate22-CSE code](https://github.com/MarieVdS/rate22_cse_code).


---
## Example case <a name="exmp"></a>

This repository contains a trained MACE model as a test case, see ```model/20240604_160152```.

Once the Dataset class is set up properly (see src/mace/CSE_0D/dataset.py), a MACE model can be trained. This can be done using the script 'run.py', which takes an input file with the needed (hyper)parameter setup. An example of such an input file can be found in input/.
The code for loading a trained MACE model can be found in the script ```src/mace/load.py```, testing in ```src/mace/test.py```. An annotated notebook can be found in the [documentation](https://mace-code.readthedocs.io/en/latest/example/load%26test.html).

The script run.py trains the model, as explained by [Maes et al. (2024)](https://ui.adsabs.harvard.edu/abs/2024arXiv240503274M/abstract), and is immediately applied to the specified test dataset once training is finished. As such, it returns an averaged error on the MACE model compared to the classical model.
---
## Contact <a name="cont"></a>

If any comments or issues come up, please contact me via [email](mailto:[email protected]), or set up a GitHub issue.

---
## Acknowledgements <a name="ackn"></a>

The MACE architecture is free to use. Please cite our paper [Maes et al. (2024)](https://ui.adsabs.harvard.edu/abs/2024arXiv240503274M/abstract)



Expand Down
File renamed without changes.
8 changes: 0 additions & 8 deletions input/example.in
Original file line number Diff line number Diff line change
Expand Up @@ -25,11 +25,3 @@ elm = 0




Name = 20240604_133421

Name = 20240604_133421

Name = 20240604_133538

Name = 20240604_133932
Empty file added rate16.specs
Empty file.
17 changes: 9 additions & 8 deletions run.py → routine.py
Original file line number Diff line number Diff line change
@@ -1,10 +1,11 @@
import matplotlib.pyplot as plt
import numpy as np
import matplotlib.pyplot as plt
import numpy as np
import sys
import torch
from time import time
import datetime as dt
from tqdm import tqdm
from time import time
import datetime as dt
from tqdm import tqdm
import os

import src.mace.CSE_0D.dataset as ds
import src.mace.train as train
Expand All @@ -15,14 +16,14 @@
import src.mace.utils as utils
from src.mace.input import Input


source_dir = os.path.dirname(os.path.abspath(__file__))

specs_dict, idx_specs = utils.get_specs()

start = time()
now = dt.datetime.now()
name = str(now.strftime("%Y%m%d")+'_'+now.strftime("%H%M%S"))
path = '/STER/silkem/MACE/models/CSE_0D/'+name
path = source_dir+'/models/CSE_0D/'+name


## ================================================== INPUT ========
Expand All @@ -31,7 +32,7 @@
## READ INPUT FILE
arg = sys.argv[1]

infile = '/STER/silkem/MACE/input/'+arg+'.in'
infile = source_dir+'/input/'+arg+'.in'

input = Input(infile, name)

Expand Down
2 changes: 1 addition & 1 deletion src/mace/loss.py
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ def __init__(self, norm, fract, losstype):

parentpath = str(Path(__file__).parent)[:-15]

self.M = np.load(parentpath+'data/M_rate16.npy')
self.M = np.load(parentpath+'M_rate16.npy')

## initialise
self.set_losstype(losstype)
Expand Down

0 comments on commit 3a610d8

Please sign in to comment.