This demo shows it is possible to run neural network inference on StarkNet.
Change directory to `contracts/deployment` and run `python deploy_and_run.py`. The demo will:
- Train the model and split it into Cairo contracts
- Deploy the contracts to StarkNet testnet and perform address linkage: store the addresses of child contracts in their parent contracts (a minimal sketch of this step follows the list)
- Call the top-level contract to perform inference
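The linkage step can be pictured as below. This is a hedged sketch: `deploy_contract`, `invoke`, and the `set_children` entry point are hypothetical stand-ins for the actual StarkNet calls made by `deploy_and_run.py`.

```python
# Hypothetical sketch of deployment + address linkage. deploy_contract,
# invoke, and "set_children" are illustrative stand-ins, not the real API.
_registry: dict[int, dict] = {}

def deploy_contract(compiled_path: str) -> int:
    """Stand-in for deploying a compiled Cairo contract; returns its address."""
    addr = 1000 + len(_registry)
    _registry[addr] = {"path": compiled_path, "storage": {}}
    return addr

def invoke(address: int, function: str, calldata: list[int]) -> None:
    """Stand-in for invoking an external function on a deployed contract."""
    _registry[address]["storage"][function] = calldata

# Deploy leaves first (the 8 sub-sop contracts of one sop), then the parent.
sub_sop_addrs = [deploy_contract(f"sub_sop_{i}.json") for i in range(8)]
sop_addr = deploy_contract("sop_aggregator.json")

# Address linkage: the parent stores its children's addresses so it can
# call them when aggregating at inference time.
invoke(sop_addr, "set_children", [len(sub_sop_addrs)] + sub_sop_addrs)
```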
- Model: `x => FC1 (W1: 784x32) => v1 => ReLU => h1 => FC2 (W2: 32x10) => z`
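For reference, here is the same forward pass in plain numpy; the shapes come from the diagram above, and the random weights are placeholders for the trained parameters:

```python
import numpy as np

W1, b1 = np.random.randn(784, 32), np.zeros(32)  # placeholder weights
W2, b2 = np.random.randn(32, 10), np.zeros(10)

x = np.random.randn(784)   # flattened 28x28 input
v1 = x @ W1 + b1           # FC1
h1 = np.maximum(v1, 0.0)   # ReLU
z = h1 @ W2 + b2           # FC2 -> 10 logits
```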
- Divide `x*W1` into 32 sums-of-products (sops), and divide each sop into 8 sub-sops, where the first 7 sub-sops have 100 terms each and the 8th has 84 terms; don't forget the biases. Each sub-sop is converted to a contract in which the parameters are constants. Aggregator contracts are created for the sops and for `v1` (see the sketch after this list).
- Create one contract for the ReLU operation.
- Divide `h1*W2` into 10 sops, each converted to a contract in which the parameters are constants. An aggregator contract is created for `z`.
- Create one top-level contract that exposes an `inference()` function for external calls.
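A minimal numpy sketch of the layer-1 decomposition, assuming the split sizes above (7*100 + 84 = 784 terms per sop); the on-chain version works over field elements rather than floats, which is elided here:

```python
import numpy as np

W1, b1 = np.random.randn(784, 32), np.zeros(32)  # placeholder weights
x = np.random.randn(784)

# Term ranges of the 8 sub-sops: 7 chunks of 100 terms, then one of 84.
bounds = [0, 100, 200, 300, 400, 500, 600, 700, 784]

def sub_sop(j: int, k: int) -> float:
    """k-th sub-sop of sop j: a partial dot product over one term range.
    On StarkNet this is one contract with its W1 slice baked in as constants."""
    lo, hi = bounds[k], bounds[k + 1]
    return float(x[lo:hi] @ W1[lo:hi, j])

def sop(j: int) -> float:
    """Aggregator for sop j: sum the 8 sub-sops and add the bias."""
    return sum(sub_sop(j, k) for k in range(8)) + b1[j]

v1 = np.array([sop(j) for j in range(32)])
assert np.allclose(v1, x @ W1 + b1)  # matches the monolithic matvec
```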
- Calling the top-level `inference()` directly results in an HTTP timeout. Additionally, the Cairo CPU is single-threaded, so a single call runs every sop sequentially with no parallelism.
- Instead, the demo script calls each sop in a parallel job, trading compute latency for network latency (see the sketch below).
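A sketch of the parallel fan-out, with a stand-in `call_sop` in place of the real StarkNet invocation (the sleep emulates the network round-trip):

```python
import time
from concurrent.futures import ThreadPoolExecutor

def call_sop(address: int) -> int:
    """Stand-in for one StarkNet call over HTTP; the sleep emulates the
    network round-trip (the real helper would invoke the sop contract)."""
    time.sleep(1.0)
    return 0  # dummy partial result

def run_layer(sop_addresses: list[int]) -> list[int]:
    # Each sop is independent, so issue all calls concurrently: total wall
    # time is ~one round-trip instead of len(sop_addresses) round-trips.
    with ThreadPoolExecutor(max_workers=len(sop_addresses)) as pool:
        return list(pool.map(call_sop, sop_addresses))

results = run_layer(list(range(32)))  # 32 layer-1 sops: ~1s total, not ~32s
```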
- The part in the demo script that handles contract deployment and address linkage is commented out.
Next steps:
- conv layers
- recurrent networks
- backprop -- would enable on-chain learning and open a new world; we would have to deal with the memory cost of storing activations