StarkWizard is a state-of-the-art Language Model (LLM) tailored for the Cairo smart contract language of StarkNet. With our meticulously designed training process and dedicated integration with StarkNet's intricacies, we aim to provide a robust and reliable LLM for all your Cairo smart contract needs.
- Cairo LLM Integration: Crafted specifically for Cairo's smart contract language.
- Training script: A cross-platform script to train the model on Windows/Linux/MacOS.
- Merge Capabilities: Seamlessly merge the training weights of PEFT with the original model(s).
- Quantization: AWQ (PC only) and GGUF notebooks to generate quantized versions of the merged models.
please visit:
StarkWizard On Hugging Face See LM Studio for a UI to chat with the model
Create a venv and install the requirements
pip install -r requirements.txt
If you are on Apple Silicon platform, please use the following:
pip install -r requirements-Mac.txt
Apple Silicon requires to install the nightly build of pyTorch, to get support of MPS
Activate the venv
conda activate cairo-llm
If you want to train or merge, you first need to install wizardlib. You only need to install the lib for training and merging.
cd wizardlib
pip install -e .
Building the dataset
If you have modified the corpus, you will need to rebuild the associated dataset. Please note that you need some special rights and be part of the HF StarkWizard organization to push to the StarkWizard HF repo.
get into the "0 buidl datasets" directory and run the first 2 cells of the notebook
Training:
If you are on mac, and want to train code-llama-instruct
cd 1-train
./mac-train-llama.sh
on other plateforms:
cd 1-train
./train-llama.sh
Please create custom sh files in order to tweak the training parameters parameters.
Follow this link to get a list of the available params Training Parameters
Merging:
The training outputs a PEFT model that can be merged to the original model in order to get a full model. Use the merge.py script
cd 2-merge
python merge.py
Please find the parameters for merging Merging Parameters
The merged model is uploaded after merging. You need to get authorization to push to the HF StarkWizartd repos
Quantizing:
Two notebooks are proposed to generate AWQ and GGUF quantitization. Please note that AWQ is not supported on Apple Silicon.
NEW The directory 0-build datasets contains a Gemini Data synthetizer. You need to have an aenvironment variabble called GEMINI_KEY, containing a configured access to gcloud Generative AI.
StarkWizard/
│
├── 0-build datasets/ # Contains a notebook to merge csv and upload the resulting dataset
│
├── 1-train/ # the train.py command
│
├── 2-merge/ # The merge.py command
│
├── doc/ # md files for merge and train
│
├── backup/ # POC of this project, outdated and useless
│
├── wizarlib/ # A lib used for convenient functions shared accross scripts
StarkWizard is an open-source project, and we welcome contributions of all kinds: new models, bug fixes, improvements to the documentation, and more. See our contributing guide for more details on how to get started.
For support, questions, or feedback, please open an issue or mail [email protected].
StarkWizard is open-sourced under the MIT License. See the LICENSE file for more details.