Skip to content

Latest commit

 

History

History
71 lines (50 loc) · 3.6 KB

README.md

File metadata and controls

71 lines (50 loc) · 3.6 KB

Imara: GenAI application development in Google Cloud

The Imara project showcases GenAI application development in Google Cloud, including:

  • Tuning of Google and Hugging Face foundation models using Reinforcement Learning from Human Feedback (RLHF) in Vertex AI.
  • Automating Continuous Integration (CI), Continuous Training (CT) and Continuous Delivery (CD) for machine learning. This automation streamlines the development and deployment processes, ensuring efficient and reliable model updates to deliver value to users more quickly.

Setup

Before you begin, use the setup guide to configure your project, service account, permissions, storage and more.

Development

In machine learning domain, Jupyter notebooks provide a highly interactive environment for model development which supports iterative refinement of machine learning models.

Development

This project provides several notebooks:

  1. Bison Notebook: demonstrates fine tuning a PaLM 2 Text Bison model from Vertex AI Model Garden.
  2. (Coming Soon) T5 Notebook: demonstrates fine tuning the Text-To-Text Transfer Transformer model from Hugging Face.

The notebooks can be executed locally or on Vertex AI through either Vertex AI Workbench or Colab Enterprise.

Vertex AI Workbench

  • Create a Vertex AI Workbench instance.
  • Clone this GitHub repository in your instance.
  • Open a notebook within your Vertex AI Workbench instance and execute the steps.

Colab Enterprise

TBD (will add the steps after testing it out)

Automation

This project demonstrates Continuous Integration (CI), Continuous Training (CT) and Continuous Delivery (CD) to accelerate deployment of LLMs into production and value delivery of GenAI applications.

Google Cloud

This project uses a suite of Google Cloud products to facilitate CI/CT/CD of the PaLM 2 Text Bison model, including:

Upon any code or data changes in this repository, Cloud Build triggers the execution of the workflow, ensuring efficient and reliable GenAI application development and delivery.

GitHub Actions on Google Cloud

(Coming Soon)

This project uses GitHub Actions to facilitate CI/CT/CD of the Text-To-Text Transfer Transformer model from Hugging Face, including: