Skip to content
This repository has been archived by the owner on Oct 16, 2023. It is now read-only.

Releases: hpcaitech/EnergonAI

V0.0.1 Released Today!

13 Sep 07:03
aeb486b
Compare
Choose a tag to compare

Overview

EnergonAI is a service framework for large-scale model inference, which is powered by ColossalAI. It support large model inference with tensor parallelism and pipeline parallelism. The most important example of this release is serving OPT. You can serve OPT-175B conveniently using EnergonAI.

What's Changed

Read more