MosaicML's MPT-7B-StoryWriter-65k+ (4-bit quantization)

Description

This example shows how to use the MPT-7B-StoryWriter-65k+ model with a SimpleAI server.

It implements the `complete` method.

Setup

First build the image with:

docker build . -t mpt-7b-storywriter:0.1

Then declare your model in your SimpleAI configuration file models.toml:

[mpt-7b-storywriter]
    [mpt-7b-storywriter.metadata]
        owned_by    = 'MosaicML'
        permission  = []
        description = 'MPT-7B-StoryWriter-65k+ is a model designed to read and write fictional stories with super long context lengths. It was built by finetuning MPT-7B with a context length of 65k tokens on a filtered fiction subset of the books3 dataset. At inference time, thanks to ALiBi, MPT-7B-StoryWriter-65k+ can extrapolate even beyond 65k tokens.'
    [mpt-7b-storywriter.network]
        type = 'gRPC'
        url = 'localhost:50051'

Start service

Just start your container with:

docker run -it --rm -p 50051:50051 --gpus all mpt-7b-storywriter:0.1
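
Once the container is running, you may want to verify that the published port accepts connections before starting SimpleAI. The sketch below uses only the standard library; `is_port_open` is a hypothetical helper, and the host and port are taken from the `docker run` command above.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


# Example usage against the port published by the container:
#     is_port_open("localhost", 50051)
```

A successful connection only shows that the port is published and accepting TCP connections. It does not prove that the model has finished loading or that the gRPC service is ready to answer requests.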

Then start your SimpleAI instance, for example with:

simple_ai serve [--host 127.0.0.1] [--port 8080]
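
With both services running, a completion request can be sent to the SimpleAI instance. The sketch below is illustrative only: it assumes the server exposes an OpenAI-style `/completions` endpoint at the host and port shown above and accepts `model`, `prompt`, and `max_tokens` fields. Neither the path nor the payload is confirmed by this README, so check your instance's API documentation. Both function names are hypothetical.

```python
import json
import urllib.request


def build_completion_request(
    base_url: str, model: str, prompt: str, max_tokens: int = 64
) -> urllib.request.Request:
    """Build a POST request for an assumed OpenAI-style /completions endpoint."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/completions",  # assumed path
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(base_url: str, model: str, prompt: str, max_tokens: int = 64) -> dict:
    """Send the request and return the decoded JSON response."""
    request = build_completion_request(base_url, model, prompt, max_tokens)
    with urllib.request.urlopen(request, timeout=120) as response:
        return json.load(response)


# Example usage (requires the container and the SimpleAI instance to be running):
#     complete("http://127.0.0.1:8080", "mpt-7b-storywriter", "Once upon a time")
```

The model name passed in the request is the table name declared in `models.toml`. Building the request separately from sending it makes it easy to inspect the URL and payload without a live server.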
