This repository has been archived by the owner on Jun 21, 2024. It is now read-only.

Will instruction fine-tuned models be made available as well? #2

Open
allthingssecurity opened this issue May 9, 2023 · 8 comments

Comments

@allthingssecurity

Will instruction fine-tuned models be made available for this as well?

@conceptofmind
Owner

@allthingssecurity All instruction-finetuned models on FLAN will be made publicly available as well.

@allthingssecurity
Author

Thanks for such a quick reply. When can we expect the same?

@conceptofmind
Owner

conceptofmind commented May 9, 2023

> Thanks for such a quick reply. When can we expect the same?

The 2.1b model is training now. 2b won't be done for days. After that finishes, I will start training all of the Flan-PaLM models.

@allthingssecurity
Author

Please let me know if I can help. I can dedicate some compute to it. I have already done some fine-tuning of Flan models in the past.

@Njasa2k

Njasa2k commented May 9, 2023

Will there be bigger models than 2B?

@conceptofmind
Owner

> Will there be bigger models than 2B?

That is largely dependent on whether CarperAI and StabilityAI want to pursue larger training runs for PaLM. I can say that there is a plan to train a much larger Sparrow model similar to PaLM with RLHF on more tokens.

You can join the CarperAI discord to follow the projects: https://discord.gg/canadagoose

@varunnathan

Hello Team,

First of all, great work. This is super helpful for the open source community.
I wanted to check whether there are any updates on the instruction fine-tuned models.

Thanks.

@conceptofmind
Owner

I do not have access to any GPUs, so no models will be trained.
