A platform for training large language model based multi-turn chatbots.
Built on FastChat, trl, trlx, and Hugging Face Transformers.
- Supports Llama, Baichuan, and Qwen series models.
- Character-aware chat templates.
- Long-sequence training with FlashAttention-2.
- A stacked dataset class for training on long chats.
- Ghost Attention (GAtt) in templates.
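The stacked dataset mentioned above can be pictured as greedily packing several tokenized chats into fixed-length training sequences. A minimal sketch follows; the function name `pack_chats` and the pad token id are illustrative assumptions, not this repo's actual API.

```python
from typing import List

PAD_ID = 0  # assumed pad token id; the real tokenizer's pad id would be used


def pack_chats(chats: List[List[int]], max_len: int) -> List[List[int]]:
    """Greedily stack tokenized chats into sequences of exactly max_len tokens.

    Chats are appended to the current sequence while they fit; when the next
    chat would overflow, the current sequence is padded and a new one starts.
    """
    packed: List[List[int]] = []
    current: List[int] = []
    for chat in chats:
        chat = chat[:max_len]  # truncate chats longer than the window
        if current and len(current) + len(chat) > max_len:
            packed.append(current + [PAD_ID] * (max_len - len(current)))
            current = []
        current.extend(chat)
    if current:
        packed.append(current + [PAD_ID] * (max_len - len(current)))
    return packed
```

Packing short chats this way keeps long-context training batches dense instead of padding each short conversation to the full sequence length.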
```shell
pip3 install --no-cache-dir -i https://mirrors.aliyun.com/pypi/simple/ --upgrade -e ".[data]"
```
- Process data

```shell
python -m llm.data.process_chats --dataset PIPPA
```
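Processed chats are then rendered through a character-aware template before tokenization. A minimal sketch of that idea, assuming hypothetical names (`render_chat`, the `<<SYS>>` prefix, and the default role names are illustrative, not this repo's actual format):

```python
from typing import List, Tuple


def render_chat(
    system: str,
    turns: List[Tuple[str, str]],
    user_name: str = "User",
    bot_name: str = "Assistant",
) -> str:
    """Render a multi-turn chat, substituting character names into role headers.

    Hypothetical template: character names (rather than generic role tags) are
    written before each turn so the model learns persona-consistent replies.
    """
    lines = [f"<<SYS>> {system}"]
    for role, text in turns:
        name = bot_name if role == "bot" else user_name
        lines.append(f"{name}: {text}")
    return "\n".join(lines)
```

A usage example with a character name swapped in:

```python
render_chat("Stay in character.", [("user", "Hi"), ("bot", "Hello!")], bot_name="Aria")
# renders the system line, then "User: Hi" and "Aria: Hello!"
```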