diff --git a/README.md b/README.md index 0512dee481..16dfe6991d 100644 --- a/README.md +++ b/README.md @@ -356,6 +356,13 @@ See [examples](examples) for quick start. It is recommended to duplicate and mod - typescript type: ... # unimplemented custom format + # fastchat conversation + # See 'conversation' options: https://github.com/lm-sys/FastChat/blob/main/fastchat/conversation.py + datasets: + - path: ... + type: sharegpt + conversation: chatml + # local datasets: - path: data.jsonl # or json @@ -394,7 +401,7 @@ See [examples](examples) for quick start. It is recommended to duplicate and mod
-All yaml options +All yaml options (click me) ```yaml # This is the huggingface model that contains *.pt, *.safetensors, or *.bin files @@ -461,7 +468,9 @@ datasets: data_files: # Optional[str] path to source data files shards: # Optional[int] number of shards to split data into name: # Optional[str] name of dataset configuration to load - conversation: # Optional[str] fastchat conversation type, only used with type: sharegpt + + # Optional[str] fastchat conversation type, only used with type: sharegpt + conversation: # Options (see Conversation 'name'): https://github.com/lm-sys/FastChat/blob/main/fastchat/conversation.py # Custom user prompt - path: repo