Skip to content

Commit

Permalink
[doc] Placeholder for TrtLLM tutorial and tuning guide (#1333)
Browse files Browse the repository at this point in the history
  • Loading branch information
sindhuvahinis authored Nov 21, 2023
1 parent 00c94e3 commit ebe8821
Show file tree
Hide file tree
Showing 3 changed files with 7 additions and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
There are a number of shared configurations for python models running large language models.
They are also available through the [Large Model Inference Containers](https://github.com/aws/deep-learning-containers/blob/master/available_images.md#large-model-inference-containers).

### Common ([doc](https://github.com/deepjavalibrary/djl-serving/blob/521e0edadec35b04ec9e1d51b9e406119efd0235/serving/docs/configurations_large_model_inference_containers.md#common-doc))
### Common ([doc](https://docs.aws.amazon.com/sagemaker/latest/dg/large-model-inference-configuration.html))

| Item | Required | Description | Example value |
|----------------------------------------------------|-----------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------|
Expand Down
3 changes: 3 additions & 0 deletions serving/docs/lmi/tuning_guides/trtllm_tuning_guide.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# TensorRT LLM Tuning guide

This doc recommends the configurations based on your model and instance type.
3 changes: 3 additions & 0 deletions serving/docs/lmi/tutorials/trtllm_aot_tutorial.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# TensorRT LLM ahead of time compilation of models

This doc helps you to convert your HuggingFace model to Tensorrt-LLM LMI model format to load and run inference with Tensorrt-LLM.

0 comments on commit ebe8821

Please sign in to comment.