how to specify the number of cpu threads? #3215
-
Beta Was this translation helpful? Give feedback.
Answered by
imtuyethan
Sep 2, 2024
Replies: 1 comment
-
"n_gl" or "n_gpu_layers" is a setting that controls how many layers of the AI model are loaded into the GPU memory. It's basically a way to balance between speed and memory usage:
Suggestions
|
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
imtuyethan
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
"n_gl" or "n_gpu_layers" is a setting that controls how many layers of the AI model are loaded into the GPU memory. It's basically a way to balance between speed and memory usage:
Suggestions