You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
numa: Enable NUMA support. (NOTE: The initial value of this parameter is used for the remainder of the program as this value is set in llama_backend_init)
821
817
chat_format: String specifying the chat format to use when calling create_chat_completion.
822
818
chat_handler: Optional chat handler to use when calling create_chat_completion.
823
-
cache: Optional if true enables caching.
824
-
cache_type: String can be "ram" or "disk".
825
-
cache_size: Number of bytes to cache, defaults to 2GB
0 commit comments