You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -347,7 +347,9 @@ class llama_model_kv_override(Structure):
347
347
# int32_t main_gpu; // the GPU that is used for scratch and small tensors
348
348
# const float * tensor_split; // how to split layers across multiple GPUs (size: LLAMA_MAX_DEVICES)
349
349
350
-
# // called with a progress value between 0 and 1, pass NULL to disable
350
+
# // Called with a progress value between 0.0 and 1.0. Pass NULL to disable.
351
+
# // If the provided progress_callback returns true, model loading continues.
352
+
# // If it returns false, model loading is immediately aborted.
351
353
# llama_progress_callback progress_callback;
352
354
# // context pointer passed to the progress callback
353
355
# void * progress_callback_user_data;
@@ -367,7 +369,7 @@ class llama_model_params(Structure):
367
369
n_gpu_layers (int): number of layers to store in VRAM
368
370
main_gpu (int): the GPU that is used for scratch and small tensors
369
371
tensor_split (ctypes.Array[ctypes.c_float]): how to split layers across multiple GPUs (size: LLAMA_MAX_DEVICES)
370
-
progress_callback (llama_progress_callback): called with a progress value between 0 and 1, pass NULL to disable
372
+
progress_callback (llama_progress_callback): called with a progress value between 0.0 and 1.0. Pass NULL to disable. If the provided progress_callback returns true, model loading continues. If it returns false, model loading is immediately aborted.
371
373
progress_callback_user_data (ctypes.c_void_p): context pointer passed to the progress callback
372
374
kv_overrides (ctypes.Array[llama_model_kv_override]): override key-value pairs of the model meta data
373
375
vocab_only (bool): only load the vocabulary, no weights
0 commit comments