llmcompressor.pytorch.model_load.helpers
fallback_to_cpu(device)
Takes in a device string and forces it to cpu if cuda is not available
Parameters:
Name | Type | Description | Default |
---|---|---|---|
device | str | device id to check | required |
Returns:
Type | Description |
---|---|
str | device modified for CUDA status |
Source code in src/llmcompressor/pytorch/model_load/helpers.py
get_completed_stages(checkpoint_dir)
Given a checkpoint directory for a staged run, get the list of stages that have completed in a prior run if the checkpoint_dir is a string
Parameters:
Name | Type | Description | Default |
---|---|---|---|
checkpoint_dir | Any | path to staged checkpoint | required |
Returns:
Type | Description |
---|---|
List[str] | list of completed stage names |
Source code in src/llmcompressor/pytorch/model_load/helpers.py
get_session_model()
Returns:
Type | Description |
---|---|
Optional[Module] | pytorch module stored by the active CompressionSession, or None if no session is active |
Source code in src/llmcompressor/pytorch/model_load/helpers.py
load_safetensors_state_dict(file_path)
Load a safetensors file from disk
Parameters:
Name | Type | Description | Default |
---|---|---|---|
file_path | str | path to the safetensors file | required |
Returns:
Type | Description |
---|---|
Dict[str, Tensor] | dictionary of safetensors data |
Source code in src/llmcompressor/pytorch/model_load/helpers.py
parse_dtype(dtype_arg)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
dtype_arg | Union[str, dtype] | dtype or string to parse | required |
Returns:
Type | Description |
---|---|
dtype | torch.dtype parsed from input string |
Source code in src/llmcompressor/pytorch/model_load/helpers.py
save_checkpoint(save_path, model, processor=None, save_safetensors=True, save_compressed=True, skip_sparsity_compression_stats=False)
Save a model, processor, and recipe
Parameters:
Name | Type | Description | Default |
---|---|---|---|
save_path | str | Path used to save model and processor | required |
model | PreTrainedModel | model to save | required |
processor | Optional[Processor] | processor to save | None |
save_safetensors | bool | save model checkpoint using safetensors file type | True |
save_compressed | bool | save model checkpoint using compressed-tensors format | True |
Source code in src/llmcompressor/pytorch/model_load/helpers.py
save_completed_stages(checkpoint_dir, completed_stages)
Save a list of completed stages to a checkpoint directory
Parameters:
Name | Type | Description | Default |
---|---|---|---|
checkpoint_dir | str | model checkpoint directory to save stages to | required |
completed_stages | List[str] | list of stage names that have been run | required |