
Deploying Your PyTorch Model: Saving and Loading Techniques

Last updated: December 14, 2024

Deploying a machine learning model such as a PyTorch model into a production environment involves several critical steps. The first is ensuring that the model can be saved efficiently and loaded reliably. In this article, we go over the main techniques for saving and loading PyTorch models, with code examples to illustrate each one.

Saving and Loading Model State

The most straightforward way to save and load a PyTorch model is through its state dictionary. A state_dict is a Python dictionary that maps each layer's parameter names to their learnable tensors, such as weights and biases.
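To get a feel for what a state_dict contains, you can iterate over its entries. The tiny model below is purely illustrative; any nn.Module works the same way:

import torch
import torch.nn as nn

# A small illustrative model
model = nn.Sequential(nn.Linear(10, 5), nn.ReLU(), nn.Linear(5, 2))

# Each entry maps a parameter name to its tensor
for name, tensor in model.state_dict().items():
    print(name, tuple(tensor.shape))
# 0.weight (5, 10)
# 0.bias (5,)
# 2.weight (2, 5)
# 2.bias (2,)

Note that parameter-free layers such as ReLU contribute no entries.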

Saving a Model

To save a model's state_dict, you can use the following code:

import torch

# 'model' is an instance of your trained nn.Module subclass
torch.save(model.state_dict(), 'model.pth')

In this code snippet, torch.save() serializes the state_dict to a file named model.pth. The filename can be anything you prefer, though .pth and .pt are the conventional extensions.
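In practice it is often useful to save a richer checkpoint that also captures optimizer state and training progress so training can resume later. A minimal sketch, assuming epoch, loss, model, and optimizer come from your training loop:

import torch

# epoch, loss, model, and optimizer are assumed to exist in your training loop
checkpoint = {
    'epoch': epoch,
    'model_state_dict': model.state_dict(),
    'optimizer_state_dict': optimizer.state_dict(),
    'loss': loss,
}
torch.save(checkpoint, 'checkpoint.pth')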

Loading a Model

When loading the saved weights, first reconstruct the model architecture, then load the state_dict into it:

import torch
from my_model import MyModel  # Assume MyModel is the model class

model = MyModel()
model.load_state_dict(torch.load('model.pth'))
model.eval()

Here, load_state_dict() copies the saved parameters into the model. The model.eval() call switches the model to evaluation mode, disabling training-only behaviors such as dropout and making batch normalization use its running statistics, which is what you want during inference.
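If the weights were saved on one device (say, a GPU) and you are loading on another, pass map_location to torch.load so the saved tensors are remapped onto the device you actually have. A brief sketch, reusing the hypothetical MyModel from above:

import torch
from my_model import MyModel

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

model = MyModel()
# map_location remaps saved tensors (e.g., GPU-saved weights) onto the target device
state_dict = torch.load('model.pth', map_location=device)
model.load_state_dict(state_dict)
model.to(device)
model.eval()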

Saving and Loading the Entire Model

Besides saving the model's state_dict, you can pass the whole model object to torch.save(), which pickles the architecture together with the parameters. This can be convenient because you do not have to instantiate the model class yourself when loading. Note, however, that the class definition must still be importable at load time, which makes this approach more brittle across code refactors.

Save the Entire Model

To save the model entirely, use the following code:

import torch

torch.save(model, 'entire_model.pth')

Load the Entire Model

To load the whole model back, use:

import torch

model = torch.load('entire_model.pth')
model.eval()

This restores the entire model, architecture and parameters together.
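One caveat: recent PyTorch releases (2.6 and later) default torch.load to weights_only=True, which refuses to unpickle arbitrary objects such as a whole model. On those versions you must opt out explicitly, and you should only do so for files from a trusted source:

import torch

# weights_only=False allows unpickling the full module object;
# only use it with checkpoint files you trust
model = torch.load('entire_model.pth', weights_only=False)
model.eval()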

Script and Trace for Model Export

For more robust deployment, PyTorch provides TorchScript, which serializes your model into a form that can be optimized and executed outside of Python (for example, in a C++ runtime via LibTorch).

Using TorchScript

There are two primary ways to create TorchScript models: Tracing and Scripting.

1. Trace Method

Tracing builds a TorchScript module by recording the operations executed during one forward pass with example inputs:

import torch

# An example input with the shape your model expects (this shape is illustrative)
example_tensor = torch.randn(1, 3, 224, 224)

traced_script_module = torch.jit.trace(model, example_tensor)
traced_script_module.save("traced_model.zip")

Here, torch.jit.trace() produces a TorchScript module by recording a single run of the model. Because only the executed path is recorded, data-dependent control flow (such as if statements on tensor values) is not captured; use scripting for such models.

2. Script Method

Scripting compiles the model's Python source directly to TorchScript, preserving control flow, which makes it the better choice when the model's logic does not suit tracing:

import torch

scripted_model = torch.jit.script(model)
scripted_model.save("scripted_model.zip")

Here, torch.jit.script() analyzes and compiles the model's code, including any branches and loops.
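Either artifact, traced or scripted, can be reloaded with torch.jit.load, whether in Python or in a C++ runtime via LibTorch. A quick Python-side sketch; the input shape here is just an assumption:

import torch

loaded = torch.jit.load('scripted_model.zip')
loaded.eval()

# Run inference; the input shape is illustrative
with torch.no_grad():
    output = loaded(torch.randn(1, 3, 224, 224))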

Conclusion

Knowing the different ways to save and load PyTorch models lets you choose the most suitable one for your use case. Whether you save state dictionaries, entire models, or TorchScript artifacts for production-level deployment, PyTorch provides a flexible environment that adapts to your needs.
