When working with machine learning in PyTorch, one often circles through key tasks: data preparation, model training, and making predictions. Understanding and combining these steps into a streamlined workflow can greatly enhance your development process, allowing for efficient experimentation and optimization. Let’s explore how each component fits into creating a robust PyTorch model and how you can implement it in code.
Data Preparation
Data preparation is the cornerstone of any machine learning project. In PyTorch, this typically involves the torch.utils.data.Dataset and torch.utils.data.DataLoader classes. First, define a custom dataset class by inheriting from torch.utils.data.Dataset, then wrap it in a DataLoader, which organizes data loading and handles batching and shuffling.
import torch
from torch.utils.data import Dataset, DataLoader

class CustomDataset(Dataset):
    def __init__(self, data, targets):
        self.data = data
        self.targets = targets

    def __len__(self):
        # Number of samples in the dataset
        return len(self.data)

    def __getitem__(self, idx):
        # Return a single (input, target) pair
        return self.data[idx], self.targets[idx]

# Example data following y = 2x
X = torch.tensor([[1.0], [2.0], [3.0], [4.0]])
y = torch.tensor([[2.0], [4.0], [6.0], [8.0]])

# Create dataset and data loader
dataset = CustomDataset(X, y)
dataloader = DataLoader(dataset, batch_size=2, shuffle=True)
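To confirm the loader behaves as expected, you can iterate over it once and inspect the batch shapes. This is a quick illustrative check, not part of the training pipeline:

for batch_idx, (inputs, targets) in enumerate(dataloader):
    # With 4 samples and batch_size=2, this prints two batches of shape (2, 1)
    print(f'Batch {batch_idx}: inputs {tuple(inputs.shape)}, targets {tuple(targets.shape)}')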
Model Definition and Training
Defining a model in PyTorch involves creating a class that inherits from torch.nn.Module. You need to define an initialization method and the forward pass. Let's focus on a simple linear regression model:
import torch.nn as nn

class LinearRegressionModel(nn.Module):
    def __init__(self, input_dim, output_dim):
        super(LinearRegressionModel, self).__init__()
        self.linear = nn.Linear(input_dim, output_dim)

    def forward(self, x):
        return self.linear(x)

# Model instantiation
input_dim = 1
output_dim = 1
model = LinearRegressionModel(input_dim, output_dim)
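Before training, a single forward pass on a dummy input is a useful shape sanity check; the output values themselves are meaningless here because the weights are still randomly initialized:

# Sanity check: verify input and output shapes line up
with torch.no_grad():
    sample = torch.tensor([[1.0]])
    print(model(sample).shape)  # torch.Size([1, 1])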
Once your model is defined, choose a loss function and an optimizer to train it. A common combination for regression problems is the Mean Squared Error loss with the Stochastic Gradient Descent optimizer.
import torch.optim as optim

criterion = nn.MSELoss()
optimizer = optim.SGD(model.parameters(), lr=0.01)

# Training loop
num_epochs = 100
for epoch in range(num_epochs):
    for inputs, targets in dataloader:
        # Zero gradients from the previous step
        optimizer.zero_grad()
        # Forward pass
        outputs = model(inputs)
        loss = criterion(outputs, targets)
        # Backward pass and parameter update
        loss.backward()
        optimizer.step()
    # Log the last batch's loss every 10 epochs
    if epoch % 10 == 0:
        print(f'Epoch [{epoch}/{num_epochs}], Loss: {loss.item():.4f}')
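Because the example data follow y = 2x, you can check what the model actually learned by inspecting the linear layer's parameters after training. The exact numbers vary from run to run, but the weight should land close to 2 and the bias close to 0:

# Inspect learned parameters; values are run-dependent approximations
w = model.linear.weight.item()
b = model.linear.bias.item()
print(f'Learned weight: {w:.4f}, bias: {b:.4f}')  # expect roughly weight 2, bias 0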
Making Predictions
After training, the model is ready to make predictions. Switch it to evaluation mode with model.eval(), and disable gradient tracking with torch.no_grad() before passing in new input data.
# Set model to eval mode
model.eval()

with torch.no_grad():
    new_data = torch.tensor([[5.0]])
    predicted = model(new_data)
    print(f'Predicted value: {predicted.item():.4f}')
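The same pattern extends to batches: stack several inputs into one tensor and the model returns one prediction per row. The input values below are illustrative, not taken from the training set:

# Predict for several inputs at once; each row is an independent sample
batch = torch.tensor([[5.0], [6.0], [7.0]])
with torch.no_grad():
    preds = model(batch)
print(preds.squeeze().tolist())  # three predictions, one per input row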
In summary, PyTorch provides a structured framework for managing custom datasets, a flexible module system for building a variety of models, and straightforward mechanisms for training and inference. Mastering these steps positions you to adapt and extend PyTorch to the demands of your own projects.