How to Use NumPy for Advanced Image Processing Techniques

Updated: January 23, 2024 By: Guest Contributor Post a comment

Table Of Contents

3 Use Case #2: Image Filtering with Convolution

4 Use Case #3: Image Rescaling and Interpolation

Introduction

In the world of image processing, NumPy is a fundamental library in Python that facilitates high-performance operations on large arrays of data, which is crucial when dealing with images. This guide will explore various advanced image processing techniques using NumPy and demonstrate how to implement them.

Use Case #1: Image Color Inversion

Image color inversion is one of the simplest yet dramatic transformations you can apply. It involves transforming all the colors of an image to their respective complementary colors. This is typically done by subtracting each color value from the maximum value, which for an 8-bit image is 255.

Load the image using a library that can read image files, like PIL or OpenCV.

Convert the image to a NumPy array.
Subtract the original values from 255 to invert the colors.
Convert the NumPy array back to an image.

Display or save the processed image.

Code Example:

import numpy as np
from PIL import Image

# Load the image
image = Image.open('photo.jpg')

# Convert the image to a NumPy array
np_image = np.array(image)

# Invert colors
inverted_image = 255 - np_image

# Convert back to an image
inverted_image = Image.fromarray(inverted_image)

# Display the image
inverted_image.show()

This operation is very straightforward and fast since it’s a basic arithmetic operation applied on a matrix. It works well for analyzing highlights and shadows in images but does not hold much importance in nuanced image analysis.

Use Case #2: Image Filtering with Convolution

Image filtering allows you to apply various effects, from blurring and sharpening to edge detection. This can be done by convolving the image with a specific kernel (a small matrix). Convolution alters the value of a pixel by combining it with the values of neighboring pixels weighted by the kernel.

Create a kernel matrix representing the desired filter.
Use the ‘signal’ module from the SciPy library to apply the convolution operation.

Handle the edges of the image. For example, you can ignore them, wrap them around, or pad with zeros.
Convert the resulting matrix back to an image and save or display it.

Code Example:

import numpy as np
from scipy import signal
from PIL import Image

# Example of a simple blur kernel
kernel = np.ones((5, 5)) / 25

# Load the image
image = Image.open('photo.jpg').convert('L')  # Convert image to grayscale

# Convert the image to a NumPy array
np_image = np.array(image)

# Apply convolution
filtered_image = signal.convolve2d(np_image, kernel, boundary='fill', fillvalue=0)

# Convert the numpy array back to an image
filtered_image = Image.fromarray(filtered_image)

# Display the image
filtered_image.show()

Convolution is a powerful operation in image processing but it can be computationally intensive. The process can be optimized using the Fast Fourier Transform (FFT) for larger kernels or utilizing GPU acceleration.

Use Case #3: Image Rescaling and Interpolation

Rescaling images is a common preprocessing step. Whether you’re resizing an image to fit a certain size requirement or you’re zooming in or out, it involves changing the dimensions of the image. This can lead to loss of information or introduction of new data, and how new pixel values are calculated is known as interpolation.

Define the new dimensions for the image.

Use the ‘zoom’ function from the SciPy library for rescaling.
During the rescaling, choose an appropriate interpolation method like nearest, bilinear, or cubic.
Convert the rescaled array back to an image and save or display it.

Code Example:

import numpy as np
from scipy.ndimage import zoom
from PIL import Image

# Load the image
image = Image.open('photo.jpg')

# Convert the image to a NumPy array
np_image = np.array(image)

# Define new width and height
width, height = np_image.shape[1], np_image.shape[0]
new_width, new_height = width * 2, height * 2  # Example: scaling by a factor of 2

# Calculate the scaling factors
zoom_factors = (new_height / height, new_width / width, 1)

# Apply zoom (rescale image)
rescaled_image = zoom(np_image, zoom_factors)

# Convert the numpy array back to an image
rescaled_image = Image.fromarray(rescaled_image.astype('uint8'))

# Display the image
rescaled_image.show()

The choice of interpolation method has a big impact on the quality of the rescaled image. Bilinear and cubic interpolations are generally used as they produce smoother results than the nearest-neighbor interpolation. Performance-wise, upscaling images is usually more computationally expensive than downscaling them due to the higher data volume.

Conclusion

In this guide, we’ve explored a few advanced image processing techniques you can implement using NumPy, including color inversion, filtering with convolution, and rescaling along with interpolation. Each technique has its approach and applications, and understanding how they work gives a strong foundation for further exploration of image processing in Python.

Next Article: How to Optimize NumPy Code for Performance

Previous Article: Is there a way to use async/await with NumPy?

Series: NumPy Intermediate & Advanced Tutorials

NumPy