PostgreSQL: Creating a sequence using a specific data type

Introduction
Understanding Sequences in PostgreSQL
Basic Sequence Creation
Specifying a Data Type
1. Example: Creating an INTEGER Based Sequence
2. Customizing Data Types in Hierarchical IDs
Viewing and Changing Sequence Properties
Sequence and Data Type Use Cases
Advanced Usage: Combining Sequences and Default Values
1. Caveats and Considerations
2. Performance Impact and Cache
Conclusion

Introduction

Sequences are essential database objects in PostgreSQL used to generate a series of unique numbers, often serving as surrogate primary keys. This tutorial walks you through creating sequences with explicit data types to tailor their behavior for specific needs.

Understanding Sequences in PostgreSQL

In PostgreSQL, a sequence is a special kind of database object that generates a serially incrementing number sequence. It’s often used to create unique identifiers for rows in a table, commonly known as a surrogate key. When creating a sequence, you can specify parameters such as the increment value, minimum and maximum values, and whether the sequence should cycle.

Before creating a sequence with a specific data type, it is important to understand that PostgreSQL implicitly uses the BIGINT data type for sequences. However, if you require a different data type (e.g., INTEGER), this can be done with some consideration of the capacity and range of the desired type.

Basic Sequence Creation

CREATE SEQUENCE my_sequence_name;

This simple command creates a sequence with default characteristics. The sequence starts at 1, increments by 1, and is based on the BIGINT data type.

Specifying a Data Type

To create a sequence with a specific data type, use the following structure. Note that while you can influence the range by choosing a data type like INTEGER instead of BIGINT, PostgreSQL does not inherently create sequences with explicit data types. Instead, you will manage the range and behavior understanding the chosen data type’s limitations.

Example: Creating an INTEGER Based Sequence

CREATE SEQUENCE my_smallint_sequence
    AS SMALLINT
    START WITH 1
    INCREMENT BY 1
    MINVALUE 1
    MAXVALUE 32767;

This sequence will generate numbers ranging from 1 to 32767, fitting within the range of the SMALLINT type. Similarly, you could use INTEGER for a wider but still limited-range sequence.

Customizing Data Types in Hierarchical IDs

In advanced scenarios, you might need hierarchical or multifaceted sequences. Here’s where custom-sized sequences can shine. For instance, you could use an SMALLINT-based sequence for one part of an identifier, and an INTEGER-based for another. Despite not being fully supported in pure sequence syntax, this can still be managed by application logic or stored procedures.

Viewing and Changing Sequence Properties

Inspecting the properties and altering a created sequence is vital for database management. By using ALTER SEQUENCE, you can change aspects of your sequence post-creation.

ALTER SEQUENCE my_smallint_sequence
    INCREMENT BY 2;

This command alters the attributes of the sequence and have it increment by two instead of one. Be mindful that alterations that conflict with your sequences’ data type constraints (like a max value of 32767 for SMALLINT) can cause errors.

Sequence and Data Type Use Cases

Sequences can play various roles depending on the application. For example, sequences act as auto-incrementing primary keys, tracking order fulfilment, or paginating large datasets. Choosing the correct data type boils down to understanding your data’s scope and how it will grow.

Advanced Usage: Combining Sequences and Default Values

You can also integrate sequences within table column defaults, ensuring that insert operations without explicit ID specifications use the next sequence value automatically.

CREATE TABLE my_table (
    id INTEGER DEFAULT nextval('my_smallint_sequence'),
    content VARCHAR(255)
);

This simplifies record creation, as the database will handle the unique ID generation without additional input.

Caveats and Considerations

While sequences are powerful tools, mismanagement can result in performance bottlenecks or exhausted ranges. It’s crucial to monitor usage and adjust sequence and data type configurations as necessary.

Performance Impact and Cache

Remember that sequence caching can positively affect performance by preallocating a number of sequence values. However, within a failover or crash scenario, these cached values may be lost, resulting in potential gaps in the sequence.

Conclusion

PostgreSQL sequences are robust elements for unique number generation, and while they default to BIGINT, understanding how to control their behavior through the effective execution of data types and range parameters is key to building scalable, efficient databases. Intelligent sequence management ensures they serve your applications well over their lifetime.

Next Article: PostgreSQL: Descending sequence and CYCLE sequence

Previous Article: PostgreSQL: Sequence with START WITH, INCREMENT, MINVALUE and MAXVALUE

Series: PostgreSQL Tutorials: From Basic to Advanced

PostgreSQL