Kafka: How to Customize Start Offset for a Consumer
Updated: Jan 30, 2024
Introduction Apache Kafka is a powerful streaming platform that enables you to process and analyze data in real-time. It’s built on the concept of producers writing data to topics and consumers reading from those topics.......
How to delete a Kafka topic (with examples)
Updated: Jan 30, 2024
Understanding Kafka Topic Deletion Apache Kafka is a popular distributed streaming platform that provides high-throughput and low-latency event processing. Kafka stores streams of records in categories called topics. As you work with......
Understanding Kafka max.poll.records (with examples)
Updated: Jan 30, 2024
Overview Apache Kafka is a highly popular distributed streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. Kafka consumers read......
Kafka: 3 ways to delete committed offsets for a consumer group
Updated: Jan 30, 2024
Overview Managing consumer groups and offsets is a crucial aspect of using Apache Kafka effectively. At times you might need to reset the offsets for a consumer group to a previous state, perhaps for reprocessing messages or recovering......
Kafka: How to change the number of partitions in a topic
Updated: Jan 30, 2024
Introduction Apache Kafka is a widely used event streaming platform that has become the backbone of many real-time analytics and monitoring systems. One of the key configurations of a Kafka topic is its partitions, which dictate the......
Kafka: How to set retention time for messages in a topic
Updated: Jan 30, 2024
Introduction Apache Kafka is a distributed streaming platform designed for building real-time data pipelines and streaming applications. One of its core features is the ability to store records (messages) in topics for a specified......
Understanding max.poll.interval.ms in Kafka (with examples)
Updated: Jan 30, 2024
Introduction Apache Kafka is a distributed streaming platform that allows for high-throughput, fault-tolerant, publish-subscribe messaging. Kafka, at its core, is designed to provide a durable, scalable platform for handling streams of......
Understanding session.timeout.ms in Kafka (through examples)
Updated: Jan 30, 2024
Introduction Apache Kafka has become a pivotal piece in modern data-driven applications architecture, enabling high-throughput, fault-tolerant messaging and stream processing. At the core of Kafka’s reliability is its distributed......
Kafka: How to import/export CSV/TXT data
Updated: Jan 30, 2024
Overview Working with Apache Kafka, one of the most popular distributed streaming platforms, often requires the ability to import and export data in common file formats like CSV and TXT. This tutorial will walk you through the basics......
Kafka: How to read events from a topic (with examples)
Updated: Jan 30, 2024
Introduction Apache Kafka is a popular open-source stream-processing software platform developed by the Apache Software Foundation, written in Scala and Java. It is designed to handle data feeds with high throughput and low latency.......
Kafka: How to add events to a topic with a timestamp
Updated: Jan 30, 2024
Introduction Apache Kafka has become the backbone of many modern data-driven applications because of its capability to handle high-throughput, resilient distributed streaming. Timestamps play a crucial role in Kafka messages, serving......
Kafka: Limit the memory used for buffering across all threads
Updated: Jan 30, 2024
Introduction Apache Kafka is a distributed streaming platform that is widely used for building real-time data pipelines and streaming applications. One of the key aspects of operating a Kafka cluster is managing the memory used for......