Frequently asked questions - Event Hubs for Apache Kafka

This article provides answers to some of the frequently asked questions on migrating to Event Hubs for Apache Kafka.

Does Azure Event Hubs run on Apache Kafka?

No. Azure Event Hubs is a cloud-native multi-tier broker with support for multiple protocols that is developed and maintained by Microsoft and doesn't use any Apache Kafka code. One of the supported protocols is the Kafka RPC protocol for the Kafka client's consumer and producer APIs. Event Hubs works with many of your existing Kafka applications. For more information, see Event Hubs for Apache Kafka. Because the concepts of Apache Kafka and Azure Event Hubs are very similar (but not identical), we're able to offer the unmatched reliability of Azure Event Hubs to customers with existing Apache Kafka investments.

Event Hubs consumer group vs. Kafka consumer group

What's the difference between an Event Hubs consumer group and a Kafka consumer group on Event Hubs? Kafka consumer groups on Event Hubs are fully distinct from standard Event Hubs consumer groups.

Event Hubs consumer groups

  • They're managed with create, retrieve, update, and delete (CRUD) operations via the portal, SDK, or Azure Resource Manager templates. Event Hubs consumer groups can't be autocreated.
  • They're child entities of an event hub. This means that the same consumer group name can be reused between event hubs in the same namespace because they're separate entities.
  • They aren't used for storing offsets. Orchestrated AMQP consumption is done using external offset storage, for example, Azure Storage.

Kafka consumer groups

  • They're autocreated. Kafka groups can be managed via the Kafka consumer group APIs.
  • They can store offsets in the Event Hubs service.
  • They're used as keys in what is effectively an offset key-value store. For a unique pair of group.id and topic-partition, we store an offset in Azure Storage (3x replication). Event Hubs users don't incur extra storage costs from storing Kafka offsets. Offsets are manipulable via the Kafka consumer group APIs, but the offset storage accounts aren't directly visible or manipulable for Event Hubs users.
  • They span a namespace. Using the same Kafka group name for multiple applications on multiple topics means that all applications and their Kafka clients will be rebalanced whenever only a single application needs rebalancing. Choose your group names wisely.
  • They're fully distinct from Event Hubs consumer groups. You don't need to use '$Default', nor do you need to worry about Kafka clients interfering with AMQP workloads.
  • They aren't viewable in the Azure portal. Consumer group info is accessible via Kafka APIs.
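
The offset key-value idea described above can be pictured with a small in-memory model. This is only a conceptual sketch, not how the service is implemented: the `OffsetStore` class and the `group_id_for` naming helper are hypothetical names introduced for illustration. It shows offsets keyed by the pair of group.id and topic-partition, and one possible convention for giving each application its own group name so that unrelated applications don't share a group.

```python
from typing import Dict, Optional, Tuple

# Conceptual key: (group.id, topic, partition). The value is the committed offset.
OffsetKey = Tuple[str, str, int]


class OffsetStore:
    """Toy in-memory stand-in for a service-side offset store (illustration only)."""

    def __init__(self) -> None:
        self._offsets: Dict[OffsetKey, int] = {}

    def commit(self, group_id: str, topic: str, partition: int, offset: int) -> None:
        # One offset is kept per unique (group.id, topic-partition) pair.
        self._offsets[(group_id, topic, partition)] = offset

    def fetch(self, group_id: str, topic: str, partition: int) -> Optional[int]:
        # Returns None when nothing has been committed for this key.
        return self._offsets.get((group_id, topic, partition))


def group_id_for(application: str, topic: str) -> str:
    """Hypothetical naming convention: one group name per application and topic."""
    return f"{application}.{topic}"


store = OffsetStore()
billing = group_id_for("billing", "orders")
audit = group_id_for("audit", "orders")

# Two applications read the same partition but track their progress independently.
store.commit(billing, "orders", 0, 42)
store.commit(audit, "orders", 0, 7)
```

Because each application uses its own group name here, a rebalance in one group would not involve the clients of the other.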

Does Azure Event Hubs for Apache Kafka support shared access signature token authentication?

Authentication with OAuth 2.0 and with a shared access signature (SAS) is supported.

Generated shared access signature tokens, which are created by using an authorization rule and one of its signing keys, aren't supported when you use the Event Hubs for Apache Kafka endpoint.
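
As a minimal sketch of shared access signature authentication from a Kafka client, the namespace connection string can be passed as the SASL PLAIN password with the literal user name `$ConnectionString`. The property names below follow librdkafka-based clients such as confluent-kafka, which is an assumption about the client in use. The helper name `event_hubs_kafka_config`, the namespace `mynamespace`, and the connection string placeholder are hypothetical.

```python
from typing import Dict


def event_hubs_kafka_config(namespace: str, connection_string: str) -> Dict[str, str]:
    """Build Kafka client settings for an Event Hubs namespace (illustrative helper)."""
    return {
        # The Kafka endpoint of the namespace listens on port 9093.
        "bootstrap.servers": f"{namespace}.servicebus.windows.net:9093",
        "security.protocol": "SASL_SSL",
        "sasl.mechanism": "PLAIN",
        # Literal user name; the connection string itself is the password.
        "sasl.username": "$ConnectionString",
        "sasl.password": connection_string,
    }


# Placeholder values; replace them with your own namespace and connection string.
config = event_hubs_kafka_config("mynamespace", "<your Event Hubs connection string>")
```

The resulting dictionary is the kind of mapping that a librdkafka-based producer or consumer constructor accepts.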

Does Azure Event Hubs support Apache Kafka idempotent producers and consumers?

Yes. Azure Event Hubs supports Apache Kafka idempotent producers and consumers.
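
On the producer side, idempotence is typically switched on through client configuration. The following sketch assumes a librdkafka-style configuration dictionary. The helper name `with_idempotence` and the example base settings are hypothetical.

```python
from typing import Dict, Union

ConfigValue = Union[str, int, bool]


def with_idempotence(base: Dict[str, ConfigValue]) -> Dict[str, ConfigValue]:
    """Return a copy of a producer configuration with idempotence enabled."""
    config = dict(base)  # Copy so the caller's settings are left untouched.
    config["enable.idempotence"] = True
    # Idempotent producers require acknowledgment from all in-sync replicas.
    config["acks"] = "all"
    return config


# Hypothetical base settings; a real configuration also needs authentication settings.
base_config: Dict[str, ConfigValue] = {
    "bootstrap.servers": "mynamespace.servicebus.windows.net:9093",
}
producer_config = with_idempotence(base_config)
```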