Cross AZ data transfer cost
In a StarTree Pinot cluster, Pinot servers utilize low-level Kafka consumers to retrieve data from Kafka brokers. When a Pinot consumer operates in a different Availability Zone than the Kafka broker hosting the required partition, each fetch request generates cross-AZ network traffic. Cross-AZ traffic for Kafka consumers creates several challenges:
- Increased costs: Cross-AZ data transfer incurs additional charges
- Higher latency: Network requests across zones introduce additional delay
- Reduced reliability: Cross-zone communication increases potential failure points
Keeping consumer traffic within the same Availability Zone provides the following benefits:
- Improved application performance through reduced latency
- Significant cost savings on data transfer fees
- Enhanced system reliability and fault tolerance

Solution Architecture
The optimization strategy centers on implementing AZ-aware Kafka consumers using the Kafka RackAwareReplicaSelector. This approach ensures that Pinot servers preferentially consume from Kafka brokers within the same Availability Zone.
- Step 1: Implement AZ-Aware Instance Assignment (recommended but optional). Configure the instance assignment strategy to consider Availability Zone placement when distributing workloads across the cluster.
- Step 2: Configure AZ-Aware Table Settings
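To make the rack-aware behavior concrete, the following is a minimal Python sketch of how a rack-aware replica selector roughly decides which broker serves a fetch. It is a simplified model, not Kafka's or Pinot's actual code. The setting names `broker.rack`, `replica.selector.class`, and `client.rack` are real Kafka settings; the zone names, the `Replica` class, and the `select_read_replica` function are hypothetical and exist only for illustration. How StarTree Pinot passes `client.rack` to its consumers is not shown here.

```python
from dataclasses import dataclass
from typing import List, Optional

# Real Kafka setting names; the zone values are illustrative assumptions.
BROKER_SETTINGS = {
    "broker.rack": "az-a",
    "replica.selector.class":
        "org.apache.kafka.common.replica.RackAwareReplicaSelector",
}
CONSUMER_SETTINGS = {
    "client.rack": "az-a",  # should match the zone the consumer runs in
}


@dataclass(frozen=True)
class Replica:
    """Hypothetical stand-in for one in-sync replica of a partition."""
    broker_id: int
    rack: Optional[str]
    log_end_offset: int
    is_leader: bool = False


def select_read_replica(replicas: List[Replica],
                        client_rack: Optional[str]) -> Replica:
    """Simplified model: prefer a replica in the client's rack, else the leader."""
    leader = next(r for r in replicas if r.is_leader)
    if not client_rack:
        return leader  # no rack information: fetch from the leader
    same_rack = [r for r in replicas if r.rack == client_rack]
    if not same_rack:
        return leader  # best effort: no local replica, so cross-AZ fetch
    if leader in same_rack:
        return leader  # the leader is already local
    # Otherwise pick the most caught-up replica in the client's rack.
    return max(same_rack, key=lambda r: r.log_end_offset)


# Example partition with one replica per zone; the leader lives in az-a.
replicas = [
    Replica(broker_id=1, rack="az-a", log_end_offset=100, is_leader=True),
    Replica(broker_id=2, rack="az-b", log_end_offset=98),
    Replica(broker_id=3, rack="az-c", log_end_offset=100),
]
local = select_read_replica(replicas, "az-b")    # follower in az-b
fallback = select_read_replica(replicas, "az-d")  # no replica in az-d
```

In this model a consumer in `az-b` reads from the follower on broker 2 and stays within its zone, while a consumer in a zone with no replica falls back to the leader. That fallback is why the setup still helps on a best-effort basis when Pinot servers and Kafka brokers are not spread across exactly the same zones.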
Implementation details
Make instance assignment AZ-aware
Note: This step is recommended but not required. Even if the Pinot servers are not provisioned in exactly the same zones as the Kafka cluster, the setup still yields partial benefits on a best-effort basis.
Example config:

