Hybrid tables in StarTree Cloud combine real-time and offline ingestion in a single logical table. You can query streaming and batch data together, without specifying which data source you are accessing.
A hybrid table consists of two physical tables that share the same name:
- A real-time table that ingests data from streaming sources (e.g., Kafka)
- An offline table that contains historical data loaded from batch sources
The query broker intelligently routes queries to the appropriate segments based on time boundaries, providing a unified view of your data. When an offline segment is pushed to cover a time period that overlaps with real-time data, the broker automatically prioritizes the offline segments for that period.
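The time-boundary routing described above can be sketched roughly as follows. This is a simplified illustration, not the broker's actual implementation: the function name `merge_hybrid_rows`, the `ts` field, and the way the boundary is supplied are all assumptions made for the example. It only shows the idea that rows at or before the boundary come from the offline table and rows after it come from the real-time table, so an overlapping period is never counted twice.

```python
from typing import Dict, List


def merge_hybrid_rows(
    offline_rows: List[Dict],
    realtime_rows: List[Dict],
    time_boundary_ms: int,
) -> List[Dict]:
    """Combine rows from both physical tables around a time boundary.

    Hypothetical helper: offline data wins for every timestamp at or
    before the boundary; real-time data is used only after it.
    """
    # Offline side: keep rows with ts <= boundary.
    offline_part = [r for r in offline_rows if r["ts"] <= time_boundary_ms]
    # Real-time side: keep rows with ts > boundary.
    realtime_part = [r for r in realtime_rows if r["ts"] > time_boundary_ms]
    return offline_part + realtime_part


# Example: both tables contain data for ts=200; only the offline copy is used.
offline = [{"ts": 100, "src": "offline"}, {"ts": 200, "src": "offline"}]
realtime = [{"ts": 200, "src": "realtime"}, {"ts": 300, "src": "realtime"}]
merged = merge_hybrid_rows(offline, realtime, time_boundary_ms=200)
```

In this sketch the boundary is passed in explicitly; in a real deployment the broker derives it from the offline segments that have been pushed.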
A scheduled task, such as Apache Pinot's RealtimeToOfflineSegmentsTask, can run periodically to create offline segments from real-time data, which simplifies the maintenance of hybrid tables.
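As a rough illustration of how a periodic task of this kind might choose the next slice of real-time data to convert, the sketch below models the windowing used by Apache Pinot's RealtimeToOfflineSegmentsTask. That this is the task meant here is an assumption, and the function and parameter names (`next_window`, `watermark_ms`, `bucket_ms`, `buffer_ms`) are hypothetical. The idea is that each run processes one fixed-size time bucket, starting at a stored watermark, and skips the run if the bucket is still too recent.

```python
from typing import Optional, Tuple


def next_window(
    watermark_ms: int,
    bucket_ms: int,
    buffer_ms: int,
    now_ms: int,
) -> Optional[Tuple[int, int]]:
    """Return the (start, end) window to convert, or None if it is too recent.

    Assumed behaviour: the window starts at the watermark, spans one bucket,
    and is processed only once it is older than the buffer period.
    """
    window_start = watermark_ms
    window_end = window_start + bucket_ms
    # Leave recent data in the real-time table until the buffer has passed.
    if window_end > now_ms - buffer_ms:
        return None
    return (window_start, window_end)


DAY_MS = 24 * 60 * 60 * 1000
# With a one-day bucket and a two-day buffer, day 0 becomes eligible at day 3.
window = next_window(watermark_ms=0, bucket_ms=DAY_MS, buffer_ms=2 * DAY_MS, now_ms=3 * DAY_MS)
```

After a successful run, the watermark would advance to the end of the processed window so that the next run picks up the following bucket.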
Configuring a hybrid table requires the Controller APIs, because this setup is not yet available through the Data Portal interface. For detailed configuration instructions and examples, refer to the hybrid table configuration documentation.
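To make the "two physical tables, one name" idea concrete, here is a minimal sketch that builds the two table-config payloads one might submit to the controller. It assumes the Apache Pinot table-config layout (`tableName`, `tableType`, `segmentsConfig`, `tableIndexConfig`) and that configs are created with a `POST /tables` request on the controller, as in Apache Pinot. The table name `orders`, the time column `event_time_ms`, the Kafka topic, and the broker address are hypothetical, and the configs are deliberately incomplete: a working real-time config needs further stream settings (consumer factory, decoder, and so on), and authentication is not shown.

```python
import json
from typing import Dict, Tuple


def build_hybrid_table_configs(table_name: str, time_column: str) -> Tuple[Dict, Dict]:
    """Build skeleton OFFLINE and REALTIME configs that share one table name."""
    # Settings common to both physical tables (illustrative values only).
    segments_config = {"timeColumnName": time_column, "replication": "1"}

    offline = {
        "tableName": table_name,
        "tableType": "OFFLINE",
        "segmentsConfig": dict(segments_config),
        "tableIndexConfig": {},
        "tenants": {},
        "metadata": {},
    }
    realtime = {
        "tableName": table_name,
        "tableType": "REALTIME",
        "segmentsConfig": dict(segments_config),
        "tableIndexConfig": {
            # Incomplete on purpose: other required stream keys are omitted.
            "streamConfigs": {
                "streamType": "kafka",
                "stream.kafka.topic.name": "orders-topic",    # hypothetical
                "stream.kafka.broker.list": "localhost:9092",  # hypothetical
            }
        },
        "tenants": {},
        "metadata": {},
    }
    return offline, realtime


offline_cfg, realtime_cfg = build_hybrid_table_configs("orders", "event_time_ms")
# Each payload would be sent as the JSON body of a separate POST /tables request.
offline_body = json.dumps(offline_cfg)
realtime_body = json.dumps(realtime_cfg)
```

Because both payloads carry the same `tableName` and differ only in `tableType`, queries against `orders` are served by both tables as a single hybrid table.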