description |
---|
This quick start guide will show you how to run a Pinot cluster using Docker. |
{% hint style="success" %} This is a quickstart guide that will show you how to quickly start an example recipe in a standalone instance and is meant for learning. To run Pinot in cluster mode, please take a look at Manual cluster setup. {% endhint %}
{% hint style="info" %} Prerequisites
Install Docker
You can also try Kubernetes quick start if you already have a local minikube cluster installed or Docker Kubernetes setup.
If running locally, please ensure your docker cluster has enough resources, below is a sample config. {% endhint %}
We'll be using our docker image apachepinot/pinot:latest
to run this quick start, which does the following:
- Sets up the Pinot cluster
- Creates a sample table and loads sample data
The following quick-start scripts are available
- Batch example
- Streaming example
- Hybrid example
Before running the scripts, create an isolated bridge network pinot-demo
in docker. This will allow all docker containers to easily communicate with each other. You can create the network using the following command -
docker network create -d bridge pinot-demo
In this example we demonstrate how to do batch processing with Pinot.
- Starts Pinot deployment by starting
- Apache Zookeeper
- Pinot Controller
- Pinot Broker
- Pinot Server
- Creates a demo table
baseballStats
- Launches a standalone data ingestion job
- Builds one Pinot segment for a given CSV data file for table
baseballStats
- Pushes the built segment to the Pinot controller
- Builds one Pinot segment for a given CSV data file for table
- Issues sample queries to Pinot
docker run \
--network=pinot-demo \
--name pinot-quickstart \
-p 9000:9000 \
-d apachepinot/pinot:latest QuickStart \
-type batch
Once the Docker container is running, you can view the logs by running the following command.
docker logs pinot-quickstart -f
That's it! We've spun up a Pinot cluster.
{% hint style="info" %} It may take a while for all the Pinot components to start and for the sample data to be loaded.
Use the below command to check the status in the container logs.
docker logs pinot-quickstart -f
{% endhint %}
Your cluster is ready once you see the cluster setup completion messages and sample queries, as demonstrated below.
You can head over to Exploring Pinot to check out the data in the baseballStats
table.
In this example we demonstrate how to do stream processing with Pinot.
- Starts Pinot deployment by starting
- Apache Kafka
- Apache Zookeeper
- Pinot Controller
- Pinot Broker
- Pinot Server
- Creates a demo table
meetupRsvp
- Launches a
meetup
****stream - Publishes data to a Kafka topic
meetupRSVPEvents
to be subscribed to by Pinot - Issues sample queries to Pinot
# stop previous container, if any, or use different network
docker run \
--network=pinot-demo \
--name pinot-quickstart \
-p 9000:9000 \
-d apachepinot/pinot:latest QuickStart \
-type stream
Once the cluster is up, you can head over to Exploring Pinot to check out the data in the meetupRSVPEvents
table.
In this example we demonstrate how to do hybrid stream and batch processing with Pinot.
- Starts Pinot deployment by starting
- Apache Kafka
- Apache Zookeeper
- Pinot Controller
- Pinot Broker
- Pinot Server
- Creates a demo table
airlineStats
- Launches a standalone data ingestion job
- Builds Pinot segments under a given directory of Avro files for table
airlineStats
- Pushes built segments to Pinot controller
- Builds Pinot segments under a given directory of Avro files for table
- Launches a ****stream of flights stats
- Publishes data to a Kafka topic
airlineStatsEvents
to be subscribed to by Pinot - Issues sample queries to Pinot
# stop previous container, if any, or use different network
docker run \
--network=pinot-demo \
--name pinot-quickstart \
-p 9000:9000 \
-d apachepinot/pinot:latest QuickStart \
-type hybrid
Once the cluster is up, you can head over to Exploring Pinot to check out the data in the airlineStats
table.