Barkha/apache-pinot-workshops

Apache Pinot Workshops

Welcome to the Apache Pinot Workshops repo on GitHub. This repo contains a series of workshops designed to get you started with Apache Pinot, each focused on teaching one concept at a time. The workshops tend to be incremental, so we recommend you start at the beginning. If you are already familiar with the concepts, you may skip some workshops.

If you are an organization interested in hosting these workshops, feel free to contact me here.

A Bit About Apache Pinot

Apache Pinot is a real-time distributed OLAP datastore, purpose-built for low-latency, high-throughput analytics and well suited to user-facing analytical workloads.

Some examples of companies using Pinot are:

  • Stripe uses Pinot to query more than 1 trillion rows (over 1 petabyte of data) in under 1 second.
  • LinkedIn ingests 1 million events per second while serving 200K queries per second at under 100 ms latency.
  • Uber Eats serves 30K+ queries per second over more than 200 terabytes of data at under 100 ms latency.

Why learn Pinot?

Pinot is worth learning if you are looking to solve problems related to:

  • Large datasets
  • Low-latency queries
  • High query concurrency
  • Combined batch and real-time consumption
  • Making streaming data easily queryable

The Workshops

  1. Getting Started with Pinot - Batch
  2. Getting Started with Pinot - Streaming with Kafka
  3. Getting Started with ThirdEye
  4. Visualization: Superset with Pinot
  5. Log Analytics with Pinot

Upcoming Workshops

  • Working with Functions for Transformation
  • Working with Minion for Running Jobs
  • Working with Indexes
  • Connecting to Front End with Streamlit

Requests

Would you like to see some hacks that you don't see here? We take requests! Contact me here.
