Data pipeline that moves data from the Totesys OLTP database to an OLAP database for business intelligence (BI).
Every 20 minutes, the pipeline moves data from a relational OLTP database into an OLAP database organised as a star schema. The data pipeline consists of three ETL stages.
- Extract: take the data from the OLTP database and store it in JSON format in an S3 bucket
- Transform: reshape the JSON data into a star schema and store it in Parquet format
- Load: load the data into the final OLAP data warehouse
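
The extract stage above can be sketched as follows. This is a minimal illustration and not the project's actual code: the function names, the `currency` table, its columns, and the object key layout are all assumptions made for the example. It shows how query results might be serialised to JSON and given a timestamped S3 key so that each 20-minute run writes a new object.

```python
import json
from datetime import datetime
from decimal import Decimal


def to_json_safe(value):
    """Convert values that json cannot encode (datetime, Decimal) to plain types."""
    if isinstance(value, datetime):
        return value.isoformat()
    if isinstance(value, Decimal):
        return float(value)
    raise TypeError(f"Cannot serialise {type(value).__name__}")


def rows_to_json(table_name, columns, rows):
    """Serialise query results (column names plus row tuples) as a JSON document."""
    records = [dict(zip(columns, row)) for row in rows]
    return json.dumps({table_name: records}, default=to_json_safe)


def build_s3_key(table_name, extracted_at):
    """Build a timestamped object key (hypothetical layout) for one extraction run."""
    return f"{table_name}/{extracted_at:%Y/%m/%d/%H-%M-%S}.json"


# Hypothetical table and rows, standing in for a real database query result.
run_time = datetime(2024, 1, 1, 12, 20, 0)
body = rows_to_json(
    "currency",
    ["currency_id", "currency_code", "created_at"],
    [(1, "GBP", datetime(2024, 1, 1, 12, 0, 0))],
)
key = build_s3_key("currency", run_time)
# In a Lambda function the upload could then be done with boto3, for example:
# boto3.client("s3").put_object(Bucket="<ingestion-bucket>", Key=key, Body=body)
```

Keeping serialisation and key building as pure functions means they can be unit tested without access to AWS or the database.
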
The final schema of the data warehouse is a star schema made up of 3 fact tables and 8 dimension tables.
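
To illustrate what reshaping into a star schema involves, here is a small sketch that splits one OLTP-style order record into a fact row and a date-dimension row. The column names (`sales_order_id`, `staff_id`, `units_sold`, `unit_price`, `created_at`) and the function names are assumptions for the example, not the project's confirmed schema.

```python
from datetime import datetime


def build_dim_date(day):
    """Build one date-dimension row keyed by the calendar date."""
    return {
        "date_id": day.isoformat(),
        "year": day.year,
        "month": day.month,
        "day": day.day,
        "day_of_week": day.isoweekday(),  # Monday is 1, Sunday is 7
        "quarter": (day.month - 1) // 3 + 1,
    }


def build_fact_row(order):
    """Turn one order record into a fact row that references dimensions by key."""
    created = datetime.fromisoformat(order["created_at"])
    return {
        "sales_order_id": order["sales_order_id"],
        "created_date": created.date().isoformat(),  # key into the date dimension
        "created_time": created.time().isoformat(),
        "staff_id": order["staff_id"],  # key into a (hypothetical) staff dimension
        "units_sold": order["units_sold"],  # measures stay on the fact table
        "unit_price": order["unit_price"],
    }


def transform(orders):
    """Return (fact rows, date-dimension rows) for a batch of order records."""
    facts = [build_fact_row(order) for order in orders]
    days = {datetime.fromisoformat(order["created_at"]).date() for order in orders}
    dim_date = [build_dim_date(day) for day in sorted(days)]
    return facts, dim_date
```

In the pipeline, rows like these would then be written out in Parquet format, for instance with pandas `DataFrame.to_parquet`, which is left out here to keep the sketch dependency-free.
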
- Lambda
- Step Functions
- CloudWatch
- EventBridge
- S3
- RDS
- QuickSight
- SNS
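
One plausible way some of the services listed above fit together is sketched below. It assumes that EventBridge starts a Step Functions state machine on a schedule and that the state machine runs one Lambda function per ETL stage; the source does not confirm this wiring, and the ARNs, account ID, region, and state names are hypothetical placeholders. The definition is written in Amazon States Language and built as a Python dictionary.

```python
import json

# Hypothetical Lambda ARNs; replace with the real function ARNs.
EXTRACT_ARN = "arn:aws:lambda:eu-west-2:123456789012:function:extract"
TRANSFORM_ARN = "arn:aws:lambda:eu-west-2:123456789012:function:transform"
LOAD_ARN = "arn:aws:lambda:eu-west-2:123456789012:function:load"

# EventBridge schedule expression matching the 20-minute cadence.
SCHEDULE_EXPRESSION = "rate(20 minutes)"


def build_state_machine(extract_arn, transform_arn, load_arn):
    """Return an Amazon States Language definition that runs the three stages in order."""
    return {
        "Comment": "Extract, transform, and load, one Lambda task per stage",
        "StartAt": "Extract",
        "States": {
            "Extract": {"Type": "Task", "Resource": extract_arn, "Next": "Transform"},
            "Transform": {"Type": "Task", "Resource": transform_arn, "Next": "Load"},
            "Load": {"Type": "Task", "Resource": load_arn, "End": True},
        },
    }


definition = build_state_machine(EXTRACT_ARN, TRANSFORM_ARN, LOAD_ARN)
definition_json = json.dumps(definition, indent=2)
```

Running the stages as separate tasks means a failure in one stage stops the run before later stages execute, and each stage's logs can be inspected separately in CloudWatch.
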
- Build cloud infrastructure
- Code for Lambda functions
- Upload data into the final warehouse
- Add business intelligence
- Refactor
- Optimise
- Heiman
- Leonette
- Michael
- Mostyn
- Nicholas