This project focuses on creating an analytics dashboard for an e-commerce company that sells unique all-occasion gifts. The project leverages data from an online retail dataset to perform data extraction, analysis, and visualization. The goal is to gain insights into various aspects of the business, such as sales, customers, products, and orders.
- Dataset
- Project Structure
- Project Outline
- Setup
- Data Extraction
- Data Cleaning
- Data Analysis
- Data Visualization
- Dashboard Creation
- Usage
- Contributing
- License
- Name: Online Retail Data Set
- Source: UCI Machine Learning Repository
- Description: Contains approximately 541,909 records of transactions made by customers of an online retail company. Each record includes information about the customer, product, quantity, price, and other transaction-related details.
The project is organized into the following sections:
- Data Extraction
- Data Cleaning
- Data Analysis
- Data Visualization
- Dashboard Creation
-
Data Extraction:
- Automated data flow with Apache NiFi to load data into MYSQ database.
-
Data Cleaning:
- Cleaned and preprocessed data in Python.
- Addressed missing values and outliers.
-
Data Analysis:
- Used SQL queries to perform data analysis.
- Calculated sales trends, customer segments, and repeat purchase rates.
- created views of the analysis
-
Data Visualization:
- Utilized Tableau for creating interactive visualizations of the views created in the analysis.
- Developed charts and graphs to represent key insights.
-
Dashboard Creation:
- Assembled interactive dashboards in Tableau for data exploration.
To run this project, you'll need:
- Apache NiFi for data extraction.
- A relational database (e.g., MySQL) to store and access the data.
- Python for data cleaning and preprocessing.
- VSCode for SQL queries.
- Tableau for data visualization and dashboard creation.
-
Clone this repository.
-
Follow the steps outlined in the Project Outline section to set up your environment and execute the project.
Contributions are welcome! Feel free to open issues or submit pull requests to enhance the project.
This project is licensed under the MIT License.