Extract, Transform and Load a sales dataset for business intelligence visualization
The following Python libraries are needed to run the project:
- pandas
- scipy
- sqlalchemy
- python-dotenv
Install them by running
pip install pandas scipy sqlalchemy python-dotenv
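To confirm that the environment is ready before opening the notebook, a small check like the one below can help. It is only a sketch: the helper name `missing_modules` is made up for illustration, and the one detail worth noting is that python-dotenv is imported under the module name `dotenv`.

```python
from importlib.util import find_spec

# Import names for the libraries listed above.
# Note: the python-dotenv package is imported as "dotenv".
REQUIRED_MODULES = ["pandas", "scipy", "sqlalchemy", "dotenv"]


def missing_modules(modules):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required libraries are installed.")
```

If anything is reported missing, rerun the pip command in the same Python environment that Jupyter uses.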
- Clone the repo
git clone https://github.com/ashenafiDL/sales-ETL.git
- Add Git Large File Storage (Git LFS) support to your account, so that files larger than 100MB can be version controlled
  - install git-lfs (sudo apt install git-lfs on Ubuntu)
  - enable Git LFS by running
    git lfs install
- Create a .env file and set it up as in the .env.example file
- Open the Load and Transform Jupyter notebook and run each cell one by one
- Visualizations are available in the visualization-pdf and visualization.pbit files
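For orientation, the kind of work the notebook does with the .env settings can be sketched roughly as follows. This is not the notebook's actual code: the variable name `DATABASE_URL`, the column names, the table name `sales`, and the toy rows are all assumptions made for illustration, so use the keys listed in .env.example and the real dataset's columns instead.

```python
import os

import pandas as pd
from sqlalchemy import create_engine, text

try:
    # python-dotenv copies KEY=value pairs from a local .env file into os.environ.
    from dotenv import load_dotenv
    load_dotenv()
except ImportError:
    pass  # fall back to whatever is already set in the environment

# Hypothetical variable name; defaults to an in-memory SQLite database.
DATABASE_URL = os.getenv("DATABASE_URL", "sqlite:///:memory:")


def transform(raw: pd.DataFrame) -> pd.DataFrame:
    """Clean a raw sales table (column names are illustrative)."""
    clean = raw.drop_duplicates().copy()
    clean["order_date"] = pd.to_datetime(clean["order_date"])
    clean["revenue"] = clean["quantity"] * clean["unit_price"]
    return clean


def load(clean: pd.DataFrame, engine, table: str = "sales") -> int:
    """Write the cleaned table to the database and return the stored row count."""
    clean.to_sql(table, engine, if_exists="replace", index=False)
    with engine.connect() as conn:
        return conn.execute(text(f"SELECT COUNT(*) FROM {table}")).scalar()


if __name__ == "__main__":
    # Made-up rows standing in for the extracted sales data.
    raw = pd.DataFrame(
        {
            "order_date": ["2024-01-05", "2024-01-05", "2024-01-06"],
            "quantity": [2, 2, 1],
            "unit_price": [10.0, 10.0, 4.5],
        }
    )
    engine = create_engine(DATABASE_URL)
    print("rows loaded:", load(transform(raw), engine))
```

Keeping the connection string in .env rather than in the notebook means credentials stay out of version control, which is presumably why the repo ships a .env.example file rather than a real .env file.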