End-to-end data transformation pipeline built with dbt, BigQuery, and Python. This project simulates a real-world logistics scenario, processing 50,000+ trip records to analyze fuel efficiency across a commercial fleet.
The project is structured following the Medallion Architecture to ensure data traceability and quality:
1. BRONZE Layer (Staging)
- `stg_trips`: Technical cleansing, date normalization, and sensor-error filtering.
- `stg_vehicles`: Standardization of truck fleet metadata.
- `stg_drivers`: Processing of driver master records.
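A Bronze staging model along these lines could look as follows — a minimal sketch, assuming the raw source is registered as `source('raw', 'trips')` and the column names (`trip_date_raw`, `distance_km`, `fuel_liters`) are illustrative, not the project's actual schema:

```sql
-- models/staging/stg_trips.sql (illustrative sketch; source and column names are assumptions)
with source as (
    select * from {{ source('raw', 'trips') }}
)

select
    cast(trip_id as string)                    as trip_id,
    cast(vehicle_id as string)                 as vehicle_id,
    cast(driver_id as string)                  as driver_id,
    -- SAFE.PARSE_DATE returns NULL instead of erroring on malformed dates
    safe.parse_date('%Y-%m-%d', trip_date_raw) as trip_date,
    cast(distance_km as float64)               as distance_km,
    cast(fuel_liters as float64)               as fuel_liters
from source
-- Filter obvious sensor errors: non-positive distance or fuel readings
where cast(distance_km as float64) > 0
  and cast(fuel_liters as float64) > 0
```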
2. SILVER Layer (Intermediate)
silver_fleet_performance: Integration table joining telemetry (trips) with dimensions (drivers and vehicles). Includes fuel efficiency business logic and outlier handling.
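The Silver integration model could be sketched like this, assuming the staging models expose `driver_id`/`vehicle_id` join keys and the dimension columns shown (`driver_name`, `model`) — all names are illustrative:

```sql
-- models/intermediate/silver_fleet_performance.sql (sketch; column names are assumptions)
select
    t.trip_id,
    t.trip_date,
    d.driver_name,
    v.model                                          as vehicle_model,
    t.distance_km,
    t.fuel_liters,
    -- Fuel efficiency in litres per 100 km; SAFE_DIVIDE yields NULL on zero distance
    safe_divide(t.fuel_liters, t.distance_km) * 100  as l_per_100km
from {{ ref('stg_trips') }} t
left join {{ ref('stg_drivers') }}  d on t.driver_id  = d.driver_id
left join {{ ref('stg_vehicles') }} v on t.vehicle_id = v.vehicle_id
-- Outlier handling: drop physically implausible efficiency readings
where safe_divide(t.fuel_liters, t.distance_km) * 100 between 0 and 200
```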
3. GOLD Layer (Marts)
gold_fleet_stats: Final reporting table for business stakeholders. Contains aggregated metrics and performance rankings by model and driver.
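A Gold mart of this shape might aggregate and rank as follows — a sketch assuming the Silver columns from the layer above; lower L/100km ranks as more efficient:

```sql
-- models/marts/gold_fleet_stats.sql (sketch; column names are assumptions)
select
    vehicle_model,
    driver_name,
    count(*)                                    as trip_count,
    sum(distance_km)                            as total_distance_km,
    round(avg(l_per_100km), 2)                  as avg_l_per_100km,
    -- Rank by average fuel efficiency (lower L/100km is better)
    rank() over (order by avg(l_per_100km) asc) as efficiency_rank
from {{ ref('silver_fleet_performance') }}
group by vehicle_model, driver_name
```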
| Layer | Input | Output | Key Operations |
|---|---|---|---|
| Bronze | Raw data | `stg_` models | `SAFE.PARSE_DATE`, casting, and initial validation |
| Silver | Staging models | `silver_` models | Large-scale `LEFT JOIN` and L/100km calculation |
| Gold | Silver models | `gold_` marts | `GROUP BY` and performance-ranking aggregation |
Robustness is enforced through dbt tests:
- Generic tests: `not_null` and `unique` on primary keys.
- Business tests: `dbt_utils.accepted_range` to ensure fuel consumption and distances fall within realistic physical bounds (e.g., 0 to 200 L/100km).
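These tests would be declared in a schema file along these lines — an illustrative fragment, with model and column names assumed rather than taken from the project:

```yaml
# models/schema.yml (illustrative; model and column names are assumptions)
version: 2

models:
  - name: silver_fleet_performance
    columns:
      - name: trip_id
        tests:
          - not_null
          - unique
      - name: l_per_100km
        tests:
          - dbt_utils.accepted_range:
              min_value: 0
              max_value: 200
```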
- Data Transformation: dbt (Data Build Tool)
- Warehouse: Google BigQuery
- Environment: Conda
- Data Generation: Python (Pandas/Numpy)
- Visualization: Looker Studio
This project utilizes the following dbt packages to extend functionality:
- `dbt-utils`: Used for advanced data quality testing (`accepted_range`) and cross-database macros.
- Clone the repo.
- Set up your `profiles.yml` for BigQuery.
- Install dependencies: `dbt deps`.
- Run the pipeline: `dbt run`.
- Execute tests: `dbt test`.