Sk00sha/MarketDataPipeline

Project not finished (due to lack of free time)

"Full-Stack" data project

This project showcases different approaches to data consumption, transformation, and loading.

To create this system, we need:

  1. Data generator (we simulate data rather than scraping real data)
  2. Dockerize the whole infra (MinIO, Kafka (brokers, Connect), Postgres)
  3. Kafka logic for data validation rules
  4. Write the Kafka consumer and producer
  5. Write a Flink streaming job for data transformation
  6. Spark batch jobs for MinIO data transformations and SparkSQL analytics
  7. (Optional) Write a React dashboard for data visualization
  8. Spring Boot backend REST API
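
As a rough illustration of steps 1 and 3, the sketch below simulates market ticks with a random walk and checks them against a few validation rules before serializing them for a producer. It uses only the Python standard library. The record fields and every name in it (`Tick`, `generate_ticks`, `validate`, `to_message`, the symbols, the specific rules) are assumptions made for this example, not the project's actual schema or code.

```python
import json
import random
import time
from dataclasses import dataclass, asdict


@dataclass
class Tick:
    # Hypothetical record layout; the real schema is not shown in this README.
    symbol: str
    price: float
    volume: int
    ts_ms: int


def generate_ticks(symbols, n, seed=None, start_price=100.0):
    """Yield n simulated ticks using a simple random walk per symbol."""
    rng = random.Random(seed)
    prices = {s: start_price for s in symbols}
    ts_ms = int(time.time() * 1000)
    for _ in range(n):
        symbol = rng.choice(symbols)
        # A multiplicative step of at most 1% keeps the price positive.
        prices[symbol] *= 1 + rng.uniform(-0.01, 0.01)
        # Advance the clock so timestamps are strictly increasing.
        ts_ms += rng.randint(1, 50)
        yield Tick(symbol, round(prices[symbol], 4), rng.randint(1, 1000), ts_ms)


def validate(tick, known_symbols):
    """Return a list of rule violations; an empty list means the tick is valid."""
    errors = []
    if tick.symbol not in known_symbols:
        errors.append("unknown symbol")
    if tick.price <= 0:
        errors.append("non-positive price")
    if tick.volume <= 0:
        errors.append("non-positive volume")
    return errors


def to_message(tick):
    """Serialize a tick as the JSON bytes a producer could send."""
    return json.dumps(asdict(tick)).encode("utf-8")


if __name__ == "__main__":
    symbols = ["AAA", "BBB"]  # placeholder tickers
    for tick in generate_ticks(symbols, 5, seed=42):
        if not validate(tick, set(symbols)):
            print(to_message(tick))
```

In a real deployment the `print` call would presumably be replaced by a Kafka producer send, and invalid records would be routed to a separate topic rather than dropped; both are design choices left open here.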

Simple architecture diagram

[Image: architecture diagram]
