Skip to content

Latest commit

Β 

History

History
67 lines (49 loc) Β· 3.46 KB

File metadata and controls

67 lines (49 loc) Β· 3.46 KB

Data Developer and Engineer 🌟

For knowledge document of this organization, you can follow this πŸ“– Data Developer & Engineer document.

Note

I will delegate the tools section of this docs to each project in Data Dev & Eng Lab

πŸ“Œ Core Components

flowchart LR
   1([πŸ“Œ ddeutil]) --> 2([πŸš… ddeutil-io]) --> 3([πŸƒ ddeutil-workflow]) ---> 4

   subgraph observe
      4([πŸ“‘ ddeutil-observe])
      6([πŸ”­ ddeutil-observe<br>streamlit])
   end

   3 --> 6
   0([βœ‚οΈ fmtutil]) -.-> 2
Loading

This organization has the propose to make lightweight data orchestration framework for small - middle data platform project (πŸƒAround 10K workflows).

Firstly, I will implement base projects, πŸ“Œ Core (utility functions) and πŸš… IO (Input/Output transport utility objects) for the first dependency packages because it has a lot of base code to make main package and I do not want to develop this code on the main package, for example, it do not good if I want fix bug on the merge key function that no relate with the workflow package

🎯 The main package of this organize orchestration framework has 2 layers and I split it with 2 projects for optional installation requirement (you can only use just one of these layers without raise error).

🍸 On-Going Projects

flowchart LR
   1([πŸƒ ddeutil-workflow]) ---> 2([⚑ deflow])
   3([πŸš… ddeutil-io]) ---> 4([βš™οΈ jett])
   5([airflow]) ---> 6([🏭 dagtool])
Loading

❌ Deprecated Projects

This organize has some mini-projects that develop for specific usecase:

  • data-orchestra - Full-Stack Data Orchestration from Yaml template with Flask & HTMX
  • load-routing - Routing Application Service deploy to On-Premise server with FastAPI
  • πŸ—οΈ Extensions - An additional practices to use any 3rd API connect data source.

Warning

I have some 3rd-party projects (deprecated!!!), πŸ—οΈ Extensions, for keeping an additional practices to use any 3rd API connect data source, like polars, duckdb, etc. It is dynamic data processing & transformation functions and objects from external vendor packages. It can plug-in to the Workflow package on the hook stage.

Warning

The above projects have a lot of bugs and need times to fix and refactor the code. So, you should not use these projects.