Skip to content
View somafordata's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing
  • Senior Data Management Specialist
  • 10:27 (UTC -12:00)

Block or report somafordata

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
somafordata/README.md

๐Ÿ’ซ About Me:

๐Ÿš€ Senior Data Management specialist passionate about building reliable, scalable, and high-performance data systems that turn raw data into impactful insights.

๐Ÿ”ญ Currently engineering cloud-native data pipelines and optimizing modern data architectures
๐Ÿค Open to collaborating on data engineering, big data, and analytics-driven projects
๐ŸŒฑ Continuously learning and exploring real-time data processing and next-gen cloud technologies
๐Ÿ’ก Strong believer that well-designed data platforms power smarter decisions
๐Ÿ’ฌ Ask me about Data Engineering, ETL/ELT, Distributed Systems, Python, SQL, and Cloud

โšก Fun fact: I love solving complex data puzzles โ€” the messier the data, the more exciting the challenge!


## โš™๏ธ Tech Toolbox

๐Ÿง  Data Engineering:
ETL/ELT Design โ€ข Data Modeling (Star & Snowflake) โ€ข Lakehouse Architecture

๐Ÿ”ฅ Big Data & Streaming:
Apache Spark (PySpark, Spark SQL, Structured Streaming) โ€ข Apache Airflow โ€ข Apache Kafka

โ˜๏ธ Cloud & Warehousing:
AWS (S3, EC2, Redshift, Glue, Lambda) โ€ข Azure (Databricks, Data Factory) โ€ข Snowflake

๐Ÿ’ป Languages:
Python โ€ข SQL (PostgreSQL, MySQL)

๐Ÿ” Data Governance & Security:
Unity Catalog โ€ข Azure Key Vault โ€ข Azure AD โ€ข RBAC

๐Ÿ—„๏ธ Databases:
PostgreSQL โ€ข SQL Server โ€ข Oracle โ€ข MongoDB (Basics)

๐Ÿ“Š Analytics:
Power BI โ€ข Advanced Excel โ€ข Power Query

๐Ÿ“ฆ File Formats:
Parquet โ€ข Avro โ€ข ORC โ€ข JSON โ€ข CSV โ€ข XML

๐Ÿ› ๏ธ DevOps & Tools:
Docker โ€ข Terraform โ€ข Git โ€ข CI/CD

๐ŸŒ Socials:

email

๐Ÿ’ป Tech Stack:

R Python Azure AWS Apache Tomcat Nginx Jenkins Gunicorn Apache Apache Airflow Neo4J MySQL Postgres scikit-learn PyTorch Plotly Pandas NumPy Scipy TensorFlow mlflow Matplotlib Keras Git GitHub Jira Kubernetes Postman Power Bi Docker Apache Spark Apache Kafka Apache Hive Apache Airflow Terraform MicrosoftSQLServer Postgres Snowflake Yarn

๐Ÿ“Š GitHub Stats:



โœ๏ธ Random Dev Quote

๐Ÿ” Top Contributed Repo


Popular repositories Loading

  1. machine-learning-online-2018 machine-learning-online-2018 Public

    Forked from rajivprajapati/machine-learning-online-2018

    ML Online Course Repository. Course videos on online.codingblocks.com

    Jupyter Notebook 1

  2. python-case-study python-case-study Public

    Solving this case study will give you an idea about how real business problems are solved using EDA. In this case study, apart from applying the techniques you have learnt in EDA, you will also devโ€ฆ

    Jupyter Notebook 1

  3. Assignments Assignments Public

    Jupyter Notebook 1

  4. Feature-Engineering Feature-Engineering Public

    Forked from noisyoscillator/Feature-Engineering

    Jupyter Notebook

  5. Webscrapingdeployment Webscrapingdeployment Public

    Web scraping is a technique using which the webpages from the internet are fetched and parsed to understand and extract specific information similar to a human being

    Jupyter Notebook

  6. Imagescrapping Imagescrapping Public

    Image scraping is a technique using which the webpages from the internet are fetched and parsed to understand and extract specific information similar to a human being

    Python