Hi, I'm Satish Kumar, a Data Engineer. I specialize in building scalable end-to-end data pipelines and working with big data technologies to drive impactful insights. Passionate about Data Engineering, Machine Learning, and Generative AI, I enjoy solving complex data problems and optimizing data workflows.
- Building real-world Data Engineering systems using Kafka, Spark, Airflow, Postgres, and Docker.
- NLP and Generative AI, focusing on Large Language Models.
- Data as a Product and Data Governance best practices.
- How to architectect better data system
- Programming Languages: HTML/CSS/JavaScript, Java, Python, SQL, Scala
- Databases: Oracle DB, SQL Server, MySQL, PostgreSQL
- Big Data & Cloud: Hadoop, YARN, MapReduce, Sqoop, Flume, Hive, HBase, Zookeeper, Spark, Cloudera Tech stacks, AWS, Databricks, Snowflake
- Streaming & Orchestration: Kafka, Oozie, AirFlow,
- NLP & AI: Transformers, LLMs, TextBlob, OpenAI APIs
- Visualization: Matplotlib, Seaborn, Streamlit
- Cloud/DevOps: Git, Jenkins, Docker
- Tools: Jira, Confluence
- Python for Everybody Specialization
- AWS Certified Solutions Architect β Associate
- Programming for Data Science
- Cloud DevOps Engineer
- πΌ LinkedIn: Satish Kumar
- βοΈ Email: [email protected]
Feel free to explore my repositories, share feedback, or collaborate on projects. Let's build something incredible together!
