Tvkoushik/README.md

Koushik Thota

I have spent 6+ years building data platforms across AWS and Azure, and I am pushing toward principal-level scope across full-stack data and AI systems.

LinkedIn · 6+ years experience · Batch and streaming data · Full-stack data and AI

About

I started in traditional ETL and data warehousing work, then moved deeper into cloud data platforms, real-time pipelines, serverless systems, analytics tooling, and AI-enabled internal workflows.

That mix still shapes the way I work. I am comfortable modernizing older enterprise flows, but I also like building cloud-native systems that are easier to run, inspect, and extend.

A lot of my work has lived in environments where accuracy matters: insurance, enterprise reporting, industrial analytics, customer-facing platforms, and internal tools where data and AI need to be useful instead of just impressive.

Direction

I am building toward principal-level ownership across full-stack data and AI. For me, that means going beyond pipelines into orchestration, APIs, agent workflows, evaluation, developer tooling, and the product surfaces around them.

What I Can Build

Data engineering and platform work

  • Batch ETL and ELT pipelines
  • Streaming and event-driven data workflows
  • Cloud data platforms on AWS and Azure
  • Data lake, warehouse, and operational sync pipelines
  • Infrastructure-backed delivery using CDK, Terraform, and CI/CD workflows

Analytics, AI, and internal data products

  • Reporting and BI automation for internal teams
  • Tableau governance and analytics platform tooling
  • Data quality checks, schema validation, and production guardrails
  • Applied ML workflows around forecasting, anomaly detection, recommendation logic, and AI-assisted classification
  • LLM workflows, agent tooling, MCP-based integrations, and backend or UI work when the platform needs a product surface

Where I Am Strong

  • Modernizing data flows from enterprise systems, flat files, APIs, and operational databases into cloud-ready platforms.
  • Designing pipelines that support both analytics use cases and downstream application needs.
  • Working across data, AI, and product-facing systems without treating them like separate problems.
  • Building systems that are easier to debug, easier to trust, and less painful to maintain.
  • Taking ownership from implementation through rollout, with mentoring and delivery discipline when teams need it.

Toolbox

Python, SQL, Scala, PySpark, Kafka, Airflow, Informatica, AWS, Azure, GCP, Azure Data Factory, Azure Synapse, Databricks, AWS Lambda, AWS Step Functions, AWS CDK, Redshift, Snowflake, BigQuery, PostgreSQL, Apache Iceberg, DuckDB, Tableau, Terraform, Jenkins, LangChain, LangGraph, LangSmith, Model Context Protocol, Claude Code, Gemini CLI

Credentials

  • AWS Certified Data Analytics - Specialty
  • AWS Certified Solutions Architect - Associate
  • SAFe 5 Practitioner

Popular repositories

  1. python-oneliners Public

    Some of the Python one-liners that I use regularly and that save me a lot of time.

    24 1

  2. gq-great-expectations Public

    Great Expectations Data Quality Checks is a specialized repository designed to harness the capabilities of the great_expectations Python library. With a focus on ensuring data quality, this project…

    Jupyter Notebook 3

  3. Kachow Public

    Config files for my GitHub profile.

    1

  4. python-one-liners Public

    Forked from Allwin12/python-one-liners

    This repository contains python one-liners obtained from various sources.

    1

  5. data-engineering-zoomcamp Public

    Forked from DataTalksClub/data-engineering-zoomcamp

    Code for Data Engineer Zoomcamp course

    Jupyter Notebook 1

  6. awesome-python Public

    Forked from vinta/awesome-python

    A curated list of awesome Python frameworks, libraries, software and resources

    Python 1