Product-Driven Data Engineer

Building scalable data foundations to power strategic decision-making.

I am Jaya Ganesh Kumar, a Data Engineer specializing in crafting robust ETL pipelines with PySpark and Airflow. I translate complex data ecosystems into highly actionable intelligence, currently driving analytical solutions at Visa.

Jaya Ganesh Kumar

My Story

As a dedicated data engineer, I thrive on the challenge of turning raw, complex datasets into valuable insights that drive business decisions. I approach data not as static numbers, but as the raw material for powerful storytelling and precision strategy.

Proficient in data modeling, ETL processes, and modern data warehousing techniques, I leverage cloud-based platforms like AWS and big data paradigms. Having worked across organizations like Visa, Vivriti Capital, and PayPal, my focus remains steadfast: delivering scalable, efficient data solutions that support data-driven decision making and intuitive product experiences.

Professional Experience


May 2025 - Present
Bengaluru, Karnataka

Data Engineer

Visa
  • Data warehouse engineer for Verifi Product, building scalable data infrastructure.
  • Architected and implemented 10+ Data Pipelines using PySpark, following the Medallion Architecture (Bronze, Silver, Gold) for robust data quality and lineage.
  • Designed and deployed 5+ Reporting Tables specifically to power dynamic UI widgets and dashboards.
  • Pioneered the use of ADLC (AI-Assisted Development Life Cycle) technologies to accelerate script development, significantly reducing coding turnaround time.

Jul 2023 - May 2025
Bengaluru, Karnataka

Data Analyst / Data Engineer 1

Vivriti Capital
  • Designed & implemented web scraping solutions to extract data from 5+ websites, ensuring 98% data accuracy.
  • Wrote and optimized 30+ SQL transformations to streamline the calculation of 10+ financial metrics, reducing manual error rates by 80%.
  • Designed scalable data pipelines using Airflow and PySpark to extract, transform, and load data into Redshift for efficient analysis.
  • Collaborated with four cross-functional teams to align transformations with direct business insight requirements.

Aug 2022 - Jan 2023
Remote

Data Analyst Intern

PayPal
  • Developed an interactive dashboard using Tableau to monitor losses at a daily level with dynamic filtering for deep-dive analysis.
  • Employed PySpark to optimize data processing, creating a 40% efficiency boost in continuous data integration.
  • Worked on a User-experience Analytic project predicting the number of good users penalized for blocking bad actors, utilizing statistical modeling to forecast strategic risks.

Technical Arsenal

Python Python
SQL SQL / BQ
Spark Spark
Airflow Airflow
ETL & Data Warehousing
Python Python
SQL SQL / BQ
Spark Spark
Airflow Airflow

Featured Implementations

Driver Alertness Prediction
Driver Alertness Prediction

A computer vision solution to mitigate driver fatigue in real-time. Leverages OpenCV and Python to process live video streams, identify drowsiness signs, and deploy immediate critical alerts.

Review Repository
Phishing Detection
Phishing Security Engine

An ML algorithm architected to classify malicious URLs dynamically, mitigating user vulnerability to phishing landscapes.

Review Repository
Crypto Tracker
Crypto Tracker Pipeline

A high-frequency web scraper aggregating live CoinMarketCap datasets to track market volatility and support automated analysis.

Review Repository
Airline Price Prediction
Flight Price Forecasting

Leveraging historical flight datasets to build a predictive machine learning model targeting fare fluctuations.

Review Repository
Farmer's Portal
Farmer's Support Portal

A centralized digital resource providing systematic access to agricultural insights and tools for better crop yield.

Review Repository

Certifications & Growth

Build a Face Recognition Application

Advanced Python programming; Computer Vision Algorithms

Machine Learning: Zero to GBMs

Jovian • Deep-dive into models and deployment strategies

Python Fundamentals

Crash course in programming best practices

Problem Solving (Basic)

HackerRank • Core algorithmic execution and structure