Skip to content

VarunArora14/VarunArora14

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 

Repository files navigation

Varun Arora

Email: [email protected]
Location: India
LinkedIn   LeetCode


Skills

Technical Skills

  • Python, AWS, SQL, Spark, Databricks, Snowflake, DBT, OpenSearch, PostgreSQ, Docker, Jenkins, CI/CD, Kubernetes
  • OOPs, Distributed Systems, Microservices, DSA, System Design

What am I doing right now?

  • Building data driven systems powered by AI at Enterprise level
  • Exploring Open Source in data and AI projects (OSS prev at https://github.com/antiwork/ flexile and gumroad)

Work Experience

Specialist Programmer (SDE 1)

Infosys, Noida, India
Feb 2024 - Present

  • Built scalable Python ELT pipelines to ingest, validate, and transform high volume data into Snowflake, leveraging dbt models for modular and version controlled transformations and dagster for orchestration.
  • Engineered data pipelines to ingest and index 10M+ vector embeddings into OpenSearch for a GenAI RAG system for The Economist Group optimizing bulk indexing requests to reduce latency by 40% Press Release, Live Link .
  • Enhanced Snowflake warehouse performance by 60% through advanced SQL tuning, implementation of clustering keys, materialized views and effective use of CTEs.
  • Scaled ETL pipelines to support 2B+ records/hour ingestion into Snowflake, cutting latency by 35% through incremental loading strategies, partition aware data models and clustering optimization.
  • Automated CI/CD driven deployments, dbt runs, data quality checks and tests using Azure DevOps and GitHub Actions, achieving over 90% deployment automation.

Software Engineer

Biz2Credit, Noida, India
Mar 2023 - Feb 2024

  • Built a centralized cloud monitoring system for 25+ AWS accounts, enabling DevOps team to quickly identify idle or over provisioned resources, resulting in $30k+ annual cost savings.
  • Developed Databricks workflows using PySpark to process 65M+ daily records into Delta Lake with Unity Catalog for centralized control and lineage tracking for 20+ upstream sources.
  • Engineered python data pipelines integrated with pytest suite for unit and integration testing reducing data anomalies by 35% and ensuring 99.9% pipeline reliability within a Jenkins-managed CI/CD environment.
  • Debugged cross service Redis DB write failures caused by silent master node downtime, implementing retry logic with exponential backoff for database connections and improving master node health alerts, reducing incident resolution time by 70%.

Education

B.Tech in Computer Science, MAIT CGPA: 9.21
Jul 2019 - Jul 2023


Achievements

  • Received a Infosys STG Unit Rise Award in August 2025 and 2 Infosys RISE Insta Awards in December 2024 and March 2025 for resolving critical issues during UAT phases and driving project to completion via proper communication and cross teams collaboration.
  • Awarded Biz2Credit Functional Award for December 2023 for for developing and deploying cloud monitoring system to production and debugging cross service Redis write failure.
  • Ranked in the Top 5% on LeetCode with a maximum rating of 1850+.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors