John Kevin Barrera Contreras

Data Scientist & Computer Scientist

Email: [email protected]

Phone: +(51) 967016935

Web: johnkbarrera.github.io

About Me

Hi, I’m John Kevin Barrera Contreras, a Computer Scientist from Peru with a strong foundation in mathematics, artificial intelligence, and software development. My upbringing in rural communities shaped my commitment to leveraging technology for social impact and sustainable development.

With expertise spanning machine learning, data science, and systems architecture, I specialize in rapid prototyping and proof-of-concept development using Python. My work focuses on translating complex technical challenges into practical, scalable solutions, currently exploring the intersection of data science and social good.

Core Competencies: Data Architecture, Machine Learning, Python, Graph Theory, Big Data, and Cloud Technologies (Azure, GCP)

Professional Experience

The World Bank

Data Scientist Consultant

January 2023 - December 2025

worldbank.org

Leveraging data science for development impact

Research project focused on measuring risk and accessibility to health systems across multiple countries. Developed innovative approaches combining:

  • Graph theory and geographic datasets for accessibility modeling
  • Machine learning models for risk assessment and prediction
  • Comparative analysis with existing country-specific methodologies
  • Mathematical modeling and algorithm development for large-scale data processing

Technologies: Python, Machine Learning, Graph Theory, Geospatial Analysis, Big Data

This project is part of my master’s degree research.

Globant

Data Architect

August 2021 - December 2024

globant.com

Building scalable data architectures for enterprise clients

Led multiple high-impact data architecture projects:

Liga MX Data Architecture (Mexico & LATAM)

  • Designed and implemented end-to-end data architecture using Azure ecosystem
  • Built scalable pipelines connecting multiple sources through Bronze, Silver, and Gold layers
  • Enabled data-driven decision making for one of Latin America’s premier football leagues

BICE Bank Foundation Platform (Chile)

  • Architected batch processing platform on Google Cloud Platform
  • Designed secure data governance framework for financial institution
  • Enabled large-scale data processing and analytics capabilities

InStride Platform Enhancement

  • Collaborated on proof-of-concept development and platform improvements
  • Defined comprehensive data taxonomy for multi-environment deployments
  • Contributed to development team as architecture consultant

Technologies: Azure (Databricks, ADF, Data Lake Storage, Functions), GCP, Python, PySpark, SQL

Rappi

Data Engineer

October 2020 - August 2021

rappi.com

Empowering fintech innovation through data

Data Engineer at RAPPIBANK, providing comprehensive business support for data-related initiatives across LATAM:

  • Developed end-to-end data solutions, proposals, and ETL/ELT pipelines
  • Specialized in graph-based data storage for complex relationship modeling
  • Implemented data governance and quality frameworks
  • Supported analytical models requiring geographical analysis, degree-of-separation calculations, and transaction data
  • Explored and implemented CDC (Change Data Capture) solutions

Technologies: Snowflake, PySpark, Apache NiFi, Airflow, Fivetran, dbt, S3, AWS Glue, Neo4j, PostgreSQL

Research Center - Universidad del Pacífico

Data Scientist & Research Assistant

April 2020 - September 2020

up.edu.pe

Advancing research at the intersection of data science and social impact

Collaborated on diverse research projects spanning public health, consumer behavior, and blockchain:

World Bank & PCM Projects

  • Co-authored research published in “Analysis of the Health Network of Metropolitan Lima Against Large-Scale Earthquakes”
  • Developed contact tracing algorithms and COVID-19 contagion probability models
  • Simulated emergency scenarios and optimized routes using graph theory

Consumer Behavior Analysis

  • Applied unsupervised machine learning and NLP to mine customer buying patterns
  • Led by M. Núñez del Prado, Ph.D. and H. Alatrista S., Ph.D.

Blockchain Research

  • Created educational cryptocurrency from Bitcoin fork
  • Developed DApps on Ethereum platform with API integration
  • Algorithm development in R and Julia (Led by F. Rosales M., Ph.D.)

INDRA

Data Engineer

August 2019 - November 2019

indracompany.com

Telecommunications Data Lake Project:

  • Modeled data domains for enterprise data lake architecture
  • Improved data quality processes across Raw, Silver, and Gold layers
  • Implemented data ingestion pipelines using Hortonworks stack

Everis (NTT Data)

Data Engineer

May 2018 - May 2019

nttdata.com

Big Data implementations for Peruvian banking sector:

  • Developed solutions using Apache ecosystem (Hadoop, Hive, Spark, Sqoop, Kafka, HBase, Zookeeper)
  • Worked with Databricks, Cassandra for distributed data processing
  • Created multiple proof-of-concepts for client evaluation
  • Completed comprehensive Big Data training and certification

ONPE-GSFP

Systems Engineer Intern

August 2017 - December 2017

onpe.gob.pe

Electoral data processing and validation:

  • Developed ETL processes for political organization reports
  • Implemented data cleaning and validation against RENIEC database
  • Created error verification and user confirmation workflows

Education

Pontificia Universidad Católica del Perú

Master of Science in Computer Science

2022 - 2023

pucp.edu.pe

Advanced studies in artificial intelligence, machine learning, and computational methods.

Master’s Thesis: Research project with The World Bank on measuring risk and accessibility to health systems using graph theory and machine learning.

Universidad Nacional Mayor de San Marcos

Bachelor of Science in Systems Engineering

2012 - 2017

unmsm.edu.pe

Comprehensive foundation in computer science, mathematics, and software engineering.

Universidad de Chile

Exchange Program - Faculty of Economics and Business

2016

uchile.cl

Recipient of Alianza del Pacífico Scholarship

International exchange program focused on economics and business analytics.

Publications & Research

Analysis of the Health Network of Metropolitan Lima Against Large-Scale Earthquake

Springer - ISBN 978-3-030-76228-5

https://link.springer.com/chapter/10.1007/978-3-030-76228-5_32

Contributing to disaster preparedness through data-driven analysis

Peru is a highly seismic country located in the Ring of Fire, making it vulnerable to earthquakes and tsunamis. This research examines Lima’s health system capacity from three different and complementary perspectives, utilizing graph theory, geographic information systems, and simulation modeling.

Key Contributions:

  • Multi-dimensional analysis of health network resilience
  • Accessibility modeling under disaster scenarios
  • Policy recommendations for health system strengthening

Research Methods: Graph Theory, Geospatial Analysis, Network Modeling, Simulation

Impact: Provides critical evidence for strengthening Peru’s fragile health infrastructure against seismic events.

Notable Projects

PCM - MINSA Contact Tracing

Peru in Your Hands - COVID-19 Response

Developed contact tracing algorithms and contagion probability models for Peru’s Ministry of Health. Applied graph theory and machine learning to public health crisis response.

World Bank Health Network Analysis

Earthquake Preparedness Research

Multi-country research project analyzing health system accessibility and risk in seismic-prone regions. Published research informing policy decisions for disaster preparedness.

VacBs - Mobile App for Dairy Production

Collaboration with INIA, UP, and UALM to create mobile application for artisanal milk production. Implemented neural networks for quality prediction and production optimization.

EsanCoin/CiupCoin

Educational Cryptocurrency

Created educational cryptocurrency based on Bitcoin and Learncoin source code. Developed comprehensive video tutorial series for blockchain education.

Ethereum DApps Development

Projects: Rocket & Black Rabbits Get Lost In The Woods

Developed decentralized applications on Ethereum blockchain platform. Explored smart contract development and blockchain-based solutions.

Achievements & Recognition

Hackathons

Innovation through collaboration

2020

  • 🥈 2nd Place - Hackathon Contact Tracing (COVID-19 Response)

2018

  • 🥇 1st Place - HACKATHON ARTTEC (Art & Technology)
  • 🥇 1st Place - IBM CODING DAY: Hackathon for a Peru #SINANEMIA (Social Impact)

Scholarships & Awards

2016

  • 🎓 Recipient of Alianza del Pacífico Scholarship
  • Competitive international exchange program across Pacific Alliance countries

Skills & Technologies

Programming Languages: Python, R, Julia, SQL, Scala

Data Engineering: Apache Spark (PySpark), Airflow, NiFi, dbt, Fivetran, Sqoop, Kafka

Cloud Platforms: Azure (Databricks, ADF, Data Lake Storage, Functions), Google Cloud Platform, AWS (S3, Glue)

Databases: Snowflake, PostgreSQL, Neo4j, Cassandra, HBase

Big Data Stack: Hadoop, Hive, HBase, Zookeeper, Hortonworks

Machine Learning: Scikit-learn, TensorFlow, PyTorch, NLP, Unsupervised Learning

Data Visualization: Tableau, Power BI, Matplotlib, Seaborn

Specialized: Graph Theory, Network Analysis, Geospatial Analysis, Mathematical Modeling

Blockchain: Ethereum, Bitcoin, Smart Contracts, DApp Development

Interests & Hobbies

Beyond my professional work in data science and computer science, I’m passionate about:

  • 🎸 Music: Playing guitar and exploring different musical genres
  • 🎮 Gaming: Strategy games and problem-solving challenges
  • 🌱 Social Impact: Community projects with ecological and social focus
  • 🥷 Continuous Learning: Always seeking new challenges
  • 🌍 Technology for Good: Leveraging tech for sustainable development in rural communities