About Me
Hi, I’m John Kevin Barrera Contreras, a Computer Scientist from Peru with a strong foundation in mathematics, artificial intelligence, and software development. My upbringing in rural communities shaped my commitment to leveraging technology for social impact and sustainable development.
With expertise spanning machine learning, data science, and systems architecture, I specialize in rapid prototyping and proof-of-concept development using Python. My work focuses on translating complex technical challenges into practical, scalable solutions, currently exploring the intersection of data science and social good.
Core Competencies: Data Architecture, Machine Learning, Python, Graph Theory, Big Data, and Cloud Technologies (Azure, GCP)
Professional Experience
Leveraging data science for development impact
Research project focused on measuring risk and accessibility to health systems across multiple countries. Developed innovative approaches combining:
- Graph theory and geographic datasets for accessibility modeling
- Machine learning models for risk assessment and prediction
- Comparative analysis with existing country-specific methodologies
- Mathematical modeling and algorithm development for large-scale data processing
Technologies: Python, Machine Learning, Graph Theory, Geospatial Analysis, Big Data
This project is part of my master’s degree research.
Building scalable data architectures for enterprise clients
Led multiple high-impact data architecture projects:
Liga MX Data Architecture (Mexico & LATAM)
- Designed and implemented end-to-end data architecture using Azure ecosystem
- Built scalable pipelines connecting multiple sources through Bronze, Silver, and Gold layers
- Enabled data-driven decision making for one of Latin America’s premier football leagues
BICE Bank Foundation Platform (Chile)
- Architected batch processing platform on Google Cloud Platform
- Designed secure data governance framework for financial institution
- Enabled large-scale data processing and analytics capabilities
InStride Platform Enhancement
- Collaborated on proof-of-concept development and platform improvements
- Defined comprehensive data taxonomy for multi-environment deployments
- Contributed to development team as architecture consultant
Technologies: Azure (Databricks, ADF, Data Lake Storage, Functions), GCP, Python, PySpark, SQL
Empowering fintech innovation through data
Data Engineer at RAPPIBANK, providing comprehensive business support for data-related initiatives across LATAM:
- Developed end-to-end data solutions, proposals, and ETL/ELT pipelines
- Specialized in graph-based data storage for complex relationship modeling
- Implemented data governance and quality frameworks
- Supported analytical models requiring geographical analysis, degree-of-separation calculations, and transaction data
- Explored and implemented CDC (Change Data Capture) solutions
Technologies: Snowflake, PySpark, Apache NiFi, Airflow, Fivetran, dbt, S3, AWS Glue, Neo4j, PostgreSQL
Research Center - Universidad del Pacífico
Data Scientist & Research Assistant
April 2020 - September 2020
up.edu.pe
Advancing research at the intersection of data science and social impact
Collaborated on diverse research projects spanning public health, consumer behavior, and blockchain:
World Bank & PCM Projects
- Co-authored research published in “Analysis of the Health Network of Metropolitan Lima Against Large-Scale Earthquakes”
- Developed contact tracing algorithms and COVID-19 contagion probability models
- Simulated emergency scenarios and optimized routes using graph theory
Consumer Behavior Analysis
- Applied unsupervised machine learning and NLP to mine customer buying patterns
- Led by M. Núñez del Prado, Ph.D. and H. Alatrista S., Ph.D.
Blockchain Research
- Created educational cryptocurrency from Bitcoin fork
- Developed DApps on Ethereum platform with API integration
- Algorithm development in R and Julia (Led by F. Rosales M., Ph.D.)
Telecommunications Data Lake Project:
- Modeled data domains for enterprise data lake architecture
- Improved data quality processes across Raw, Silver, and Gold layers
- Implemented data ingestion pipelines using Hortonworks stack
Big Data implementations for Peruvian banking sector:
- Developed solutions using Apache ecosystem (Hadoop, Hive, Spark, Sqoop, Kafka, HBase, Zookeeper)
- Worked with Databricks, Cassandra for distributed data processing
- Created multiple proof-of-concepts for client evaluation
- Completed comprehensive Big Data training and certification
Electoral data processing and validation:
- Developed ETL processes for political organization reports
- Implemented data cleaning and validation against RENIEC database
- Created error verification and user confirmation workflows
Education
Pontificia Universidad Católica del Perú
Master of Science in Computer Science
2022 - 2023
pucp.edu.pe
Advanced studies in artificial intelligence, machine learning, and computational methods.
Master’s Thesis: Research project with The World Bank on measuring risk and accessibility to health systems using graph theory and machine learning.
Universidad Nacional Mayor de San Marcos
Bachelor of Science in Systems Engineering
2012 - 2017
unmsm.edu.pe
Comprehensive foundation in computer science, mathematics, and software engineering.
Recipient of Alianza del Pacífico Scholarship
International exchange program focused on economics and business analytics.
Publications & Research
Analysis of the Health Network of Metropolitan Lima Against Large-Scale Earthquake
Springer - ISBN 978-3-030-76228-5
https://link.springer.com/chapter/10.1007/978-3-030-76228-5_32
Contributing to disaster preparedness through data-driven analysis
Peru is a highly seismic country located in the Ring of Fire, making it vulnerable to earthquakes and tsunamis. This research examines Lima’s health system capacity from three different and complementary perspectives, utilizing graph theory, geographic information systems, and simulation modeling.
Key Contributions:
- Multi-dimensional analysis of health network resilience
- Accessibility modeling under disaster scenarios
- Policy recommendations for health system strengthening
Research Methods: Graph Theory, Geospatial Analysis, Network Modeling, Simulation
Impact: Provides critical evidence for strengthening Peru’s fragile health infrastructure against seismic events.
Notable Projects
PCM - MINSA Contact Tracing
Peru in Your Hands - COVID-19 Response
Developed contact tracing algorithms and contagion probability models for Peru’s Ministry of Health. Applied graph theory and machine learning to public health crisis response.
World Bank Health Network Analysis
Earthquake Preparedness Research
Multi-country research project analyzing health system accessibility and risk in seismic-prone regions. Published research informing policy decisions for disaster preparedness.
VacBs - Mobile App for Dairy Production
Collaboration with INIA, UP, and UALM to create mobile application for artisanal milk production. Implemented neural networks for quality prediction and production optimization.
EsanCoin/CiupCoin
Educational Cryptocurrency
Created educational cryptocurrency based on Bitcoin and Learncoin source code. Developed comprehensive video tutorial series for blockchain education.
Ethereum DApps Development
Projects: Rocket & Black Rabbits Get Lost In The Woods
Developed decentralized applications on Ethereum blockchain platform. Explored smart contract development and blockchain-based solutions.
Achievements & Recognition
Hackathons
Innovation through collaboration
2020
- 🥈 2nd Place - Hackathon Contact Tracing (COVID-19 Response)
2018
- 🥇 1st Place - HACKATHON ARTTEC (Art & Technology)
- 🥇 1st Place - IBM CODING DAY: Hackathon for a Peru #SINANEMIA (Social Impact)
Scholarships & Awards
2016
- 🎓 Recipient of Alianza del Pacífico Scholarship
- Competitive international exchange program across Pacific Alliance countries
Skills & Technologies
Programming Languages: Python, R, Julia, SQL, Scala
Data Engineering: Apache Spark (PySpark), Airflow, NiFi, dbt, Fivetran, Sqoop, Kafka
Cloud Platforms: Azure (Databricks, ADF, Data Lake Storage, Functions), Google Cloud Platform, AWS (S3, Glue)
Databases: Snowflake, PostgreSQL, Neo4j, Cassandra, HBase
Big Data Stack: Hadoop, Hive, HBase, Zookeeper, Hortonworks
Machine Learning: Scikit-learn, TensorFlow, PyTorch, NLP, Unsupervised Learning
Data Visualization: Tableau, Power BI, Matplotlib, Seaborn
Specialized: Graph Theory, Network Analysis, Geospatial Analysis, Mathematical Modeling
Blockchain: Ethereum, Bitcoin, Smart Contracts, DApp Development
Interests & Hobbies
Beyond my professional work in data science and computer science, I’m passionate about:
- 🎸 Music: Playing guitar and exploring different musical genres
- 🎮 Gaming: Strategy games and problem-solving challenges
- 🌱 Social Impact: Community projects with ecological and social focus
- 🥷 Continuous Learning: Always seeking new challenges
- 🌍 Technology for Good: Leveraging tech for sustainable development in rural communities