Skip to content
View tharoun's full-sized avatar

Block or report tharoun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tharoun/README.md

Hi, I'm Harouna Traore 👋

Data Analyst & Scientist | Quantitative Modeler | MSc Applied Econometrics & Statistics


👤 About Me

I'm a quantitative Data Analyst & Scientist with a strong background in statistical modeling, data pipeline engineering, and credit risk modeling. Holding a Master's in Applied Econometrics & Statistics from the University of Orléans (France) and an engineering degree from ENSAE Dakar, I combine rigorous analytical thinking with a passion for building reliable, production-ready data solutions.


💼 Professional Experience

🌍 International Data Consultant — International Trade Centre (UN Agency), Geneva

Sept 2023 – Jan 2026

  • Automated data pipelines for trade data processing: quality control, cleaning, standardization, and indicator production
  • Designed visualizations and quality monitoring dashboards for trade flow analysis
  • Stored, consolidated, and made data available in a SQL Server data warehouse for reporting and analytics
  • Migrated SAS codebases to Python to industrialize analytical processes; documented business rules and processes
  • Leveraged data to produce analytical and thematic reports for internal teams and institutional partners
  • Delivered targeted training sessions on data tools, dashboards, and analytics (internal & partners)
  • Stack: Python · Dagster · R/Shiny · SQL · SAS · Power BI · SAP Analytics Cloud

🏦 Model Risk Manager Intern — Société Générale, Paris (La Défense)

Mar 2023 – Aug 2023

  • Built an R/Shiny application to automate backtesting of credit stress test models — cutting report production from several days to 7 minutes
  • Enriched performance indicators: PSI, HHI, AR/AUC, Hosmer-Lemeshow, MAE/RMSE, conformity tests
  • Extended data quality diagnostics: missing values, outlier detection (Grubbs), population stability
  • Conducted IFRS9 PD model backtesting: PiT PD vs ODR comparison, migration matrices, conservatism evaluation
  • Analyzed climate stress tests (NGFS scenarios, transition & physical risks)
  • Stack: Python · R/Shiny

🚀 Projects

📊 Credit Scoring Model — MSc ESA Project

  • Feature engineering, Logistic Regression, Decision Tree, Random Forest, Bagging, XGBoost
  • Class imbalance handling via SMOTE resampling; model evaluation & scorecard construction

📉 Probability of Default (PD) Modeling — RiskMDP Advisory

  • Segmentation: feature engineering, logistic regression, scorecard construction
  • Calibration: Long-Run Average (LRA) estimation and Margin of Conservatism (MoC) quantification

🌍 African Countries GNI Per Capita Animation (Public — GitHub)

🔗 github.com/tharoun/African-Countries-GNI-Per-Capita-Animation

Python Matplotlib License: MIT

  • Animated bar chart race showing the evolution of the top 10 African countries by GNI per capita (1960–2024)
  • Features country flags, World Bank income classification thresholds, and smooth year-by-year transitions
  • Outputs available as MP4 video, HTML animation, or GIF
  • Data sourced from World Bank World Development Indicators (Atlas method)
  • Stack: Python · Matplotlib · Pandas · Pillow · FFmpeg

🗄️ WDI Data Pipeline (Public — GitHub)

🔗 github.com/tharoun/wdi-data-pipeline

R PostgreSQL License: MIT

  • Automated pipeline to download the full World Bank WDI dataset via API and load it into a PostgreSQL database
  • Creates 6 structured tables: data, country, series, footnotes, and relationship tables
  • Handles data cleaning, column standardization, schema management, and connection lifecycle
  • Stack: R · WDI · RPostgres · DBI · tidyverse · PostgreSQL

🦠 COVID-19 Dashboard — Burkina Faso (Public — GitHub)

🔗 github.com/tharoun/covid-2019-dashboard · 🌐 Live Demo

R

  • Interactive dashboard to monitor the evolution of COVID-19 in Burkina Faso
  • Built a data collection and processing pipeline to generate key epidemic indicators
  • Features choropleth mapping and trend visualizations of case evolution
  • Stack: R · Shiny · ShinyApps.io

🛠️ Tech Stack

Domain expertise: Basel Regulation · IFRS9 · Credit Risk (PD / LGD / CCF / EAD / RWA) · Backtesting · Stress Testing · Machine Learning · Spatial Econometrics · Data Warehousing


🎓 Education

Degree Institution Period
MSc Applied Econometrics & Statistics (ESA) University of Orléans, France 2021 – 2023
Ingénieur des Travaux Statistqiues (ITS) ENSAE Dakar, Senegal 2017 – 2021

📜 Certifications

  • 🏅 Designing Stories with SAP Analytics Cloud — 2025
  • 🏅 Advanced Python Programming — Udemy, 2024
  • 🏅 Data Analyst in Power BI — DataCamp, 2024
  • 🏅 Quantitative Analyst Pro (Scoring, PD, LGD, Prudential Regulation) — RiskMDP Advisory
  • 🏅 Base Programming SAS — 2022
  • 🏅 Core Designer — Dataiku, 2022

🌐 Languages

  • 🇬🇧 English — Fluent (speaking, writing, reading, listening)
  • 🇫🇷 French — Native

Open to opportunities in Data Analytics, Data Science, and Quantitative Risk Modeling.
📬

Pinned Loading

  1. African-Countries-GNI-Per-Capita-Animation African-Countries-GNI-Per-Capita-Animation Public

    Animated bar chart race: Top 10 African countries by GNI per capita evolution (1960-2024) | Python | Matplotlib | Data Visualization

    Python

  2. ETL-pipeline--World-Development-Indicators-to-PostgreSQL-database-migration ETL-pipeline--World-Development-Indicators-to-PostgreSQL-database-migration Public

    R script that automatically downloads World Bank indicators, cleans the data, and uploads to PostgreSQL. Perfect for economic research and analysis. 🏷️ r | world-bank | wdi | postgresql | data-pipe…

    R

  3. Python_codes_for_data_analysis_and_econometrics Python_codes_for_data_analysis_and_econometrics Public

    This project gathers code examples for data analysis and econometrics with Python.

  4. Python_codes_for_ML Python_codes_for_ML Public

    This repository aims to provide a sample of python code for a ML and data science project.

    Jupyter Notebook