Skip to content

Sitraka17/Centrale_Marseille_ML

Repository files navigation

🚀 Centrale Marseille — Master Machine Learning

A structured course repository for advanced Machine Learning concepts and business applications.

This repository collects teaching material, notebooks, and references used in the Master-level Machine Learning course led by SITRAKA.


📚 Repository Structure

  • 00_Introduction/ — onboarding notebooks, Python/Colab setup, and early LLM references.
  • 01_Fundamentals/ — core Machine Learning concepts, algorithm summaries, and flashcards.
  • 03_Advanced/ — deep learning examples, TensorFlow modelling, and orchestration with Airflow.
  • docs/ — supplementary theory, formulas, and detailed notes.
  • resources/ — datasets, external references, and enrichment materials.
  • LOGISTIC_REGRESSION_DDEFI.ipynb — practical logistic regression notebook.

🎯 Learning Objectives

Students completing this course should be able to:

  • Explain the main principles of supervised, unsupervised, and deep learning.
  • Design and evaluate end-to-end ML workflows for business use cases.
  • Interpret model behaviour through metrics, regularization, and bias-variance tradeoffs.
  • Connect modern NLP architectures such as transformers to practical applications.
  • Understand how orchestration tools like Airflow support production ML pipelines.

🗓 Course Timeline

November

  • 13/11 — LIRONE #1 (6h)
  • 25/11 — SITRAKA #2 (6h) — Introduction to Machine Learning
  • 28/11 — SITRAKA #3 (6h) — ML, Deep Learning, and collaborative work

December

  • 19/12 — LIRONE #4 (6h)

January

  • 29/01 — LIRONE #5 (6h)
  • Additional sessions: TBD

🧠 Course Overview

1. Foundations

  • Data preparation: cleaning, missing values, duplicates, outliers
  • Feature engineering: scaling, encoding, feature creation
  • Feature selection: signal versus noise
  • Bias-variance tradeoff and model complexity
  • Evaluation metrics and selection of the right metric

2. Supervised Learning

  • Linear regression
  • Logistic regression
  • Tree-based models: decision trees, random forests, gradient boosting

Business use cases: marketing mix modeling, client scoring, churn prediction, sports analytics

3. Unsupervised Learning

  • Clustering: K-Means, hierarchical clustering
  • Dimensionality reduction: PCA

Business use cases: customer segmentation, targeting strategies

4. Deep Learning

  • Differences between traditional ML and deep learning
  • Neural network training and backpropagation
  • Loss functions such as cross-entropy

Business use cases: computer vision for tracking, advanced predictive models

5. NLP & Transformers

  • NLP fundamentals and model evolution
  • Word embeddings and attention mechanisms
  • Transformer architectures in modern applications

Business use cases: finance analytics, automated report generation, sentiment analysis


📖 Core Resources


🔬 Advanced Topics

  • Synthetic data generation
  • Agent-Based Modelling (ABM)
  • Quantitative finance trends and research

🛠️ Tools


✅ Next Steps

  • Add Finance NLP use cases
  • Add datasets and data exploration pipelines
  • Add an end-to-end ML project
  • Add MLOps and deployment examples

💡 Philosophy

Machine Learning = Data + Models + Business Impact

About

Machine Learning for Centrale Marseille

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors