- matplotlib
- scipy
- seaborn
- pandas
- numpy
- scikit-learn
- tensorflow
- pyodc (if you need to connect to a database)
For this project I used a connection to a SQL Database, so the pyodbc library is installed. But you could either import a csv file or make a connection to where your dataset is located. The 'database' folder contains some csv files used to explain some classes like Regressions and Clusters
- Pareto's Chart
- Histogram
- Bars' charts
- Scatter Plot
- Probability Plot
- Skew
- Variance, Covariance, Standard Deviation and Variance coefficient
- Correlation coefficient
- Standarize Normal Distribution (zscore)
- Central Limit Theorem
- Confidence Intervals. Z & T for var population known or unknown, respectively
- Confidence Interval. Dependent Samples
- Confidence Interval. Independent Samples
- Var populations known (populations normally distributed, var populations known, sample sizes differ)
- Var populations unknown but assumed are equals
- Var populations unknown but assumed are different
- Hypothesis Test. (Applied for each type of Confidence Interval)
- Simple linear Regression (OLS)
- Multiple linear Regression (OLS)
- Dummy variables
- Logistic Regression
- Binary predictors
- Supervised Models
- Regressions
- Classification
- Non-supervised Models (PENDING)
- Objective Function
- Loss Norma-L2 (Regressions)
- Cross Entropy (Classification)
- Optimization Algorithm
- Gradient descent
- Probability distributions
- Plot Confidence Intervals in normal distributions and probplots
- Simple linear Regression
- Other solution methods
- Log transformation variables as elasticity (economics)
- Durbin-Watson to detect no autocorrelation
- Other regression models to time-series or when the error terms (No Autocorrelation) are correlated
- Hiperparameters vs Parameters
- Amplitud
- Deep
- Learning rate (eta)
- Lot size
- Momentum coefficient (alpha)
- Decay coefficient (c)
- DeepLearning Activation functions
- sigmoide (0, 1)
- tanh (-1, 1)
- ReLu (0, inf)
- softmax (0, 1)
- Preventing overfitting. Early stop
- Set number of epochs
- Stop when delta <= 0.001
- Validation / Cross-validation
- Initialization
- Find the Global minimum (not local)
- Programming the Learning rate
- Preprocessing
- Standarization
- Normalization
- PCA
- Bleaching
- Categorical data
- Binary
- One-Hot
- Balanced previous/data
- RNCs and RNRs
- Other Deep Learning models
- Bayesian nets
- Generative models
- Decision Trees