This repo has been rendered for the initial purpose of introducing Arend M. W., Roland (no middle name) C., and Caroline R. H. to the coding world.
- Make sure
gitis installed
## make sure u have homebrew:
# /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
brew install git- Install & setup Python3 (the right way)
python -V # check the version- I recommend
pyenv...
brew install pyenv
pyenv install 3.8.7
pyenv global 3.8.7
pip install ipython # source ~/.bash_profile- Install & setup your IDE(s)/Text editor(s)/etc...
- VSCode: https://code.visualstudio.com/download
- Jupyter
pip install jupyterlab
- Clone this repository
- Learn how to read the data:
data/nrg_fng_04022019.f(hint: file type is offeatherformat) - Commit, push, and pull changes!
- Look into
Pythonmachine learning classification algorithms, install the relevant packages, get familiar with hyperparameters - Train a basic model (with data provided by Sam). Explore its accuracy. Test it on some names.
- Try grid searching or randomized hyperparameter tuning (hint:
GridSearchCV&RandomSearchCV) - Also consider different ways of pre-processing your data & feature engineering before training some models (hint:
CountVectorizercan be helpful!)
- TBD... stay tuned!!
DEBUGGING NOTES: We noticed you're using a conda environment. If you are experiencing issues with this environment in the integrated terminal, we recommend that you let the Python extension change "terminal.integrated.inheritEnv" to false in your user settings.
For compilers to find readline you may need to set: export LDFLAGS="-L/usr/local/opt/readline/lib" export CPPFLAGS="-I/usr/local/opt/readline/include"
For pkg-config to find readline you may need to set: export PKG_CONFIG_PATH="/usr/local/opt/readline/lib/pkgconfig"
E-Commerce Data: https://www.kaggle.com/carrie1/ecommerce-data
- Kaggle retail data downloadable
- GitHub: https://github.com/the-pudding/data/tree/master/pockets
measurements.csvIncludes many variables (categorical and continuous), ie: measurements, style, brand, menWomen, name, fabric, price
- Can search different keywords
https://www.kaggle.com/rmisra/clothing-fit-dataset-for-size-recommendation
os.system('pip install pycountry-convert')