Computer Vision Final Project

Model Code

공식 Github Repo Clone

git clone https://github.com/facebookresearch/mae.git

main_linprobe.py 파일 수정

Pretrained ViT Base Model

공식 Github Repo에서 모델 다운로드: https://dl.fbaipublicfiles.com/mae/finetune/mae_finetuned_vit_base.pth
실행 파일들과 같은 경로에 둠

Dataset

Huggingface api 활용을 위해 token 발급
.env 파일 생성 후 TOKEN, DATA_PATH 변수 설정
process_data.py 실행
DATA_PATH의 폴더명 'validation'을 'vali'로 변경

Extra Analysis

analysis.ipynb 실행

Linear Probing Baseline

OMP_NUM_THREADS=1 torchrun --nproc_per_node=2 main_linprobe.py \
    --batch_size 512 \
    --model vit_base_patch16 --cls_token \
    --finetune ${PRETRAIN_CHKPT} \
    --epochs 90 \
    --blr 0.1 \
    --weight_decay 0.0 \
    --dist_eval --data_path ${IMAGENET_DIR} \
    --num_workers 32 \
    --output_dir ./output_linprobe_linear \
    --log_dir ./output_dir_linear \
    --head_type linear

Linear Probing MLP ver.

OMP_NUM_THREADS=1 torchrun --nproc_per_node=2 main_linprobe.py \
    --batch_size 512 \
    --model vit_base_patch16 --cls_token \
    --finetune ${PRETRAIN_CHKPT} \
    --epochs 90 \
    --blr 0.1 \
    --weight_decay 0.0 \
    --dist_eval --data_path ${IMAGENET_DIR} \
    --num_workers 32 \
    --output_dir ./output_linprobe_mlp \
    --log_dir ./output_dir_mlp \
    --head_type mlp

Linear Probing Transformer Block ver.

OMP_NUM_THREADS=1 torchrun --nproc_per_node=2 main_linprobe.py \
    --batch_size 512 \
    --model vit_base_patch16 --cls_token \
    --finetune ${PRETRAIN_CHKPT} \
    --epochs 90 \
    --blr 0.1 \
    --weight_decay 0.0 \
    --dist_eval --data_path ${IMAGENET_DIR} \
    --num_workers 32 \
    --output_dir ./output_linprobe_tf \
    --log_dir ./output_dir_tf \
    --head_type transformer

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
outputs		outputs
util		util
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
FINETUNE.md		FINETUNE.md
PRETRAIN.md		PRETRAIN.md
README.md		README.md
analysis.ipynb		analysis.ipynb
cls_token_tsne_analysis.png		cls_token_tsne_analysis.png
engine_finetune.py		engine_finetune.py
engine_pretrain.py		engine_pretrain.py
linear_head_weights_tsne.png		linear_head_weights_tsne.png
main_finetune.py		main_finetune.py
main_linprobe.py		main_linprobe.py
main_pretrain.py		main_pretrain.py
models_mae.py		models_mae.py
models_vit.py		models_vit.py
process_data.py		process_data.py
submitit_finetune.py		submitit_finetune.py
submitit_linprobe.py		submitit_linprobe.py
submitit_pretrain.py		submitit_pretrain.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computer Vision Final Project

Model Code

Pretrained ViT Base Model

Dataset

Extra Analysis

Linear Probing Baseline

Linear Probing MLP ver.

Linear Probing Transformer Block ver.

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Computer Vision Final Project

Model Code

Pretrained ViT Base Model

Dataset

Extra Analysis

Linear Probing Baseline

Linear Probing MLP ver.

Linear Probing Transformer Block ver.

About

Resources

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages