-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy path.gitignore
More file actions
85 lines (67 loc) · 1.01 KB
/
.gitignore
File metadata and controls
85 lines (67 loc) · 1.01 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
# Python
__pycache__/
*.pyc
*.pyo
*.egg-info/
dist/
build/
# Virtual environments
.env
.venv/
venv/
ai-env/
# Model weights & checkpoints
checkpoints/
*.ckpt
*.pt
*.pth
# Training data (large binary files & raw datasets)
data/*.bin
data/*.log
data/korean_extra/
data/code/
data/sft_*/
*.parquet
*.jsonl
*.xz
*.arrow
# Logging & experiment tracking
wandb/
*/tensorboard/
nohup.out
# Temp benchmark configs
/tmp/
# IDE
.idea/
.vscode/
# OS
.DS_Store
# Claude Code local settings
.claude/
# Logs
*.log
# Tokenizer model files (large)
tokenizer/*.model
tokenizer/korean_sp/
# Raw data (huge text files)
data/raw/
data/raw/**/*.txt
!tokenizer/merges.txt
# Archives
*.tar.zst
*.tar.gz
*.zip
# Backup files
*.bak
# NCloud credentials & backups
download_from_ncloud.py
ncloud_backup/
# SFT data
data/sft_*.jsonl
# Eval outputs & reports (generated, not source)
eval/outputs/
reports/
# HF export (large model files)
hf_export/
# Do NOT ignore (for clarity):
# *.py, *.yaml (configs/), *.sh, *.md, tokenizer/merges.txt, CLAUDE.md