Midterm Report - Group 12 - Course 11-785

We are using self-code align for our base implementation, although we will be testing the implementation on a smaller model set.

To run the evaluation of the model please use the evaluate_model.sh script present in the evaluation model. The base paper was implemented on A100 GPU, for our implementation we are doing it on a lesser compute, hence we have created scripts that will be useful for setting up the environment and executing smaller models.

Evaluation Script

Template to run evaluation:

./evaluate_model.sh <MODEL_KEY> <MODEL_PATH> <DATASET_NAME>

Parameters:

<MODEL_KEY>: Key of model
<MODEL_PATH>: Hugging Face path of model or local location of your model
<DATASET_NAME>: HumanEval or MBPP

Fine-tuning Script

To fine-tune the model, execute the following script which is present in the evaluation folder:

./finetune_model.sh <MODEL_KEY> <OUTPUT_DIR> <DATASET_FILE>

Parameters:

<MODEL_KEY>: The model to fine-tune
<OUTPUT_DIR>: Location where output model will be stored
<DATASET_FILE>: Instruction dataset

Name		Name	Last commit message	Last commit date
Latest commit History 117 Commits
Midterm-results		Midterm-results
evaluation		evaluation
execution_scripts		execution_scripts
prompts		prompts
seed_gathering		seed_gathering
src/star_align		src/star_align
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
create_dpo_pairs.py		create_dpo_pairs.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
sanitize.sh		sanitize.sh
self_ossinstruct_sc2.sh		self_ossinstruct_sc2.sh
self_ossinstruct_sc2_parallel.sh		self_ossinstruct_sc2_parallel.sh
train_dpo.py		train_dpo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Midterm Report - Group 12 - Course 11-785

Evaluation Script

Fine-tuning Script

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Midterm Report - Group 12 - Course 11-785

Evaluation Script

Fine-tuning Script

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages