STS-B-DIR

Installation

Prerequisites

Download the GloVe word embeddings (840B tokens, 300D vectors) using:

python glove/download_glove.py
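
Once the embeddings are downloaded, they can be inspected outside the training pipeline. The sketch below assumes the standard GloVe text format (one token per line, followed by its vector components separated by single spaces); `parse_glove_line` and `load_glove` are hypothetical helpers for illustration, not functions from this repository. It splits from the right because some tokens in the 840B release are reported to contain spaces.

```python
import io


def parse_glove_line(line, dim=300):
    """Split one line of a GloVe text file into (token, vector).

    The last `dim` space-separated fields are the vector; everything
    before them is the token, which may itself contain spaces.
    """
    parts = line.rstrip("\n").rsplit(" ", dim)
    if len(parts) != dim + 1:
        raise ValueError("expected a token followed by %d values" % dim)
    token = parts[0]
    vector = [float(x) for x in parts[1:]]
    return token, vector


def load_glove(handle, vocab=None, dim=300):
    """Read embeddings from an open text handle, optionally restricted to `vocab`."""
    embeddings = {}
    for line in handle:
        if not line.strip():
            continue  # skip blank lines
        token, vector = parse_glove_line(line, dim)
        if vocab is None or token in vocab:
            embeddings[token] = vector
    return embeddings


# Tiny in-memory stand-in for the real file, using 3-D vectors for brevity.
sample = io.StringIO("the 0.1 0.2 0.3\ncat -0.5 0.0 0.25\n. . . 1.0 2.0 3.0\n")
vectors = load_glove(sample, dim=3)
print(sorted(vectors))
```

For the real file, open it with `encoding="utf-8"` and keep the default `dim=300`; passing a `vocab` set avoids holding the full vocabulary in memory.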

(Optional) Both the original STS-B dataset and our balanced STS-B-DIR dataset are provided in the folder ./glue_data/STS-B. To reproduce the results in the paper, use the provided STS-B-DIR dataset. To try a different balanced split, delete the folder ./glue_data/STS-B and run:

python glue_data/create_sts.py

Dependencies

The dependencies for this task differ substantially from those of the other three tasks, so we recommend creating a separate environment for it. If you use conda, create the environment and install the dependencies with the following commands:

conda create -n sts python=3.6
conda activate sts
# PyTorch 0.4 (required) + CUDA 9.2
conda install pytorch=0.4.1 cuda92 -c pytorch
# other dependencies
pip install -r requirements.txt
# The latest "overrides" release that gets installed alongside allennlp 0.5.0 now raises an error,
# so downgrade "overrides" to version 3.1.0.
pip install overrides==3.1.0
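
Because the setup depends on exact versions, a quick check after installation can catch a wrong pin early. The following is a minimal sketch: `PINNED` and `find_mismatches` are hypothetical names, and the allennlp entry assumes that 0.5.0 (the version mentioned in the install comment) is what requirements.txt installs.

```python
# Versions taken from the install commands; the allennlp pin is an assumption
# based on the comment that mentions allennlp 0.5.0.
PINNED = {"torch": "0.4.1", "overrides": "3.1.0", "allennlp": "0.5.0"}


def find_mismatches(installed, pinned=PINNED):
    """Return {package: (expected, found)} for every pin that is not satisfied.

    A found version matches when it equals the pin or extends it with a
    dotted suffix such as a post-release tag ("0.4.1.post2").
    """
    problems = {}
    for name, expected in pinned.items():
        found = installed.get(name)
        matches = found is not None and (
            found == expected or found.startswith(expected + ".")
        )
        if not matches:
            problems[name] = (expected, found)
    return problems


# Example: an environment where "overrides" was not downgraded.
example = {"torch": "0.4.1", "overrides": "7.4.0", "allennlp": "0.5.0"}
print(find_mismatches(example))  # {'overrides': ('3.1.0', '7.4.0')}
```

In a real environment the `installed` mapping could be filled from `torch.__version__` and from `pkg_resources.get_distribution(name).version` for the pip-installed packages.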

Code Overview

Main Files

train_morebranch.py: main training and evaluation script
create_sts.py: downloads the original STS-B dataset and creates the STS-B-DIR dataset with balanced val/test sets
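
To illustrate what a balanced val/test split means for a regression target, here is a rough sketch. It is not the logic of create_sts.py: the function `balanced_split`, the equal-width binning of scores in [0, 5], and the per-bin sample size are all assumptions made for this example.

```python
import random
from collections import defaultdict


def balanced_split(scores, num_bins=5, per_bin=2, seed=0):
    """Split example indices into train/val/test with balanced val and test.

    Scores in [0, 5] are grouped into `num_bins` equal-width bins. From each
    bin, `per_bin` examples go to val and `per_bin` to test; the remainder
    stays in train, which therefore keeps the original imbalance.
    """
    rng = random.Random(seed)
    width = 5.0 / num_bins
    bins = defaultdict(list)
    for idx, score in enumerate(scores):
        b = min(int(score / width), num_bins - 1)  # a score of 5.0 joins the top bin
        bins[b].append(idx)

    train, val, test = [], [], []
    for b in sorted(bins):
        members = bins[b]
        if len(members) < 2 * per_bin:
            raise ValueError("bin %d has too few examples" % b)
        rng.shuffle(members)
        val.extend(members[:per_bin])
        test.extend(members[per_bin:2 * per_bin])
        train.extend(members[2 * per_bin:])
    return train, val, test


# Toy similarity scores, skewed toward the 3-4 range.
scores = [0.2] * 5 + [1.5] * 6 + [2.5] * 8 + [3.5] * 20 + [4.6] * 11
train, val, test = balanced_split(scores)
print(len(train), len(val), len(test))  # 30 10 10
```

Changing `seed` yields a different balanced split over the same data, which is the kind of variation that regenerating the dataset is meant to allow.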

Main Arguments

--num_branch: number of branches (experts) in the model
--reweight: cost-sensitive re-weighting scheme to use
--loss: type of training loss
--resume: whether to resume training (training only)
--evaluate: run evaluation only
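
As a reading aid, the flags above could be declared roughly as follows. This is a hypothetical mirror, not the parser in train_morebranch.py: the types, defaults, and the treatment of `--resume`, `--evaluate`, and `--dynamic_loss` as boolean switches are assumptions inferred from how the flags appear in this README (`--dynamic_loss` and `--eval_model` are taken from the Training and Evaluation commands).

```python
import argparse


def build_parser():
    """Hypothetical mirror of the documented flags; defaults are placeholders."""
    parser = argparse.ArgumentParser(description="Sketch of the documented flags")
    parser.add_argument("--num_branch", type=int, default=1,
                        help="number of branches (experts) in the model")
    parser.add_argument("--reweight", type=str, default=None,
                        help="cost-sensitive re-weighting scheme")
    parser.add_argument("--loss", type=str, default=None,
                        help="type of training loss")
    parser.add_argument("--resume", action="store_true",
                        help="resume training (assumed to be a boolean switch)")
    parser.add_argument("--evaluate", action="store_true",
                        help="run evaluation only")
    parser.add_argument("--dynamic_loss", action="store_true",
                        help="assumed boolean switch; semantics not documented here")
    parser.add_argument("--eval_model", type=str, default=None,
                        help="path to the checkpoint to evaluate")
    return parser


# Parse the same arguments as the training example in this README.
args = build_parser().parse_args(
    ["--loss", "l1nll", "--num_branch", "3", "--dynamic_loss"]
)
print(args.num_branch, args.loss, args.dynamic_loss, args.evaluate)
```

Consult the script itself for the accepted values of `--loss` and `--reweight`; they are deliberately left unconstrained here.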

Training

# For example, train a 3-expert model
python train_morebranch.py --loss l1nll --num_branch 3 --dynamic_loss

Evaluation

python train_morebranch.py [...evaluation model arguments...] --evaluate --eval_model <path_to_evaluation_ckpt>

Pretrained model

model for sts-b
