Name		Name	Last commit message	Last commit date
parent directory ..
student_configs		student_configs
Readme.md		Readme.md
audo_distiller_candidates.csv		audo_distiller_candidates.csv
augmenter_config.json		augmenter_config.json
auto_distillation.sh		auto_distillation.sh
auto_distiller_exp.py		auto_distiller_exp.py
autoaug_redis.py		autoaug_redis.py
back_translation_augmenter_config.json		back_translation_augmenter_config.json
contextual_augmenter_config.json		contextual_augmenter_config.json
distillation.sh		distillation.sh
distiller.py		distiller.py
generate_exp_shell.py		generate_exp_shell.py
random_augmenter_config.json		random_augmenter_config.json

Readme.md

How to launch experiments on knowledge distillation

First you shoud prepare & download your data

Use this script to download GLUE datasets to directory datasets

python3 ../download_glue_data.py --data_dir ../datasets/glue_data --tasks all

And you can download SQuAD data here

And then you need to install the dependencies by running:

pip3 install -r ../src/Distiller/requirements.txt

Remember to install torch which is compatible with your CUDA version!

Then you can launch the experiment by:

bash distillation.sh

Or you can execute as a python script by:

python ../src/Distiller/distiller.py -- \
    --task_type glue \
    --task_name sst-2 \
    --data_dir ../datasets/glue_data \
    --T_model_name_or_path howey/electra-base-sst2 \
    --S_model_name_or_path howey/electra-small-sst2 \
    --output_dir output-student \
    --max_seq_length 128 \
    --train \
    --eval \
    --doc_stride 128 \
    --per_gpu_train_batch_size 32 \
    --seed 40 \
    --num_train_epochs 20 \
    --learning_rate 1e-4 \
    --thread 64 \
    --gradient_accumulation_steps 1} \
    --temperature 8 \
    --kd_loss_weight 1.0 \
    --kd_loss_type ce

For the meaning of hyperparameters, see configs.py

At this time, we can run experiments on SQuAD and GLUE, if you want to try experiments on other baselines, first rewrite a preprocess function such as glue_preprocess , and then import them in distiller.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experiments

experiments

Readme.md

How to launch experiments on knowledge distillation

Files

experiments

Directory actions

More options

Directory actions

More options

Latest commit

History

experiments

Folders and files

parent directory

Readme.md

How to launch experiments on knowledge distillation