This is the repo for SRLasSDGP, a novel approach to form span-based SRL as word-based graph parsing, to be presented at COLING2022. The paper can be found at here.
This paper proposes to cast end-to-end span-based SRL as a word-based graph parsing task. The major challenge is how to represent spans at the word level. Borrowing ideas from research on Chinese word segmentation and named entity recognition, we propose and compare four different schemata of graph representation, i.e., BES, BE, BIES, and BII, among which we find that the BES schema performs the best. We further gain interesting insights through detailed analysis. Moreover, we propose a simple constrained Viterbi procedure to ensure the legality of the output graph in the sense of SRL. We conduct experiments on two widely used benchmark datasets, i.e., CoNLL05 and CoNLL12. Results show that our word-based graph parsing approach achieves new state-of-the-art (SOTA) performance under the end-to-end setting, both without and with pre-trained language models (PLMs). More importantly, our model can parse 669/252 sentences per second, without and with PLMs respectively. Under the simpler predicate-given setting, our approach, after certain model modification, achieves comparable performance with previous SOTA.
pip install -r requirements.txt
Run the following command to get the data in form of BES. The PTB is needed.
bash scripts/conll05.sh PTB=<path-to-ptb> SRL=data
python -m supar.cmds.vi_SRL train -b \
--train data/conll05/train-BES.conllu \
--dev data/conll05/dev-BES.conllu \
--test data/conll05/test-BES.conllu \
--batch-size 3000 \
--n-embed 100 \
--feat lemma,char \
--itp 0.06 \
--embed data/glove.6B.300d.txt \
--n_pretrained_embed 300 \
--min_freq 7 \
--split \
--seed 1 \
--schema BES \
-p exp/exp1/model \
-d 5
- You can add the "--train_given_prd" in the script to train a predicate-given model.
- "--schema" to specify which schema to use.
- "-d" to specify which gpu to use.
- Detailed description about args can be found in "supar/cmds/cmd.py" and "supar/cmds/vi_SRL.py".
python -m supar.cmds.vi_SRL predict --data data/conll05/test-BES.conllu \
-p exp/exp1/model \
--pred BES-wsj.pred \
--gold data/conll05/conll05.test.props.gold.txt \
--task 05 \
--schema BES \
--vtb \
-d 4
- "--vtb" means whether to use constrained viterbi.
# switch to the finetune-given_prd
git checkout finetune-given_prd
# training
python -m supar.cmds.vi_srl train -b \
--train data/conll05/train-BES.conllu \
--dev data/conll05/dev-BES.conllu \
--test data/conll05/test-BES.conllu \
--batch-size 500 \
--encoder bert \
--itp 0.06 \
--bert bert-large-uncased \
--split \
--lr_rate 1 \
--seed 1 \
--train_given_prd \
-p exp/exp2/model \
-d 4
# prediction and evaluation
python -m supar.cmds.vi_srl predict --data data/conll05/test-BES.conllu \
-p exp/exp2/model \
--pred BES-wsj-bert.pred \
--gold data/conll05/conll05.test.props.gold.txt \
--task 05 \
--schema BES \
--vtb \
-d 4
- use "--bert" to specify which language model, currently support bert, roberta, xlnet.
- You can delete the "--train_given_prd" in the script to train a end-to-end model.
- Here, we currently provide two model trained on conll2005 with bert in BES schema, i.e., end-to-end model, predicate-given model.