Relation extraction component - assertion error raised #12755

stellaires · 2023-06-27T00:43:30Z

stellaires
Jun 27, 2023

Issue

While using the relation extraction component, I'm regularly coming across an AssertionError related to thinc while training the component on data annotated with Prodigy.

Code and configuration

Code has been updated following a thread in Prodigy discussion forum. Updated code and configuration files are available in this repository. I've deleted assets, data and training folders as well as erased my labels in SYMM_LABEL and DIRECTED_LABELS in scripts/parse_data_generic.py file to ensure data confidentiality.

CLI

Data command output :

(venv) rel_component$ python3 -m spacy project run data
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

==================================== data ====================================
Running command: /venv/bin/python3 ./scripts/parse_data_generic.py assets/annotations.jsonl data/train.spacy data/dev.spacy data/test.spacy
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ 3 training sentences, 25/25 pos instances.
ℹ 1 dev sentences, 1/1 pos instances.
ℹ 1 test sentences, 12/12 pos instances.

Training command output (raising the error) :

(venv) rel_component$ python3 -m spacy project run train_joint_cpu
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

============================== train_joint_cpu ==============================
Running command: venv/bin/python3 -m spacy train configs/rel_joint.cfg --output training --paths.train data/train.spacy --paths.dev data/dev.spacy -c ./scripts/custom_functions.py
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Saving to output directory: training
ℹ Using CPU

=========================== Initializing pipeline ===========================
[2023-06-27 00:14:44,989] [INFO] Set up nlp object from config
[2023-06-27 00:14:44,997] [INFO] Pipeline: ['tok2vec', 'ner', 'relation_extractor']
[2023-06-27 00:14:44,999] [INFO] Created vocabulary
[2023-06-27 00:14:44,999] [INFO] Finished initializing nlp object
[2023-06-27 00:14:45,057] [INFO] Initialized pipeline components: ['tok2vec', 'ner', 'relation_extractor']
✔ Initialized pipeline

============================= Training pipeline =============================
ℹ Pipeline: ['tok2vec', 'ner', 'relation_extractor']
ℹ Set annotations on update for: ['ner']
ℹ Initial learn rate: 0.001
E # LOSS TOK2VEC LOSS NER LOSS RELAT... ENTS_F ENTS_P ENTS_R REL_MICRO_P REL_MICRO_R REL_MICRO_F SCORE

ℹ Could not determine any instances in doc.
ℹ Could not determine any instances in any docs - can not make any
predictions.
⚠ Aborting and saving the final best model. Encountered exception:
AssertionError()
Traceback (most recent call last):
File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "venv/lib/python3.10/site-packages/spacy/main.py", line 4, in
setup_cli()
File "venv/lib/python3.10/site-packages/spacy/cli/_util.py", line 74, in setup_cli
command(prog_name=COMMAND)
File "venv/lib/python3.10/site-packages/click/core.py", line 1130, in call
return self.main(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/core.py", line 778, in main
return _main(
File "venv/lib/python3.10/site-packages/typer/core.py", line 216, in _main
rv = self.invoke(ctx)
File "venv/lib/python3.10/site-packages/click/core.py", line 1657, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "venv/lib/python3.10/site-packages/click/core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "venv/lib/python3.10/site-packages/click/core.py", line 760, in invoke
return __callback(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/main.py", line 683, in wrapper
return callback(**use_params) # type: ignore
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 45, in train_cli
train(config_path, output_path, use_gpu=use_gpu, overrides=overrides)
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 75, in train
train_nlp(nlp, output_path, use_gpu=use_gpu, stdout=sys.stdout, stderr=sys.stderr)
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 124, in train
raise e
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 107, in train
for batch, info, is_best_checkpoint in training_step_iterator:
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 232, in train_while_improving
score, other_scores = evaluate()
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 287, in evaluate
scores = nlp.evaluate(dev_corpus(nlp))
File "venv/lib/python3.10/site-packages/spacy/language.py", line 1415, in evaluate
for eg, doc in zip(examples, docs):
File "venv/lib/python3.10/site-packages/spacy/language.py", line 1574, in pipe
for doc in docs:
File "venv/lib/python3.10/site-packages/spacy/util.py", line 1653, in _pipe
yield from proc.pipe(docs, **kwargs)
File "spacy/pipeline/trainable_pipe.pyx", line 79, in pipe
File "venv/lib/python3.10/site-packages/spacy/util.py", line 1672, in raise_error
raise e
File "spacy/pipeline/trainable_pipe.pyx", line 75, in spacy.pipeline.trainable_pipe.TrainablePipe.pipe
File "rel_component/scripts/rel_pipe.py", line 90, in predict
scores = self.model.predict(docs)
File "venv/lib/python3.10/site-packages/thinc/model.py", line 315, in predict
return self._func(self, X, is_train=False)[0]
File "venv/lib/python3.10/site-packages/thinc/layers/chain.py", line 55, in forward
Y, inc_layer_grad = layer(X, is_train=is_train)
File "venv/lib/python3.10/site-packages/thinc/model.py", line 291, in call
return self._func(self, X, is_train=is_train)
File "rel_component/scripts/rel_model.py", line 78, in instance_forward
pooled, bp_pooled = pooling(entities, is_train)
File "venv/lib/python3.10/site-packages/thinc/model.py", line 291, in call
return self._func(self, X, is_train=is_train)
File "venv/lib/python3.10/site-packages/thinc/layers/reduce_mean.py", line 19, in forward
Y = model.ops.reduce_mean(cast(Floats2d, Xr.data), Xr.lengths)
File "thinc/backends/numpy_ops.pyx", line 318, in thinc.backends.numpy_ops.NumpyOps.reduce_mean
AssertionError

Occasionnaly, on other samples of training data, I also encounter the following error during the data command execution :

==================================== data ====================================
Running command: venv/bin/python3 ./scripts/parse_data_generic.py assets/annotations.jsonl data/train.spacy data/dev.spacy data/test.spacy
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
⚠ Could not parse any entities from the JSON file.
ℹ 12 training sentences, 43/43 pos instances.
ℹ 2 dev sentences, 15/15 pos instances.
ℹ 1 test sentences, 8/8 pos instances.

Data sample

For confidentialy issues, I can't share the training data. It has been annotated with Prodigy. Here is a small example to check the data format returned by Prodigy :

{"text":"The equipment, the Supplier and the processes applied must comply with AAAAAAA and any applicable documents","_input_hash":964968797,"_task_hash":-1368578864,"_is_binary":false,"spans":[{"start":4,"end":13,"token_start":1,"token_end":1,"label":"HARDWARE"},{"start":19,"end":27,"token_start":4,"token_end":4,"label":"ROLE"},{"start":36,"end":45,"token_start":7,"token_end":7,"label":"PROCESS"},{"start":71,"end":78,"token_start":12,"token_end":12,"label":"STANDARD"},{"start":87,"end":97,"token_start":15,"token_end":15,"label":"CONDITION"},{"start":98,"end":107,"token_start":16,"token_end":16,"label":"DOCUMENT"}],"tokens":[{"text":"The","start":0,"end":3,"id":0,"ws":true,"disabled":false},{"text":"equipment","start":4,"end":13,"id":1,"ws":false,"disabled":false},{"text":",","start":13,"end":14,"id":2,"ws":true,"disabled":false},{"text":"the","start":15,"end":18,"id":3,"ws":true,"disabled":false},{"text":"Supplier","start":19,"end":27,"id":4,"ws":true,"disabled":false},{"text":"and","start":28,"end":31,"id":5,"ws":true,"disabled":false},{"text":"the","start":32,"end":35,"id":6,"ws":true,"disabled":false},{"text":"processes","start":36,"end":45,"id":7,"ws":true,"disabled":false},{"text":"applied","start":46,"end":53,"id":8,"ws":true,"disabled":false},{"text":"must","start":54,"end":58,"id":9,"ws":true,"disabled":false},{"text":"comply","start":59,"end":65,"id":10,"ws":true,"disabled":false},{"text":"with","start":66,"end":70,"id":11,"ws":true,"disabled":false},{"text":"AAAAAAA","start":71,"end":78,"id":12,"ws":true,"disabled":false},{"text":"and","start":79,"end":82,"id":13,"ws":true,"disabled":false},{"text":"any","start":83,"end":86,"id":14,"ws":true,"disabled":false},{"text":"applicable","start":87,"end":97,"id":15,"ws":true,"disabled":false},{"text":"documents","start":98,"end":107,"id":16,"ws":false,"disabled":false}],"_view_id":"relations","relations":[{"head":16,"child":15,"head_span":{"start":98,"end":107,"token_start":16,"token_end":16,"label":"DOCUMENT"},"child_span":{"start":87,"end":97,"token_start":15,"token_end":15,"label":"CONDITION"},"color":"#96e8ce","label":"IN_CONDITION"},{"head":1,"child":12,"head_span":{"start":4,"end":13,"token_start":1,"token_end":1,"label":"HARDWARE"},"child_span":{"start":71,"end":78,"token_start":12,"token_end":12,"label":"STANDARD"},"color":"#ffdaf9","label":"COMPLY_WITH"},{"head":4,"child":12,"head_span":{"start":19,"end":27,"token_start":4,"token_end":4,"label":"ROLE"},"child_span":{"start":71,"end":78,"token_start":12,"token_end":12,"label":"STANDARD"},"color":"#ffdaf9","label":"COMPLY_WITH"},{"head":7,"child":12,"head_span":{"start":36,"end":45,"token_start":7,"token_end":7,"label":"PROCESS"},"child_span":{"start":71,"end":78,"token_start":12,"token_end":12,"label":"STANDARD"},"color":"#ffdaf9","label":"COMPLY_WITH"},{"head":1,"child":16,"head_span":{"start":4,"end":13,"token_start":1,"token_end":1,"label":"HARDWARE"},"child_span":{"start":98,"end":107,"token_start":16,"token_end":16,"label":"DOCUMENT"},"color":"#ffdaf9","label":"COMPLY_WITH"},{"head":4,"child":16,"head_span":{"start":19,"end":27,"token_start":4,"token_end":4,"label":"ROLE"},"child_span":{"start":98,"end":107,"token_start":16,"token_end":16,"label":"DOCUMENT"},"color":"#ffdaf9","label":"COMPLY_WITH"},{"head":7,"child":16,"head_span":{"start":36,"end":45,"token_start":7,"token_end":7,"label":"PROCESS"},"child_span":{"start":98,"end":107,"token_start":16,"token_end":16,"label":"DOCUMENT"},"color":"#ffdaf9","label":"COMPLY_WITH"}],"answer":"accept","_timestamp":1687273794}

It seems like the data is not correctly read. It is named annotations.jsonl and put in assets folder. It is created thanks to the db-out command from Prodigy.

If you need any additional information; please let me know.

Answered by svlandeg

Jul 10, 2023

Hi Stella,

Thanks, that's all very useful.

Of course, my final training data won't contain 21 examples, but hopefully more than a thousand. But if the workflow is not working or if there is anything wrong with the annotation process, I think it is not wise to focus on the annotation.

Totally - I agree. I wanted to check whether you already have more data annotated right now, to verify whether the errors still occur if you'd use all of the data available.

First, the model training part works perfectly on very few data inputs (maybe because there is not enough data to train a rel model, so it may be skipped). Then, on a bigger dataset, it could trigger the "occasionaly' error (like if th…

View full answer

svlandeg · 2023-06-28T16:10:04Z

svlandeg
Jun 28, 2023
Maintainer

Hi Stella,

From your config file, I understand that you're training the NER model and the REL model at the same time, adding the NER to annotating_components which makes sense. In your sample text, it certainly looks like there are both entities and relations annotated.

Yet, we're getting this warning:

ℹ Could not determine any instances in doc.
ℹ Could not determine any instances in any docs - can not make any
predictions.

This warning points to the fact that the relation extractor doesn't have any instances to train on. This most likely happens because at that point in time, the NER wasn't yet trained sufficiently to actually recognize the entities from the gold REL data, and thus the REL couldn't continue onwards.

File "thinc/backends/numpy_ops.pyx", line 318, in thinc.backends.numpy_ops.NumpyOps.reduce_mean
AssertionError

This AssertionError is really just a bad way of failing downstream - our underlying Thinc ML library gets an empty batch and crashes ungracefully. We're looking into fixing this so the error is less confusing, but that ultimately won't address the core of the issue here.

For the REL to be able to work, we need to make sure that the NER actually works by itself. Have you tried training the NER in isolation (i.e. with no relations?). If you have a sufficiently well working NER model, then we can use that as a frozen model in the next phase when training the REL. I think this will result in a more stable approach.

4 replies

stellaires Jun 30, 2023
Author

Hi Sofie,

Exactly, I'm training both models simultaneously.

Concerning the AssertionError, it's good to know.

Yes, I've made sure that the NER model works by itself. Here is the CLI and its output (labels anonymized) :

python3 -m prodigy train dataset/ --ner dataset_en --eval-split 0.1 --base-model en_core_web_sm --label-stats

venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Using CPU

========================= Generating Prodigy config =========================
ℹ Auto-generating config with spaCy
ℹ Using config from base model
✔ Generated training config

=========================== Initializing pipeline ===========================
[2023-06-30 14:10:27,735] [INFO] Set up nlp object from config
Components: ner
Merging training and evaluation data for 1 components

[ner] Training: 21 | Evaluation: 2 (10% split)
Training: 21 | Evaluation: 2
Labels: ner (13)
[2023-06-30 14:10:27,748] [INFO] Pipeline: ['tok2vec', 'tagger', 'parser', 'attribute_ruler', 'lemmatizer', 'ner']
[2023-06-30 14:10:27,748] [INFO] Resuming training for: ['ner', 'tok2vec']
[2023-06-30 14:10:27,753] [INFO] Created vocabulary
[2023-06-30 14:10:27,754] [INFO] Finished initializing nlp object
[2023-06-30 14:10:27,754] [INFO] Initialized pipeline components: []
✔ Initialized pipeline

============================= Training pipeline =============================
Components: ner
Merging training and evaluation data for 1 components

[ner] Training: 21 | Evaluation: 2 (10% split)
Training: 21 | Evaluation: 2
Labels: ner (13)
ℹ Pipeline: ['tok2vec', 'tagger', 'parser', 'attribute_ruler',
'lemmatizer', 'ner']
ℹ Frozen components: ['tagger', 'parser', 'attribute_ruler',
'lemmatizer']
ℹ Initial learn rate: 0.001
E # LOSS TOK2VEC LOSS NER ENTS_F ENTS_P ENTS_R SPEED SCORE

0 0 0.00 35.69 50.00 100.00 33.33 3561.56 0.50
204 1000 0.00 2445.67 80.00 100.00 66.67 4134.01 0.80
785 2000 0.00 63.27 80.00 100.00 66.67 4082.90 0.80
1785 3000 0.00 17.41 80.00 100.00 66.67 4107.30 0.80
2785 4000 0.00 9.36 80.00 100.00 66.67 4103.55 0.80
3785 5000 0.00 28.53 80.00 100.00 66.67 4127.07 0.80
4785 6000 0.00 50.15 80.00 100.00 66.67 4091.48 0.80
✔ Saved pipeline to output directory
/model-last

=============================== NER (per type) ===============================
            P        R        F
A 100.00 100.00 100.00
B 100.00 100.00 100.00
C 0.00 0.00 0.00

According to the output, the NER model works. So using the NER model looks a more promising approach ? How should I proceed ? (More precisely, what would be the appropriate configuration file ?)

svlandeg Jul 4, 2023
Maintainer

Hi Stella,

It's become more difficult for me to follow the exact trace of this thread, as you've edited the original post you made after I replied to it. Specifically, I made a comment about your config file, which I now can't see in the post anymore. I see that instead you've linked a directory which contains different config files etc. While it's useful to share as much code & data that you've got, it's also necessary that I understand exactly which scripts & config files were run.

I also see that you added a section "Occasionnaly, on other samples of training data, ...". As I said before, it's unhelpful to be mixing various problems into one thread. If you want to address this issue also, please open a separate thread and clarify which code & data resulted in that error, so we can look into it as a separate thing. Here, I'll try to stay focused on the issue with the NER data and the REL not getting instances to train from.

From the output logs you've shared, it looks like you're not always using the same data set, is that correct? Throughout this thread, I see varying mentions of data sets: one with 12 training instances, one with 3, one with 21. It would be good to keep this fixed in order for us to be able to drill down onto the problem.

Looking at the latest log you posted from training the NER:

[ner] Training: 21 | Evaluation: 2 (10% split)
Training: 21 | Evaluation: 2
Labels: ner (13)

Am I understanding this correctly that you've got 21 instances to train on, and 13 distinct labels in your dataset? If so, that's definitely too little data to train an NER model on. You're evaluating on only 2 instances, which means that the reported 80% F-score is extremely unreliable, and your model is unlikely to generalize properly to unseen data.

This may seem like a detail to you - but I still believe that too little data, and an insufficiently trained NER model, is actually the root cause of the error you saw in your original post. Specifically, the REL needs to build on an NER model that is robust and accurate, which is not something you can achieve with only 21 instances for 13 labels, I'm afraid.

Do you happen to have more data annotated?

stellaires Jul 6, 2023
Author

Hi Sofie,

Sorry about that. Actually, the config file previously described was rel_joint.cfg, but rel_tok2vec.cfg also looks involved in the evaluation process, that's why I've linked the whole repository, to prevent lack of data.

I think concerning the "occasionaly..." part, it may be linked to the same issue. I'll try to clarify this later, as it could be confusing.
I've tried to run the workflow on training data throughout the whole annotation process, in an attempt to test it and find a minimal reproducible example that triggers an error.

Of course, my final training data won't contain 21 examples, but hopefully more than a thousand. But if the workflow is not working or if there is anything wrong with the annotation process, I think it is not wise to focus on the annotation. My thought was to ensure that the workflow works from beginning to end before going too deep in the annotation process.

So I'll explain why I think the "occasionaly" part may concern the same issue. I have one very small dataset (approximately 30 examples). I'm sometimes deleting few data samples to test on a minimal reproducible example. First, the model training part works perfectly on very few data inputs (maybe because there is not enough data to train a rel model, so it may be skipped). Then, on a bigger dataset, it could trigger the "occasionaly' error (like if the training data was not correctly parsed, yet it is valid). On the biggest dataset, it triggers the error described in the thread.

I understand that the insufficiently trained NER model could trigger the error. But isn't there a way to ensure the component stability without first spending time on developing the training data ?

I don't have more data annotated for the moment, as the fact that an errror was triggered prevented me from going too deep into this process.

Thanks.

stellaires Jul 7, 2023
Author

I've annotated more data (as much as possible in the span of time as the data is very complex and from a very specific domain of application).

The NER model alone can be trained without any issue :

venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Using CPU

========================= Generating Prodigy config =========================
ℹ Auto-generating config with spaCy
ℹ Using config from base model
✔ Generated training config

=========================== Initializing pipeline ===========================
[2023-07-07 16:03:46,156] [INFO] Set up nlp object from config
Components: ner
Merging training and evaluation data for 1 components

[ner] Training: 52 | Evaluation: 5 (10% split)
Training: 52 | Evaluation: 5
Labels: ner (15)
[2023-07-07 16:03:46,180] [INFO] Pipeline: ['tok2vec', 'tagger', 'parser', 'attribute_ruler', 'lemmatizer', 'ner']
[2023-07-07 16:03:46,180] [INFO] Resuming training for: ['ner', 'tok2vec']
[2023-07-07 16:03:46,185] [INFO] Created vocabulary
[2023-07-07 16:03:46,186] [INFO] Finished initializing nlp object
[2023-07-07 16:03:46,186] [INFO] Initialized pipeline components: []
✔ Initialized pipeline

============================= Training pipeline =============================
Components: ner
Merging training and evaluation data for 1 components

[ner] Training: 52 | Evaluation: 5 (10% split)
Training: 52 | Evaluation: 5
Labels: ner (15)
ℹ Pipeline: ['tok2vec', 'tagger', 'parser', 'attribute_ruler',
'lemmatizer', 'ner']
ℹ Frozen components: ['tagger', 'parser', 'attribute_ruler',
'lemmatizer']
ℹ Initial learn rate: 0.001
E # LOSS TOK2VEC LOSS NER ENTS_F ENTS_P ENTS_R SPEED SCORE

0 0 0.00 34.07 0.00 0.00 0.00 7745.12 0.00
55 1000 0.00 8431.98 40.78 40.38 41.18 8666.85 0.41
211 2000 0.00 1249.35 43.14 43.14 43.14 8450.23 0.43
534 3000 0.00 2036.24 44.00 44.90 43.14 8739.78 0.44
867 4000 0.00 1971.61 47.52 48.00 47.06 8792.30 0.48
1201 5000 0.00 1915.20 46.00 46.94 45.10 8775.19 0.46
1534 6000 0.00 1873.26 48.08 47.17 49.02 8725.52 0.48
1867 7000 0.00 1882.31 47.52 48.00 47.06 8838.68 0.48
2201 8000 0.00 1838.51 48.00 48.98 47.06 8830.35 0.48
2534 9000 0.00 1875.58 45.10 45.10 45.10 8770.49 0.45
2867 10000 0.00 1816.19 47.06 47.06 47.06 8799.50 0.47
3201 11000 0.00 1850.27 47.52 48.00 47.06 8759.62 0.48
✔ Saved pipeline to output directory
model-last

=============================== NER (per type) ===============================
             P        R        F
A 45.83 68.75 55.00
B 0.00 0.00 0.00
C 0.00 0.00 0.00
D 0.00 0.00 0.00
E 33.33 100.00 50.00
F 100.00 83.33 90.91
G 0.00 0.00 0.00
H 0.00 0.00 0.00
I 0.00 0.00 0.00
J 100.00 100.00 100.00
K 0.00 0.00 0.00
L 100.00 44.44 61.54

The NER + rel model trained together has issues. The triggered issue is inconsistent : with less data, I was able to train a model once without an issue, and without changing anything except deleting the model and .spacy files, the rerun triggered an issue.

With the actual training data, the issue triggered depends on the run. I'll show you multiple runs of the same command on the same data :

First run :

venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Fetching 1 asset(s)
✔ Asset already exists:
rel_component/assets/annotations.jsonl
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

==================================== data ====================================
Running command: venv/bin/python3 ./scripts/parse_data_generic.py assets/annotations.jsonl data/train.spacy data/dev.spacy data/test.spacy
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
⚠ Could not parse any entities from the JSON file.
ℹ 38 training sentences, 329/329 pos instances.
ℹ 7 dev sentences, 71/71 pos instances.
ℹ 3 test sentences, 23/23 pos instances.
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

============================== train_joint_cpu ==============================
Running command: venv/bin/python3 -m spacy train configs/rel_joint.cfg --output training --paths.train data/train.spacy --paths.dev data/dev.spacy -c ./scripts/custom_functions.py
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Saving to output directory: training
ℹ Using CPU

=========================== Initializing pipeline ===========================
[2023-07-07 16:23:58,300] [INFO] Set up nlp object from config
[2023-07-07 16:23:58,309] [INFO] Pipeline: ['tok2vec', 'ner', 'relation_extractor']
[2023-07-07 16:23:58,311] [INFO] Created vocabulary
[2023-07-07 16:23:58,312] [INFO] Finished initializing nlp object
[2023-07-07 16:23:58,481] [INFO] Initialized pipeline components: ['tok2vec', 'ner', 'relation_extractor']
✔ Initialized pipeline

============================= Training pipeline =============================
ℹ Pipeline: ['tok2vec', 'ner', 'relation_extractor']
ℹ Set annotations on update for: ['ner']
ℹ Initial learn rate: 0.001
E # LOSS TOK2VEC LOSS NER LOSS RELAT... ENTS_F ENTS_P ENTS_R REL_MICRO_P REL_MICRO_R REL_MICRO_F SCORE

ℹ Could not determine any instances in doc.
0 0 0.00 35.61 0.00 4.48 4.50 4.46 0.01 50.00 0.01 0.02
⚠ Aborting and saving the final best model. Encountered exception:
ValueError('operands could not be broadcast together with shapes (4750,17)
(1788,17) ')
Traceback (most recent call last):
File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "venv/lib/python3.10/site-packages/spacy/main.py", line 4, in
setup_cli()
File "venv/lib/python3.10/site-packages/spacy/cli/_util.py", line 74, in setup_cli
command(prog_name=COMMAND)
File "venv/lib/python3.10/site-packages/click/core.py", line 1130, in call
return self.main(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/core.py", line 778, in main
return _main(
File "venv/lib/python3.10/site-packages/typer/core.py", line 216, in _main
rv = self.invoke(ctx)
File "venv/lib/python3.10/site-packages/click/core.py", line 1657, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "venv/lib/python3.10/site-packages/click/core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "venv/lib/python3.10/site-packages/click/core.py", line 760, in invoke
return __callback(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/main.py", line 683, in wrapper
return callback(**use_params) # type: ignore
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 45, in train_cli
train(config_path, output_path, use_gpu=use_gpu, overrides=overrides)
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 75, in train
train_nlp(nlp, output_path, use_gpu=use_gpu, stdout=sys.stdout, stderr=sys.stderr)
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 124, in train
raise e
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 107, in train
for batch, info, is_best_checkpoint in training_step_iterator:
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 209, in train_while_improving
nlp.update(
File "venv/lib/python3.10/site-packages/spacy/language.py", line 1155, in update
proc.update(examples, sgd=None, losses=losses, **component_cfg[name]) # type: ignore
File "rel_component/scripts/rel_pipe.py", line 133, in update
loss, gradient = self.get_loss(examples, predictions)
File "rel_component/scripts/rel_pipe.py", line 146, in get_loss
gradient = scores - truths
ValueError: operands could not be broadcast together with shapes (4750,17) (1788,17)

Second run :

venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Fetching 1 asset(s)
✔ Asset already exists:
rel_component/assets/annotations.jsonl
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

==================================== data ====================================
Running command: venv/bin/python3 ./scripts/parse_data_generic.py assets/annotations.jsonl data/train.spacy data/dev.spacy data/test.spacy
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
⚠ Could not parse any entities from the JSON file.
ℹ 38 training sentences, 347/347 pos instances.
ℹ 6 dev sentences, 48/48 pos instances.
ℹ 4 test sentences, 28/28 pos instances.
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

============================== train_joint_cpu ==============================
Running command: venv/bin/python3 -m spacy train configs/rel_joint.cfg --output training --paths.train data/train.spacy --paths.dev data/dev.spacy -c ./scripts/custom_functions.py
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Saving to output directory: training
ℹ Using CPU

=========================== Initializing pipeline ===========================
[2023-07-07 16:25:11,601] [INFO] Set up nlp object from config
[2023-07-07 16:25:11,609] [INFO] Pipeline: ['tok2vec', 'ner', 'relation_extractor']
[2023-07-07 16:25:11,611] [INFO] Created vocabulary
[2023-07-07 16:25:11,611] [INFO] Finished initializing nlp object
[2023-07-07 16:25:11,760] [INFO] Initialized pipeline components: ['tok2vec', 'ner', 'relation_extractor']
✔ Initialized pipeline

============================= Training pipeline =============================
ℹ Pipeline: ['tok2vec', 'ner', 'relation_extractor']
ℹ Set annotations on update for: ['ner']
ℹ Initial learn rate: 0.001
E # LOSS TOK2VEC LOSS NER LOSS RELAT... ENTS_F ENTS_P ENTS_R REL_MICRO_P REL_MICRO_R REL_MICRO_F SCORE

ℹ Could not determine any instances in doc.
0 0 0.00 79.80 0.00 13.58 8.87 28.95 0.02 51.11 0.04 0.07
⚠ Aborting and saving the final best model. Encountered exception:
ValueError('operands could not be broadcast together with shapes (3190,17)
(186,17) ')
Traceback (most recent call last):
File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "venv/lib/python3.10/site-packages/spacy/main.py", line 4, in
setup_cli()
File "venv/lib/python3.10/site-packages/spacy/cli/_util.py", line 74, in setup_cli
command(prog_name=COMMAND)
File "venv/lib/python3.10/site-packages/click/core.py", line 1130, in call
return self.main(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/core.py", line 778, in main
return _main(
File "venv/lib/python3.10/site-packages/typer/core.py", line 216, in _main
rv = self.invoke(ctx)
File "venv/lib/python3.10/site-packages/click/core.py", line 1657, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "venv/lib/python3.10/site-packages/click/core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "venv/lib/python3.10/site-packages/click/core.py", line 760, in invoke
return __callback(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/main.py", line 683, in wrapper
return callback(**use_params) # type: ignore
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 45, in train_cli
train(config_path, output_path, use_gpu=use_gpu, overrides=overrides)
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 75, in train
train_nlp(nlp, output_path, use_gpu=use_gpu, stdout=sys.stdout, stderr=sys.stderr)
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 124, in train
raise e
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 107, in train
for batch, info, is_best_checkpoint in training_step_iterator:
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 209, in train_while_improving
nlp.update(
File "venv/lib/python3.10/site-packages/spacy/language.py", line 1155, in update
proc.update(examples, sgd=None, losses=losses, **component_cfg[name]) # type: ignore
File "rel_component/scripts/rel_pipe.py", line 133, in update
loss, gradient = self.get_loss(examples, predictions)
File "rel_component/scripts/rel_pipe.py", line 146, in get_loss
gradient = scores - truths
ValueError: operands could not be broadcast together with shapes (3190,17) (186,17)

Third run :

venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Fetching 1 asset(s)
✔ Asset already exists:
rel_component/assets/annotations.jsonl
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

==================================== data ====================================
Running command: venv/bin/python3 ./scripts/parse_data_generic.py assets/annotations.jsonl data/train.spacy data/dev.spacy data/test.spacy
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
⚠ Could not parse any entities from the JSON file.
ℹ 36 training sentences, 314/314 pos instances.
ℹ 8 dev sentences, 70/70 pos instances.
ℹ 4 test sentences, 39/39 pos instances.
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")

============================== train_joint_cpu ==============================
Running command: venv/bin/python3 -m spacy train configs/rel_joint.cfg --output training --paths.train data/train.spacy --paths.dev data/dev.spacy -c ./scripts/custom_functions.py
venv/lib/python3.10/site-packages/torch/cuda/init.py:546: UserWarning: Can't initialize NVML
warnings.warn("Can't initialize NVML")
ℹ Saving to output directory: training
ℹ Using CPU

=========================== Initializing pipeline ===========================
[2023-07-07 16:47:46,318] [INFO] Set up nlp object from config
[2023-07-07 16:47:46,325] [INFO] Pipeline: ['tok2vec', 'ner', 'relation_extractor']
[2023-07-07 16:47:46,327] [INFO] Created vocabulary
[2023-07-07 16:47:46,328] [INFO] Finished initializing nlp object
[2023-07-07 16:47:46,472] [INFO] Initialized pipeline components: ['tok2vec', 'ner', 'relation_extractor']
✔ Initialized pipeline

============================= Training pipeline =============================
ℹ Pipeline: ['tok2vec', 'ner', 'relation_extractor']
ℹ Set annotations on update for: ['ner']
ℹ Initial learn rate: 0.001
E # LOSS TOK2VEC LOSS NER LOSS RELAT... ENTS_F ENTS_P ENTS_R REL_MICRO_P REL_MICRO_R REL_MICRO_F SCORE

ℹ Could not determine any instances in doc.
ℹ Could not determine any instances in any docs - can not make any
predictions.
⚠ Aborting and saving the final best model. Encountered exception:
AssertionError()
Traceback (most recent call last):
File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "venv/lib/python3.10/site-packages/spacy/main.py", line 4, in
setup_cli()
File "venv/lib/python3.10/site-packages/spacy/cli/_util.py", line 74, in setup_cli
command(prog_name=COMMAND)
File "venv/lib/python3.10/site-packages/click/core.py", line 1130, in call
return self.main(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/core.py", line 778, in main
return _main(
File "venv/lib/python3.10/site-packages/typer/core.py", line 216, in _main
rv = self.invoke(ctx)
File "venv/lib/python3.10/site-packages/click/core.py", line 1657, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "venv/lib/python3.10/site-packages/click/core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "venv/lib/python3.10/site-packages/click/core.py", line 760, in invoke
return __callback(*args, **kwargs)
File "venv/lib/python3.10/site-packages/typer/main.py", line 683, in wrapper
return callback(**use_params) # type: ignore
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 45, in train_cli
train(config_path, output_path, use_gpu=use_gpu, overrides=overrides)
File "venv/lib/python3.10/site-packages/spacy/cli/train.py", line 75, in train
train_nlp(nlp, output_path, use_gpu=use_gpu, stdout=sys.stdout, stderr=sys.stderr)
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 124, in train
raise e
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 107, in train
for batch, info, is_best_checkpoint in training_step_iterator:
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 232, in train_while_improving
score, other_scores = evaluate()
File "venv/lib/python3.10/site-packages/spacy/training/loop.py", line 287, in evaluate
scores = nlp.evaluate(dev_corpus(nlp))
File "venv/lib/python3.10/site-packages/spacy/language.py", line 1415, in evaluate
for eg, doc in zip(examples, docs):
File "venv/lib/python3.10/site-packages/spacy/language.py", line 1574, in pipe
for doc in docs:
File "venv/lib/python3.10/site-packages/spacy/util.py", line 1653, in _pipe
yield from proc.pipe(docs, **kwargs)
File "spacy/pipeline/trainable_pipe.pyx", line 79, in pipe
File "venv/lib/python3.10/site-packages/spacy/util.py", line 1672, in raise_error
raise e
File "spacy/pipeline/trainable_pipe.pyx", line 75, in spacy.pipeline.trainable_pipe.TrainablePipe.pipe
File "rel_component/scripts/rel_pipe.py", line 90, in predict
scores = self.model.predict(docs)
File "venv/lib/python3.10/site-packages/thinc/model.py", line 315, in predict
return self._func(self, X, is_train=False)[0]
File "venv/lib/python3.10/site-packages/thinc/layers/chain.py", line 55, in forward
Y, inc_layer_grad = layer(X, is_train=is_train)
File "venv/lib/python3.10/site-packages/thinc/model.py", line 291, in call
return self._func(self, X, is_train=is_train)
File "rel_component/scripts/rel_model.py", line 78, in instance_forward
pooled, bp_pooled = pooling(entities, is_train)
File "venv/lib/python3.10/site-packages/thinc/model.py", line 291, in call
return self._func(self, X, is_train=is_train)
File "venv/lib/python3.10/site-packages/thinc/layers/reduce_mean.py", line 19, in forward
Y = model.ops.reduce_mean(cast(Floats2d, Xr.data), Xr.lengths)
File "thinc/backends/numpy_ops.pyx", line 318, in thinc.backends.numpy_ops.NumpyOps.reduce_mean
AssertionError

If I'm doing anything wrong, could you tell me what ?

If there is nothing wrong with what I'm doing, could you :

Provide configuration files and a small tutorial for NER then separate rel model training ?
Ensure that the component behaviour is stable ?

And if not possible :

Indicate a way to use Prodigy annotations for rel model training with another component / method ?

I don't think annotating more data will solve the issue ? I'm looking for a quick solution if possible.

svlandeg · 2023-07-10T18:00:20Z

svlandeg
Jul 10, 2023
Maintainer

Hi Stella,

Thanks, that's all very useful.

Of course, my final training data won't contain 21 examples, but hopefully more than a thousand. But if the workflow is not working or if there is anything wrong with the annotation process, I think it is not wise to focus on the annotation.

Totally - I agree. I wanted to check whether you already have more data annotated right now, to verify whether the errors still occur if you'd use all of the data available.

First, the model training part works perfectly on very few data inputs (maybe because there is not enough data to train a rel model, so it may be skipped). Then, on a bigger dataset, it could trigger the "occasionaly' error (like if the training data was not correctly parsed, yet it is valid). On the biggest dataset, it triggers the error described in the thread.

That's interesting. In the case of the "very few data inputs", what do you mean by "works perfectly", while you're also hypothesizing that the training might be skipped? Do you mean that the training runs without error, but the score is really just all 0?

The NER + rel model trained together has issues. The triggered issue is inconsistent : with less data, I was able to train a model once without an issue, and without changing anything except deleting the model and .spacy files, the rerun triggered an issue.

Right, that's good to know. To me, this again points to the fact that the REL breaks down when the NER model is being trained and is not yet stable.

could you [...] Ensure that the component behaviour is stable ?

I want to clarify once more that, as stated in the video tutorial, the REL project serves as a tutorial on how to implement a custom spaCy component with a custom Thinc model. It is definitely not meant to serve as some kind of stable REL component. If we do develop an actual proper REL component in the future, we would include it in spaCy's core code base, not in the "tutorials" section of our example projects.

I'm saying this to clarify that our main motivation here is not to develop a stable REL component. It's to help you get going with your specific use-case and to give you pointers on how to implement your own custom NLP solution with spaCy.

If I'm doing anything wrong, could you tell me what ?

The approach you've tried so far could have worked, but at this point I suggest a more phased approach that lets us tackle your challenges one by one. You've already demonstrated that the NER model training works, so that would be the first step: train an NER model from your data, and store the model to disk (let's say it's in a directory ner-trained)

In the next step then, we'll train the REL model only, while keeping the ner fixed. To do so, you need this in your config file:

[nlp]
pipeline = ["tok2vec","ner","relation_extractor"]

...
[components.ner]
source = "ner-trained"
component = "ner"
replace_listeners = ["model.tok2vec"]
(don't add any other information to the `components.ner` block)

[components.tok2vec]
factory = "tok2vec"
... (everything as usual)

[components.relation_extractor]
factory = "relation_extractor"
... (everything as usual)

[training]
frozen_components = ["ner"]

[corpora.train]
@readers = "Gold_ents_Corpus.v1"

[corpora.dev]
@readers = "Gold_ents_Corpus.v1"

What this effectively does, is that a new pipeline is composed that grabs the trained "ner" component and "sources" it, keeping it frozen so only the REL component actually gets updated/trained.

The replace_listeners bit ensures that the tok2vec model from the ner component is kept private to the NER model, so that it doesn't get updated either and there's no conflict with the tok2vec component you want to use for REL.

Finally, we use the Gold_ents_Corpus.v1 reader in this step to let the REL train from the gold entities that are provided in your data.

Once this pipeline is trained (🤞) and saved to disk, to let's say rel-trained, you should be able to load it in with

spacy.load("rel-trained")

and run it on text, at which point the pipeline will create predictions for both the NER and the REL components.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relation extraction component - assertion error raised #12755

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 2 comments 4 replies

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Relation extraction component - assertion error raised #12755

stellaires Jun 27, 2023

Issue

Code and configuration

CLI

Data sample

Replies: 2 comments · 4 replies

svlandeg Jun 28, 2023 Maintainer

stellaires Jun 30, 2023 Author

svlandeg Jul 4, 2023 Maintainer

stellaires Jul 6, 2023 Author

stellaires Jul 7, 2023 Author

svlandeg Jul 10, 2023 Maintainer

stellaires
Jun 27, 2023

Replies: 2 comments 4 replies

svlandeg
Jun 28, 2023
Maintainer

stellaires Jun 30, 2023
Author

svlandeg Jul 4, 2023
Maintainer

stellaires Jul 6, 2023
Author

stellaires Jul 7, 2023
Author

svlandeg
Jul 10, 2023
Maintainer