Differing NER Results when Disabling Pipeline Components #9364
-
Hello! I am interested in performing ner using the
And I only find this to be true 75% of the time over 100 data observations. For reference, the spaCy version is 3.1.3 and the model version is 3.1.0. Any advice? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
Hi, the difference is probably related to sentence boundaries. The |
Beta Was this translation helpful? Give feedback.
Hi, the difference is probably related to sentence boundaries. The
ner
component doesn't predict entities that cross sentence boundaries, so if you leave out theparser
, which sets sentence boundaries, the exact samener
component can make different predictions because the input docs are different. You might be able to check and see that thedocs_ner
cases have more entities with periods in them likeSome Company. Group
or similar?