-
Notifications
You must be signed in to change notification settings - Fork 103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KeyError: 'lemma' #48
Comments
There are no Lemmas in the training data. So there can't be lemmatizer?! Can't i use the the other parts of the pipeline?
the following error occurs:
|
I get the same error when trying to train the lemmatizer:
|
I am also getting this error, even though the .conllu file I am loading has the lemmas in the second column, as I think should be expected. Has anyone found a solution to this error? Is it a problem with the training data or Trankit itself? |
Never mind, it appears that the lemmatization pipeline has issues handling "_" in the lemma slot of conllus. I ended up just deleting these sentences from consideration. I don't know if that is the same issue others have faced, but I hope this helps! |
Following the code from https://trankit.readthedocs.io/en/latest/training.html#training-a-lemmatizer i get a KeyError: 'lemma':
The recent version from https://github.com/UniversalDependencies/UD_Thai-PUD is used as trainings and development data.
The text was updated successfully, but these errors were encountered: