Skip to content

How to convert jsonl format files into spacy format files? #13074

Discussion options

You must be logged in to vote

The JSON converter isn't a generic JSON converter. It's only intended for converting a specific spacy v2 JSON format, so here it unfortunately ends up producing an empty .spacy file, which is what leads to this initialization error (there is no data).

For this JSONL format, there isn't a built-in converter and your best option may be to write your own converter. In a quick search, I can find a number of examples related to converting doccano to spacy, but I haven't tried any of them myself, and they may produce spacy v2 JSON format. The v2 JSON format can still be used, but then you'd need to an additional conversion step to convert it for spacy v3 with the current spacy convert.

To help …

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Rainsmumu
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / cli Feature: Command-line interface
2 participants