Similarly to #147, create the importer for SwissInfo Radio bulletins.
The WWII radio bulletin data is already on the EPFL NAS. Unfortunately, it is in text-embedded PDF format, so the OCR text first needs to be extracted from the PDFs.
Action points for this issue are:
Look at how the OCR text can be extracted from the PDFs and what output formats that would produce.
TETML? Other tools? Look at libraries that do this.
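As one possible starting point for the first action point, here is a minimal sketch that reads the embedded text layer with pdfminer.six. The choice of library is an assumption (the issue does not name one), and the names `TextBlock`, `iter_text_blocks` and `blocks_to_json`, as well as the JSON layout, are hypothetical and only meant to show what kind of output such a tool could give (text plus page number and bounding box).

```python
import json
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass
class TextBlock:
    page: int    # 1-based page number
    bbox: tuple  # (x0, y0, x1, y1) in PDF points, origin at the bottom-left
    text: str


def iter_text_blocks(pdf_path: str) -> Iterator[TextBlock]:
    """Yield the embedded text blocks of a PDF with their coordinates."""
    # Imported lazily so the rest of the module loads without pdfminer.six.
    from pdfminer.high_level import extract_pages
    from pdfminer.layout import LTTextContainer

    for page_number, layout in enumerate(extract_pages(pdf_path), start=1):
        for element in layout:
            # Only text containers carry the embedded OCR text.
            if isinstance(element, LTTextContainer):
                text = element.get_text().strip()
                if text:
                    bbox = tuple(round(v, 2) for v in element.bbox)
                    yield TextBlock(page_number, bbox, text)


def blocks_to_json(blocks: Iterable[TextBlock]) -> str:
    """Group blocks by page into a JSON string (hypothetical intermediate format)."""
    pages = {}
    for block in blocks:
        pages.setdefault(block.page, []).append(
            {"bbox": list(block.bbox), "text": block.text}
        )
    doc = [{"page": n, "blocks": pages[n]} for n in sorted(pages)]
    return json.dumps(doc, ensure_ascii=False, indent=2)
```

Usage would look like `print(blocks_to_json(iter_text_blocks("bulletin.pdf")))`, where `bulletin.pdf` is a placeholder path. Whether block-level coordinates are enough, or whether line- or word-level output (as TETML-style formats offer) is needed, is still to be decided as part of this issue.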