You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
During this procedure the manager more or less "hangs", only changing the status bar to display the paper being processed. This means that to "pause" and "continue" the procedure (for example, mining a big corpus of papers over several days), one needs to:
Remember the name of current paper.
Terminate the PDFAnalyzer process.
Delete the current paper folder - all the previous ones will be fully functional, but processed pages in current one will be lost.
Start the PDFAnalyzer again.
Add the papers again, starting from the current one (or from the beginning to receive messages about existing papers).
This is somewhat inconvenient, so it would be handy to:
Implement proper pausing and resuming of extraction procedure without process termination.
Implement resuming from the middle of a paper. One should keep in mind that some parameters (number of pages, rotated pages, etc) are not stored anywhere until metadata.json is created.
Update: exporting works differently, but suffers from pretty much same problems.
The text was updated successfully, but these errors were encountered:
Current procedure for adding new papers is roughly as following:
During this procedure the manager more or less "hangs", only changing the status bar to display the paper being processed. This means that to "pause" and "continue" the procedure (for example, mining a big corpus of papers over several days), one needs to:
This is somewhat inconvenient, so it would be handy to:
Update: exporting works differently, but suffers from pretty much same problems.
The text was updated successfully, but these errors were encountered: