Parsing the PLOS corpus dump of fall 2016 (Python + R)

thomas-keller/plos_corpus

PLoS Parsing

A mix of Python scripts (first stage: fixing problems in the XML, parsing it, and calculating initial statistics) and R scripts (later statistics and plotting).

Comments and help are welcome. This is all at a very early stage: a fair number of articles are missing or fail to parse, and everything here could be made more elaborate.
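
For orientation, the first Python stage might look roughly like the sketch below. It is not the code in this repository: it assumes the articles are JATS-style XML with an article-title element and a body element, and the names parse_article and summarize, along with the word count used as an "initial statistic", are hypothetical stand-ins. Articles whose XML does not parse are counted rather than silently dropped.

```python
import xml.etree.ElementTree as ET


def parse_article(xml_text):
    """Return basic statistics for one article, or None if the XML is malformed.

    Assumes JATS-style tags (article-title, body); hypothetical helper.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None  # caller decides how to report the failure

    # itertext() also collects text inside nested markup such as <italic>.
    title_el = root.find(".//article-title")
    title = "".join(title_el.itertext()).strip() if title_el is not None else ""

    body_el = root.find(".//body")
    body_text = " ".join(body_el.itertext()) if body_el is not None else ""

    return {"title": title, "n_words": len(body_text.split())}


def summarize(xml_documents):
    """Parse many documents; return (list of stats, number of failures)."""
    parsed, failed = [], 0
    for xml_text in xml_documents:
        stats = parse_article(xml_text)
        if stats is None:
            failed += 1
        else:
            parsed.append(stats)
    return parsed, failed


# Two tiny inline documents stand in for files from the corpus dump.
GOOD = (
    "<article><front><article-meta><title-group>"
    "<article-title>A <italic>test</italic> title</article-title>"
    "</title-group></article-meta></front>"
    "<body><p>One two three.</p><p>Four five.</p></body></article>"
)
BAD = "<article><body><p>unclosed</body></article>"  # mismatched tag

parsed, failed = summarize([GOOD, BAD])
```

The per-article dictionaries could then be written to a CSV file for the R scripts to pick up, with the failure count giving a rough sense of how much of the corpus is still missing.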
