Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The Big List of Things to Scrape #36

Open
31 of 57 tasks
enigmalea opened this issue Aug 26, 2022 · 0 comments
Open
31 of 57 tasks

The Big List of Things to Scrape #36

enigmalea opened this issue Aug 26, 2022 · 0 comments

Comments

@enigmalea
Copy link
Member

enigmalea commented Aug 26, 2022

As promised here's the big list of things to scrape AO3 for, as based on the python API, with some of my wishes as well. Feel free to rename things, or to add to this list as we go.

Works

  • workid_from_url - grabs the workid from an AO3 URL
  • Work - sets the fic you're working with using the ID
  • work.title
  • work.url - if you have a workid, but not a URL, it will recompile the URL for you
  • work.series
  • work.series.id - gets the series ids associated with the fic
  • work.authors
  • work.authors.username
  • work.tags
  • work.nchapters - number of chapters posted
  • work.expected_chapters
  • work.fandoms
  • work.relationships
  • work.characters
  • work.language
  • work.categories
  • work.summary
  • work.rating
  • work.get_images - gets the images posted in the fic
  • work.date_published
  • work.date_updated
  • work.status
  • work.warnings
  • work.words
  • work.hits
  • work.kudos
  • work.bookmarks
  • work.comments - number of comments only

Chapters

  • chapter.id
  • chapter.url
  • chapter.title
  • chapter.summary
  • chapter.number
  • chapter.words
  • chapter.number

Series

  • series.id
  • series.url
  • series.name
  • series.words
  • series.creators
  • series.creators.username
  • series.description
  • series.notes
  • series.complete
  • series.series_begun
  • series.series_update
  • series.nworks - number of works in the series
  • series.work_list - a list of every work in the series
  • series.work_list.title - a list of only the titles in the series
  • series.work_list.url - a list of the URLs of the fics in the series

User

  • user.bio
  • user.works
  • user.bookmarks
  • user.url
  • user.get_avatar

Errors

  • InvalidIDError - ID for fic/series does not exist
  • AuthError - must be registered user to see work
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant