Python script to scrape viewership data for every item in an Internet Archive collection.

rachelwhaley/internet-archive-collection-stats

internet-archive-collection-stats

Python script to scrape viewership data for every item in an Internet Archive collection.

To run

First make sure you have the dependencies installed:

pip install -r requirements.txt

Then run:

python main.py

Usage

Copy the URL or identifier of a collection from the Internet Archive and enter it when prompted.

For example: https://archive.org/details/al-larvick-fund
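Since the prompt accepts either a URL or a bare id, the input has to be reduced to a collection identifier at some point. The sketch below shows one way that could be done; the function name `collection_id` is hypothetical and is not taken from `main.py`, and it assumes collection URLs follow the `/details/<identifier>` pattern of the example above.

```python
from urllib.parse import urlparse


def collection_id(user_input: str) -> str:
    """Return the collection identifier from a details URL or a bare id.

    Hypothetical helper for illustration; not the script's actual code.
    """
    text = user_input.strip()
    # No scheme: treat the input as a bare identifier.
    if "://" not in text:
        return text.strip("/")
    # Split the URL path into its non-empty segments.
    parts = [p for p in urlparse(text).path.split("/") if p]
    # Assumed URL shape: /details/<identifier>[/...]
    if len(parts) >= 2 and parts[0] == "details":
        return parts[1]
    raise ValueError(f"Not a recognised collection URL: {user_input!r}")
```

With this sketch, both `https://archive.org/details/al-larvick-fund` and `al-larvick-fund` resolve to the same identifier.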

Sample output

A CSV file with one row for each item in the collection:

| all_time | have_data | last_30day | last_7day | identifier | title | timestamp |
| --- | --- | --- | --- | --- | --- | --- |
| 159 | TRUE | 10 | 6 | ALCFJamesKilgoreClip5217 | ALCF James Kilgore Films Clip 5217 | 2021-01-16 14:30:00 |

See definitions of the views fields (all_time, have_data, last_30day, last_7day) here: https://archive.org/services/docs/api/views_api.html
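To illustrate how the views fields map onto the output columns, here is a minimal sketch that turns one item's view counts into a CSV row. It makes no network calls. The helper names (`build_row`, `rows_to_csv`) are hypothetical, the `views` dict is assumed to carry the four fields named above, and treating `timestamp` as the time the data was collected is an assumption. None of this is taken from `main.py`.

```python
import csv
import io
from datetime import datetime

# Column order copied from the sample output.
COLUMNS = ["all_time", "have_data", "last_30day", "last_7day",
           "identifier", "title", "timestamp"]


def build_row(identifier: str, title: str, views: dict, when: datetime) -> dict:
    """Combine one item's metadata and view counts into a CSV row (hypothetical helper)."""
    return {
        "all_time": views.get("all_time", 0),
        # Written as TRUE/FALSE to match the sample output.
        "have_data": str(bool(views.get("have_data", False))).upper(),
        "last_30day": views.get("last_30day", 0),
        "last_7day": views.get("last_7day", 0),
        "identifier": identifier,
        "title": title,
        # Assumption: timestamp records when the data was collected.
        "timestamp": when.strftime("%Y-%m-%d %H:%M:%S"),
    }


def rows_to_csv(rows: list) -> str:
    """Serialise rows to CSV text with a header line."""
    buf = io.StringIO(newline="")
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

Passing the values from the sample row through `build_row` and `rows_to_csv` reproduces that row in comma-separated form.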
