valx/google_drive_deduplicate

google_drive_deduplicate

A collection of Python scripts to detect and remove duplicate files across your Google Drive account

Intro

The script is based on the Google Drive API Python quickstart example (https://developers.google.com/drive/api/v3/quickstart/python).

The script walks through your Drive account and finds duplicate files by comparing the MD5 checksum of each file. Files in the Google Drive trash are ignored.
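As an illustration of the listing step, here is a minimal sketch of how non-trashed files could be fetched page by page from the Drive API v3. It assumes a `service` object built as in the quickstart; the function name `list_untrashed_files` and the chosen `fields` are hypothetical and not taken from gdrivededup.py.

```python
def list_untrashed_files(service, page_size=1000):
    """Yield a metadata dict for every non-trashed file visible to `service`.

    `service` is assumed to be a Drive v3 client as built in the quickstart.
    """
    page_token = None
    while True:
        response = service.files().list(
            q="trashed = false",  # skip files in the trash
            fields="nextPageToken, files(id, name, md5Checksum)",
            pageSize=page_size,
            pageToken=page_token,
        ).execute()
        for item in response.get("files", []):
            yield item
        # A missing nextPageToken means the last page has been read.
        page_token = response.get("nextPageToken")
        if not page_token:
            break
```

Filtering with `q="trashed = false"` on the server side avoids downloading metadata for trashed files only to discard it later.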

Prerequisites

  1. Python 2.6 or greater
  2. Required libraries installed (as listed in file requirements.txt)

Usage

  1. Create a credentials.json file in the project folder (see step 1 at https://developers.google.com/drive/api/v3/quickstart/python)
  2. Create a config.json file in the project folder (based on cofig.example.json)
  3. Run: python .\gdrivededup.py
  4. On the first execution the script will ask you to log in; the resulting token is saved as token.pickle (this file holds your access credentials, so store it securely or delete it when it is no longer needed)
  5. On success, the script prints a list of dictionaries with information about each duplicated file (if there are no duplicates, nothing is printed)
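The output described in step 5 could be produced by grouping file metadata on its checksum. The following is a minimal sketch under stated assumptions, not the script's actual code: the function name `find_duplicates` is hypothetical, and each input dictionary is assumed to carry the Drive API `md5Checksum` field where one exists (entries without it, such as Google-native documents, are skipped).

```python
from collections import defaultdict


def find_duplicates(files):
    """Return every file dict whose md5Checksum is shared with another file."""
    by_checksum = defaultdict(list)
    for item in files:
        checksum = item.get("md5Checksum")
        if checksum is None:
            # No checksum available, so the file cannot be compared.
            continue
        by_checksum[checksum].append(item)

    duplicates = []
    for group in by_checksum.values():
        if len(group) > 1:  # two or more files with identical content
            duplicates.extend(group)
    return duplicates


sample = [
    {"id": "a", "name": "report.pdf", "md5Checksum": "111"},
    {"id": "b", "name": "report (1).pdf", "md5Checksum": "111"},
    {"id": "c", "name": "notes.txt", "md5Checksum": "222"},
    {"id": "d", "name": "A Google Doc"},
]
print(find_duplicates(sample))
```

With the sample data, only the two PDF entries are reported, since they are the only ones that share a checksum.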

TODO

Potential Improvements for this repo:

  1. save the list as a CSV file
  2. document the Python file
  3. package the script into an executable file
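For the first item, one possible approach uses the standard library `csv` module. This is only a sketch: the function name `save_duplicates_csv` and the column names are assumptions, and the `newline=""` argument requires Python 3.

```python
import csv


def save_duplicates_csv(duplicates, path, fieldnames=("id", "name", "md5Checksum")):
    """Write a list of file dictionaries to `path` as CSV with a header row."""
    with open(path, "w", newline="") as handle:
        writer = csv.DictWriter(
            handle,
            fieldnames=list(fieldnames),
            extrasaction="ignore",  # drop keys that are not listed columns
        )
        writer.writeheader()
        writer.writerows(duplicates)
```

Keys missing from a dictionary are written as empty cells, so entries with incomplete metadata do not raise an error.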
