The purpose of this repo is to generate Sample and Real World Taxonomy Data that can be used when testing and developing on the edx-platform.
Running this script will do the following:
- 2 Test Organizations will be created
- For each organization created, a sample course is created that is populated with a course export that contains a variety of section/subsections/units and components
- For each of these created organizations the following taxonomies will be created:
- a disabled taxonomy with 10 Tags
- an enabled flat taxonomy with 5000 Tags
- an enabled hierarchical taxonomy with three levels and 4^x tags per level (4 root tags, each with 16 child tags, each with 64 grandchild tags)
- a small enabled taxonomy with 2 levels with 2 Tags each
- A multi org Taxonomy is created and enabled/used by both orgs
- (Optional) A 4 level Taxonomy containing data obtained from Open Canada Taxonomy
- (Optional) A 3 level Taxonomy containing data obtained from LightCast Skills Taxonomy
- Once the Taxonomies and their Tags have been created, the script will Tag each of the courses along with all the components they contain with 1 of each Tag from the the above
Note: This script is designed to be idempotent. Meaning that the end state is the same every time you run it. So if you make modifications to the sample courses on Studio or the Taxonomy data in the shell and run this script again, it will reset all your changes.
-
To begin, clone this repo to your host computer.
- If you use Tutor devstack, mount it using this comamnd:
tutor mounts add cms:/host/path/to/taxonomy-sample-data:/openedx/taxonomy-sample-data
- If you use "devstack", put the cloned repo inside a directory so it can be accessed from within the devstack (eg:
/edx/src/
), and set theTAXONOMY_SAMPLE_PATH
variable inside thegenerate.py
file to point to the cloned repo inside your devstack/tutor environment, eg:/edx/src/taxonomy-sample-data
- If you use Tutor devstack, mount it using this comamnd:
-
Set the
USER_EMAIL
variable ingenerate.py
to the user the courses/units will be created under. It defaults to[email protected]
. -
(Optional) If you would not like to include taxonomy data from real world examples, such as:
Then set the following flags to
False
ingenerate.py
accordingly:IMPORT_OPEN_CANADA_TAXONOMY = True IMPORT_LIGHTCAST_SKILLS_TAXONOMY = True
-
To run the script, enter the CMS shell (
tutor dev run cms bash
) and run the following command:python manage.py cms shell < /openedx/taxonomy-sample-data/generate.py
This includes a copy of the "Skills and Competencies Taxonomy Data" from Employment and Social Development Canada, which is under the Open Government Licence - Canada.
This includes a copy of the Lightcast Open Skills Taxonomy, which is "free for individual and not-for-profit use" and is available under the Lightcast Open Skills Terms of Use.