-
Notifications
You must be signed in to change notification settings - Fork 1
Open Ticket
--- | Open ticket |
---|---|
Lead | Collect all available information on the data set. |
Do | Document dataset in a new ticket (Issue) |
Measure | Can other people find the dataset? Do they understand what data is being proposed? |
Opening a ticket is the second phase in the input step to ingesting a new data set. After finding a possible dataset you need to collect all relevant information for an evaluation. While this is going to vary dataset by dataset, generally there are several expected elements that you will need to find.
Data information could include:
- url for data package (should be a landing page not the download url)
- Example: ISCN3
- full suggested citation
- Example:
Nave, L., K. Johnson, C. van Ingen, D. Agarwal, M. Humphrey, and N. Beekwilder. 2022. International Soil Carbon Network version 3 Database (ISCN3) ver 1. Environmental Data Initiative. https://doi.org/10.6073/pasta/cc751923c5576b95a6d6a227d5afe8ba (Accessed 2024-06-03).
- Example:
- abstract given to describe the data
- Example:
The ISCN is an international scientific community devoted to the advancement of soil carbon research. The ISCN manages an open-access, community-driven soil carbon database. This is version 3-1 of the ISCN Database, released in December 2015. It gathers 38 separate data set contributions, totaling 67,112 sites with data from 71,198 soil profiles and 431,324 soil layers. For more information about the ISCN, its scientific community and resources, data policies and partner networks visit: http://iscn.fluxdata.org/. For information about processes used to construct the DB: https://iscn.fluxdata.org/data/data-information/.
- Example:
- links to any papers associated with the data
- Example:
No obvious papers found.
- Example:
- how the dataset was found
- Example:
Conversation by KTB at workshop in 2017.
- Example:
- any points of contact known for the dataset
- Example:
Dr J Doe at XYZ University is listed point of contact. KTB knows them and can facilitate questions.
- Example:
- data license and intellectual rights. Many modern data sets will be licensed under the Creative Commons system but not all.
- Example:
Data use: Use of data assumes that the user has joined the ISCN and will act according to the mission and guidelines stated in the Charter. Phase II data are available to select users on the condition that contributors will be invited to assist with interpretation, presentations, and publications, if these tasks are being performed by a user other than the original contributor of the data (such as a colleague of the contributor). When using Phase III data, users are encouraged to contact original data contributors for questions or clarifications; users agree to cite and/or acknowledge the data contributors and the ISCN in presentations and publications. References of all accepted papers developed from ISCN data products must be sent to the ISCN Coordinator. Although these policies are intended to encourage communication between data contributors and data users, the interpretation of data and adherence to fair use guidelines are ultimately the responsibility of the user of ISCN products.
- Example:
The title of the issue should include the name of the data set (if applicable) or the last name of the first author followed by the year of publication.
The issue should include the above details and any missing information noted. You are striving to provide enough information for other people on project to be able to answer the following questions. Can I find the dataset? Do I know where to go for information about the data?
-
Workflow for new data additions
- Find data
- Open ticket
- Evaluation
- Annotations
- Read scripts
- Integration
- QA/QC
- Merge to main
- Publish
- Data collections