Release 1.0.0 #150
I've started a release notes section in the docs to outline the process of setting up spin and the initial creation of the schemas. This section can also be used later for further code development notes.
@JoanneBogart, Opinion on what should we change
The default production schema could also be in the code; at the moment it's just an
Could also change the variable name to not include |
Yes, we should change the variable name to not include |
I think there is enough now to start a review, mainly of the docs. Not finalized on my end, but close.
I'll start looking seriously today.
I haven't looked at everything yet but thought you might want to address the things I've found so far, all pretty minor, which all have to do with code under src/dataregistry. I've also looked at src/dataregistry_cli and tests, which seem fine.
I've updated the create script slightly to give somewhat better output when running with sqlite (and also addressed the comments you made). With regard to the privileges, they are not really tested. I think on the spin instance I added Should it be like
About your table above - for now, as long as we run the script at NERSC using |
I've made some minor comments about the tutorials (which you probably can't yet see since I haven't submitted a review) just from looking at the diffs. I intend to also look at the full text of the tutorials at some point but it's not critical that it happen in time for the release. I started looking at the documentation but thought you might still be working on it so I'll hold off until you tell me you're ready.
They are ready to look at, though most of the docs are the notebooks.
In addition to comments on specific lines of code I noticed a few other things:
- installation.rst needs an overhaul
- do we want to mention reg_reader and reg_writer in there and expected use of them?
- in tutorial_CLI could we change the example owner from "DESC Generic Working Group" to something without blanks? E.g. text could be "For example to see all the datasets from the DESC Generic Working Group (GWG)..." and then value for owner could be GWG. It should work as it stands, but I'd rather not encourage people to use values with embedded spaces.
- in tutorial_python.rst there is no link to the production_schema tutorial
I haven't yet looked at the tutorials. Instructions for installation and use of the tutorials will change as soon as we have made the release and uploaded to PyPI.
I'm thinking that we don't need to get all the documentation-type changes, including changes to tutorials, in before we make a release, as long as we believe the code and database structure are ok and usable (that includes permanent backing store for the database). Once we've made a release we can finish up any remaining documentation issues in a separate PR. For now, I'd just say fix the things that seem most essential (I'm not even sure there are any that are truly essential; I'd have to look) or are quick to do.
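On the point about owner values with embedded spaces, here is a minimal sketch of why an unquoted value is fragile on a command line. It uses only Python's standard shlex module. The --owner flag is illustrative and is not taken from the dataregistry CLI, and the helper name has_embedded_whitespace is hypothetical.

```python
import shlex


def has_embedded_whitespace(value: str) -> bool:
    """Return True if the value contains any whitespace character."""
    return any(ch.isspace() for ch in value)


# Illustrative command-line fragments; "--owner" is an assumed flag name.
unquoted = shlex.split("--owner DESC Generic Working Group")
quoted = shlex.split('--owner "DESC Generic Working Group"')
short = shlex.split("--owner GWG")

print(unquoted)  # without quotes, the owner value is split into several tokens
print(quoted)    # with quotes, the owner value survives as a single token
print(short)     # a value without blanks needs no quoting at all
```

A value such as GWG behaves the same whether or not the user remembers to quote it, which is the motivation for the suggested change.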
I used the create_registry_schema script to make a production-like |
I've removed I've shifted some of the comments here into an issue so we don't forget some final tweaks we can do later. Once this code is approved, we can make the release.
This looks fine. There are things we need to fix but nothing that is important for the release. I successfully made some gcr catalog entries in the alt_production schema (necessarily using the reg_admin account) and also made entries in alt_working with reg_writer.
As soon as this PR is merged we should get rid of existing lsst_desc_* schemas and make new ones.
Placeholder branch for release
Todo
- site, and update src/dataregistry/site_config/site_rootdir.yaml. Updated to /global/cfs/cdirs/lsst/utilities/data-registry
- SCHEMA_VERSION in src/dataregistry/db_basic.py). Default schema names are now at src/dataregistry/schema/default_schema_names.yaml, one for WORKING and one for PRODUCTION
- tutorial_working and tutorial_production, will update notebooks to match
- register_datasets.ipynb
- datasets_deeper_look.ipynb
- production_schema.ipynb
- query_datasets.ipynb
- pipelines.ipynb
- installation
- tutorial
- reference
- dev_notes
- contact
- schema and owner_type directories. These should be static, and that level of the directory structure should have no write access for regular users. New script in scripts folder called create_schema_dirs.sh that creates the initial schema directories and their owner_type groups but needs some revision.
- python_bleed.
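The static schema and owner_type directory levels mentioned in the todo list can be sketched in Python as follows. This is not the create_schema_dirs.sh script, only a rough illustration that assumes a `<root>/<schema>/<owner_type>` layout. The function names, the owner_type values used in the demo, and the 0o755 mode are all assumptions rather than anything taken from this PR, and the sketch assumes a POSIX filesystem.

```python
import os
import stat
import tempfile
from pathlib import Path


def create_schema_dirs(root, schemas, owner_types, mode=0o755):
    """Create <root>/<schema>/<owner_type> directories with a fixed mode.

    Hypothetical helper: the layout and the default mode are assumptions.
    """
    created = []
    for schema in schemas:
        schema_dir = Path(root) / schema
        schema_dir.mkdir(parents=True, exist_ok=True)
        # Set the mode explicitly, since mkdir() is subject to the umask.
        os.chmod(schema_dir, mode)
        for owner_type in owner_types:
            owner_dir = schema_dir / owner_type
            owner_dir.mkdir(exist_ok=True)
            os.chmod(owner_dir, mode)
            created.append(owner_dir)
    return created


def group_or_other_writable(path):
    """Return True if group or other users have write permission on path."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    return bool(mode & (stat.S_IWGRP | stat.S_IWOTH))


# Demo in a throwaway directory; the owner_type values are placeholders.
with tempfile.TemporaryDirectory() as tmp:
    dirs = create_schema_dirs(
        tmp, ["tutorial_working", "tutorial_production"], ["user", "group"]
    )
    for d in dirs:
        print(d.relative_to(tmp), "writable by group/other:", group_or_other_writable(d))
```

With mode 0o755 only the account that owns the directories can add or remove entries at these two levels, which matches the intent that regular users should not be able to change them. Group ownership of the owner_type directories is left out of the sketch.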