Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implemented OrcaVault tsa schema spreadsheet Google LIMS #20

Merged
merged 1 commit into from
Jan 12, 2025

Conversation

victorskl
Copy link
Member

@victorskl victorskl commented Jan 12, 2025

  • Story: Let Glue the Google LIMS! (continue)
    Now that we have tsa.spreadsheet_google_lims in staging data area
    by Glue ETL job in Implemented Glue ETL spreadsheet Google LIMS processing #19, we can source this data table with dbt to further feed into
    the downstream warehouse layers in psa and vault schema.
  • Technical steps are now mainly inherited by the framework implemented in PR Implemented OrcaVault tsa schema #15.
    Hence, this step becomes pretty straight forward task and template code.

* Story: Let Glue the Google LIMS! (continue)
  Now that we have `tsa.spreadsheet_google_lims` in staging data area
  by Glue ETL job in #19, we can source this data table with dbt to further feed into
  the downstream warehouse layers in psa and vault schema.
* Technical steps are now mainly inherited by the framework implemented in PR #15.
  Hence, this step becomes pretty straight forward task and template code.
@victorskl victorskl force-pushed the implement-tsa-schema-spreadsheet-google-lims branch from 32fc061 to d3a4a67 Compare January 12, 2025 00:49
@victorskl victorskl self-assigned this Jan 12, 2025
@victorskl victorskl added documentation Improvements or additions to documentation enhancement New feature or request labels Jan 12, 2025
@victorskl victorskl added this pull request to the merge queue Jan 12, 2025
Merged via the queue into main with commit bbba969 Jan 12, 2025
4 checks passed
@victorskl victorskl deleted the implement-tsa-schema-spreadsheet-google-lims branch January 12, 2025 00:55
victorskl added a commit that referenced this pull request Jan 12, 2025
* Story: Let Glue the Google LIMS! (continue)
  As discussed in #20, we now source `tsa.spreadsheet_google_lims` staging data
  table with dbt and feed into the downstream warehouse psa schema.
* Technical steps are now mainly inherited by the framework implemented in PR #17.
* With psa, Google LIMS is incrementally loaded with differential data records per
  daily scheduled run with dbt ELT job.
* Chiefly note; since Google LIMS preserved "timestamp" date column, we made use of it
  as (replay) historical time for the row record. Warehouse load datetime is derived
  from this timestamp column as initial cutover data extraction date.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant