Skip to content

Latest commit

 

History

History
151 lines (92 loc) · 4.35 KB

README.md

File metadata and controls

151 lines (92 loc) · 4.35 KB

What is CovCheck?

The CovCheck analysis computes a personal genome report indicating 'risk of severe symptoms' (CFR) from COVID-19 infection.

The risk analysis is based on published, peer-reviewed studies: https://www.covid19hg.org/publications/

Personal genome data and age (if provided) is read from a simple 'genome file' in JSON format.

Note that this analyis is work in progress!!!

CI/CD

Python package

Installation and Usage

To install, pull from git:

[email protected]:Geromics/covcheck.git

change to the appropriate directory:

cd covcheck

and run the code:

python3 covid_genetic_check.py -h

usage: check.py [-h] [--version] [--verbose] infile [outfile]

Score an individual.

positional arguments:
  infile         JSON format file containing individual data
  outfile        JSON format results file (default: <stdout>)

optional arguments:
  -h, --help     show this help message and exit
  --version      show program's version number and exit
  --verbose, -v

The only formal requirements (requirements.txt) are for testing:

pip install -r requirements.txt

which is done by:

python3 -m pytest -v .

Note, project dependncies are not managed by a high level tool such as flit, poetry or Pipenv.

Analysis details

The analysis of risk with age is based on data from here:

and has been done with help from:

The genome report is currently based on the analysis in this preprint: https://www.researchsquare.com/article/rs-37798/v1

Notes from the preprint...

##reference=GRCh38.p12
chr21 41480570 rs12329760 C T . . .
chr21 41507982 rs75603675 C A . . .

rs12329760, TMPRSS2(-), V197M, C->T (G->A, V[GTG] -> M[ATG])
C is the risk allele, T is 'protective'

rs75603675, TMPRSS2(-), G008V, C->A (G->T, G[GGT] -> V[GTT])
A is the risk allele, C is 'protective'

Spearman’s correlation with COVID-19 CFR:
ρ = -0.464, P = 0.0157 for V197M C->T (G->A)
ρ = +0.713, P = 0.0018 for G008V C->A (G->T)

The higher the score, the greater your genetic risk of severe
COVID-19 infection.

Additional references

COVID:

Polygenic Risk Scores:

Papers studying the genetics of COVID

Random notes

A common pattern to mark an unfinished code is to raise a NotImplementedError that is noticed at runtime:

@classmethod
def from_api(cls, url):
    raise NotImplementedError  # TODO

TODO: Use a few tools to automatically take care of code formatting (black), flag style-related problems (flake8), as well as warn about potential bugs (pylint).