metMHN

Mutual Hazard Networks for metastatic disease (metMHN) is an extension of the MHN algorithm by Schill et al. (2020) and Schill et al. (2023) that models the joint evolution of primary tumor and metastasis pairs. It accounts for sampling bias and for different orderings of primary tumor and metastasis diagnoses.

Installation

We recommend using a virtual environment. Create a new virtual environment:

python3 -m venv .venv

Activate the virtual environment:

source .venv/bin/activate

We rely on the JAX library for our computations. If you don't have access to a GPU, install the CPU-only version of the libraries by running:

pip3 install -r requirements.txt

If you have access to a GPU, first install the requirements as described above, then install the CUDA-enabled jaxlib package by following the JAX installation instructions.

Finally, install the metMHN package locally using:
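To confirm which backend JAX picked up after installation, a quick check along the following lines may help. This is a minimal sketch and not part of metMHN; the helper name `describe_jax_backend` is hypothetical, while `jax.default_backend()` and `jax.devices()` are standard JAX calls.

```python
import importlib.util


def describe_jax_backend() -> str:
    """Return a short description of the backend JAX would use."""
    # Guard the import so the check also works before JAX is installed.
    if importlib.util.find_spec("jax") is None:
        return "jax is not installed"
    import jax

    # jax.default_backend() returns a platform name such as "cpu" or "gpu";
    # jax.devices() lists the devices available on that backend.
    return f"{jax.default_backend()}: {jax.devices()}"


if __name__ == "__main__":
    print(describe_jax_backend())
```

If the output starts with `cpu` on a machine that has a GPU, the CUDA-enabled jaxlib package is most likely not installed or does not match the local CUDA setup.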

pip install -e .

Run metMHN

We provide an example analysis of a reduced lung adenocarcinoma (LUAD) dataset that can be run on a standard desktop by first starting a Jupyter notebook server

jupyter notebook

and then executing the file examples/data_analysis.ipynb.
Additionally, we provide an example of simple simulation-based post-hoc analyses in examples/simulation_study.ipynb.
Alternatively, metMHN can be run from the command line:

python3 examples/analysis.py -args <input-annotation-file.csv> <input-event-data.csv> <output-inferred-model.csv>

where <input-annotation-file.csv> contains supporting information for each patient, <input-event-data.csv> contains the binarized primary tumor/metastasis genotypes for each patient, and <output-inferred-model.csv> is the file in which the final inferred metMHN is stored.

You can also set the following command line arguments:

| Argument | Type | Description |
| --- | --- | --- |
| -cv | Boolean | If set, perform cross-validation |
| -cv_start | Float | Lower limit of the hyperparameter range to test in cross-validation, defaults to 1e-05 |
| -cv_end | Float | Upper limit of the hyperparameter range to test in cross-validation, defaults to 1e-02 |
| -cv_fold | Integer | Number of cross-validation folds, defaults to 5 |
| -cv_splits | Integer | Number of hyperparameters to test in the range cv_start to cv_end, defaults to 5 |
| -pm_ratio | Float | Expected ratio of never-metastasizing primary tumors to metastasizing primary tumors |
| -lam | Float | Weight of the penalization. Should only be set if no cross-validation is performed |
| -logs | String | Relative file path for log files |
| -seed | Integer | Seed for the random number generator |
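To illustrate how the cross-validation arguments relate to each other, the sketch below builds a grid of candidate penalty weights and a set of folds. It is not taken from metMHN: the function names are hypothetical, and both the log spacing of the candidates and the shuffle-based fold assignment are assumptions, so the scheme used in examples/analysis.py may differ.

```python
import random


def penalty_grid(cv_start: float = 1e-5, cv_end: float = 1e-2, cv_splits: int = 5) -> list[float]:
    """Candidate penalty weights from cv_start to cv_end (log spacing is an assumption)."""
    if cv_splits < 2:
        return [cv_start]
    ratio = cv_end / cv_start
    # Each candidate multiplies the previous one by the same constant factor.
    return [cv_start * ratio ** (i / (cv_splits - 1)) for i in range(cv_splits)]


def fold_indices(n_samples: int, cv_fold: int = 5, seed: int = 0) -> list[list[int]]:
    """Shuffle the sample indices and deal them into cv_fold disjoint folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[k::cv_fold] for k in range(cv_fold)]


if __name__ == "__main__":
    print(penalty_grid())
    print([len(fold) for fold in fold_indices(n_samples=23)])
```

With the documented defaults this yields five candidate weights between 1e-05 and 1e-02; each candidate would be scored on every held-out fold, and the best-scoring weight plays the role that -lam plays when cross-validation is switched off.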

We also provide a script to assess how well metMHN can recover ground-truth parameters:

python3 examples/recall_study.py -args 

You can also set the following command line arguments:

| Argument | Type | Description |
| --- | --- | --- |
| -n_dat | Integer | Number of datapoints in the simulated dataset |
| -n_reps | Integer | Number of simulated datasets to generate |
| -logs | String | Relative file path for log files |
| -seed | Integer | Seed for the random number generator |
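When several runs are needed, for example one recall study per seed, the commands can be assembled from Python. The helper below is a hypothetical sketch, not part of metMHN. It assumes that `-args` in the commands above stands for the optional flags, that valued flags are passed as `-flag value`, and that any positional files come last; the argument values shown are illustrative only.

```python
import shlex


def build_command(script, options=None, positional=()):
    """Assemble an argument list: python3 <script> [options] [positional files]."""
    cmd = ["python3", script]
    for flag, value in (options or {}).items():
        if value is True:
            cmd.append(f"-{flag}")  # switch without a value, e.g. -cv
        elif value is not None and value is not False:
            cmd.extend([f"-{flag}", str(value)])  # assumed "-flag value" form
    cmd.extend(positional)
    return cmd


if __name__ == "__main__":
    # One recall-study command per seed; the values are illustrative only.
    for seed in (1, 2, 3):
        cmd = build_command(
            "examples/recall_study.py",
            {"n_dat": 500, "n_reps": 10, "logs": "logs/recall.log", "seed": seed},
        )
        print(shlex.join(cmd))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)`; building a list instead of a single string avoids shell quoting problems with file paths.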