Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gapfill update #73

Merged
merged 55 commits into from
Mar 7, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
55 commits
Select commit Hold shift + click to select a range
1e2b0eb
Added BiGG ID filtering #52
GwennyGit Feb 24, 2023
71454cb
Changed a test line in analysis_db #52
GwennyGit Feb 24, 2023
9721f62
Changed output format for gapfill #52
GwennyGit Feb 24, 2023
b8e83a9
Changed output handling for gapfill in main #52
GwennyGit Feb 24, 2023
9aaa366
Fixed issue with KEGG gapfill analysis #52
GwennyGit Feb 24, 2023
204113f
setup include ratelimit and sqlalchemy #52
famosab Feb 27, 2023
18950f9
How to get BioCyc smart tables docs #52
famosab Feb 27, 2023
5b8f040
Update Docs
famosab Feb 27, 2023
6bcb3ac
Modify auto-imports
famosab Feb 27, 2023
b6cd6ab
Update Docs
famosab Feb 27, 2023
d9801fc
change gitignore
famosab Feb 27, 2023
e66bbf7
Update Docs
famosab Feb 27, 2023
1f1649f
Completed function gapfill_model #52
GwennyGit Feb 27, 2023
103d98f
Adjusted main for gapfill #52
GwennyGit Feb 27, 2023
68a251a
Did some minor changes to improve code #52
GwennyGit Feb 27, 2023
3229ffc
Adjusted config file for gapfill #52
GwennyGit Feb 27, 2023
6d0ae6f
Created example file for Excel table #52
GwennyGit Feb 27, 2023
3f812ed
Changed mapping in cvterms dicts for BioCyc #59
GwennyGit Feb 27, 2023
c50c0de
Adjusted entities for gapfill #52
GwennyGit Feb 27, 2023
e7603d3
Adjusted curate, growth & polish for the docs
GwennyGit Feb 27, 2023
2c496e8
Updated Docs
GwennyGit Feb 27, 2023
f1697be
Update Docs
famosab Feb 27, 2023
462a0fa
Merge branch 'dev' of https://github.com/draeger-lab/refinegems into …
GwennyGit Mar 1, 2023
3ec25cc
Updated documentation website
GwennyGit Mar 1, 2023
ec14a32
Updated README
GwennyGit Mar 1, 2023
3de4ccb
Added missing comma to setup.py
GwennyGit Mar 1, 2023
c9c8273
Update README.md
GwennyGit Mar 1, 2023
6735d90
Fixed some major issues #52
GwennyGit Mar 2, 2023
ee815bc
Commented gff part out as currently not working #52
GwennyGit Mar 2, 2023
622c6ac
Fixed all remaining problems #52
GwennyGit Mar 3, 2023
b0fd632
Fixed last issues #52
GwennyGit Mar 4, 2023
2ebc635
Remove * from imports
famosab Mar 4, 2023
90eba5f
Update README.md
famosab Mar 4, 2023
61a3f0c
Remove * import from libsbml
famosab Mar 4, 2023
0cbb0dd
remove * import from libsbml
famosab Mar 4, 2023
595fd4d
Removed id2loc.csv
GwennyGit Mar 4, 2023
fb30014
resolve merge conflicts and rename Args to Params
famosab Mar 6, 2023
10203f2
Update Docs
famosab Mar 6, 2023
bd5927a
undo rename
famosab Mar 6, 2023
fe322f3
Adjusted doc strings for gapfill #52 #74
GwennyGit Mar 6, 2023
8defbc9
Update doc strings databases #74
GwennyGit Mar 6, 2023
dad973b
Update doc strings io #74
GwennyGit Mar 6, 2023
4cf61e6
Resolved merge conflicts for dev #52
GwennyGit Mar 7, 2023
3828f6f
Update Docs
famosab Mar 7, 2023
f181acf
Integrated gapfill into the new main.py #52
GwennyGit Mar 7, 2023
55cb972
Integrated gapfill into the new config.yaml #52
GwennyGit Mar 7, 2023
3d80ff0
Integrated gapfill into io.save_user_input #52
GwennyGit Mar 7, 2023
2c1ac3e
Merge branch 'gapfill-update' of https://github.com/draeger-lab/refin…
GwennyGit Mar 7, 2023
cfc1d2f
Adjusted gapfill function names to new main & config #52
GwennyGit Mar 7, 2023
735c9ee
Reduced main doc string in gapfill #52 #74
GwennyGit Mar 7, 2023
7a65816
Created gapfill page for website #74
GwennyGit Mar 7, 2023
8f561ae
Update Docs
famosab Mar 7, 2023
b35c255
Update usage.rst
famosab Mar 7, 2023
38cdf00
Update Docs
famosab Mar 7, 2023
d8017f5
Update Pipfile
famosab Mar 7, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
13 changes: 7 additions & 6 deletions Pipfile
Original file line number Diff line number Diff line change
Expand Up @@ -16,13 +16,14 @@ refinegems = {path = "."}
setuptools = "==58.0.0"
markupsafe = "==2.0.1"
depinfo = "==1.7.0"
sortedcontainers = "==2.4.0",
libchebipy= "==1.0.10",
venn= "==0.1.3",
ols-client = "==0.1.3",
seaborn = "==0.12.2",
sqlalchemy = "==1.4.43",
sortedcontainers = "==2.4.0"
libchebipy= "==1.0.10"
venn= "==0.1.3"
ols-client = "==0.1.3"
seaborn = "==0.12.2"
sqlalchemy = "==1.4.43"
click = "==8.1.3"
ratelimit = "==2.2.1"

[dev-packages]
autopep8 = "*"
Expand Down
149 changes: 78 additions & 71 deletions Pipfile.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

28 changes: 26 additions & 2 deletions config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -57,7 +57,7 @@ sboterms: FALSE

### Model polishing ### The database of the model identifiers needs to be specified with 'id_db'
polish: FALSE
id_db: 'BIGG' # Required!
id_db: 'BIGG' # Required!
# Possible identifiers, currently: BiGG & VMH
# For other IDs the `polish` function in `polish.py` might need adjustment
lab_strain: FALSE # Needs to be set to ensure that protein IDs get the 'bqbiol:isHomologTo' qualifier
Expand All @@ -70,4 +70,28 @@ charge_corr: FALSE
### Manual Curation ###
man_cur: FALSE
man_cur_type: 'gapfill' # either 'gapfill' or 'metabs'
man_cur_table: 'data/manual_curation.xlsx'
man_cur_table: 'data/manual_curation.xlsx'

### Automatic gap filling ###
# All parameters are required for all db_to_compare choices except:
# - organismid which is only required for db_to_compare: 'KEGG'/'KEGG+BioCyc'
# - and biocyc_files which is not required for 'KEGG'
gap_analysis: FALSE
gap_analysis_params:
db_to_compare: 'KEGG' # One of the choices KEGG|BioCyc|GFF|KEGG+BioCyc
organismid: 'T05059' # Needs to be specified for KEGG
gff_file: 'data/cstr.gff' # Path to RefSeq GFF file
biocyc_files:
- 'Path0' # Path to TXT file containing a SmartTable from BioCyc with the columns 'Accession-2' 'Reaction of gene' (-)
- 'Path1' # Path to TXT file containing a SmartTable with all reaction relevant information (*)
- 'Path2' # Path to TXT file containing a SmartTable with all metabolite relevant information (+)
- 'Path3' # Path to protein FASTA file used as input for CarveMe (Needed to get the protein IDs from the locus tags)
# (-) If the organism is not in BioCyc retrieve a table mapping all reactions in BioCyc to the corresponding sequence
# (*) 'Reaction' 'Reactants of reaction' 'Products of reaction' 'EC-Number' 'Reaction-Direction' 'Spontaneous?'
# (+) 'Compound' 'Object ID' 'Chemical Formula' 'InChI-Key' 'ChEBI'
# For all BioCyc files the order should be the same. If the organism does not occur in the BioCyc database the
# complete tables for reactions can be used with the same columns.
gapfill_model: FALSE
gap_analysis_file: 'Path to Excel file with which gaps in model should be filled'
# Either obtained by running gapfill_analysis/Created by hand with the same structure as the result file from gapfill_analysis
# Example Excel file to fill in by hand: data/modelName_gapfill_analysis_date_example.xlsx
Binary file added data/modelName_gapfill_analysis_date_example.xlsx
Binary file not shown.
Binary file modified docs/build/doctrees/application.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/development.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/environment.pickle
Binary file not shown.
Binary file removed docs/build/doctrees/examples.doctree
Binary file not shown.
Binary file removed docs/build/doctrees/explanation.doctree
Binary file not shown.
Binary file added docs/build/doctrees/gapfill.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/index.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/installation.doctree
Binary file not shown.
Binary file modified docs/build/doctrees/modules.doctree
Binary file not shown.
Loading