-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New biomol fields #400
Open
d-beltran
wants to merge
23
commits into
Materials-Consortia:develop
Choose a base branch
from
d-beltran:iss389_biomol
base: develop
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
New biomol fields #400
Changes from 1 commit
Commits
Show all changes
23 commits
Select commit
Hold shift + click to select a range
eca24df
added new biomol fields
d-beltran 572a29f
added new biomol sequence fields
d-beltran 173172f
Update optimade.rst
d-beltran 5911d53
Title underlines fit
d-beltran 05a8fed
More explained biomol_chain types
d-beltran 76f690c
Added standard labels for biomol_chain types
d-beltran 1f5650d
Added underscore to new fieldnames
d-beltran a8781ac
Amend
d-beltran ac6ed65
New species property: biomol atom name
d-beltran 55f71e2
Restructured biomol sequences
d-beltran 534ec8d
Merge branch 'develop' into iss389_biomol
d-beltran f865a5a
Laussane discussions update
d-beltran af3c817
Update optimade.rst
d-beltran 616019c
Update optimade.rst
d-beltran bd3e9e1
Update optimade.rst
d-beltran ce93c9d
Update optimade.rst
d-beltran a875aaf
added a few breaklines with correct indent
d-beltran a1ddd88
Update optimade.rst
d-beltran a0ee16a
Update optimade.rst
d-beltran c0ea1ac
Update optimade.rst
d-beltran af14d72
Update optimade.rst
d-beltran 7610146
insertion_code renamed as icode
d-beltran c38075c
Merge branch 'develop' into iss389_biomol
rartino File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -2490,7 +2490,9 @@ biomol_chains | |
- **name**: The chain name/letter. | ||
- **residues**: A list of integers referring to the index of :field:`biomol_residues`, that belong to this chain. | ||
The list SHOULD NOT be empty. The index of the first residue is 0. | ||
- **types**: A list of tags specifying the type of molecules this chain contains. | ||
- **types**: A list of custom tags/labels specifying the type of molecules this chain contains (e.g. 'protein'). | ||
This field is useful as an overview of every chain and as a query target for the structure. | ||
Labels in this field are non-standard. Every implementation may use different labels according to its needs. | ||
- **sequences**: A list of residue sequences in current chain. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. If the same sequence occurs twice in a chain. Should the sequence be listed here twice or just once? |
||
- **sequence_types**: A list of tags specifying the type of each sequence in the :property:`sequences` field. The type of a sequence is defined by its components (e.g. 'aminoacids'). | ||
- There SHOULD NOT be two or more chains with the same :property:`name`. | ||
|
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would advise standardizing labels at least for the most common molecule types to benefit the queryability of the field. Implementations could use their own labels, but prefixed with their own database-specific prefixes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good idea. This way we could save a lot of work.
I have been searching for references for our labels in the current mmCIF format (the future PDB standard) and this is the best I have found. They are meant for assemblies and they do not totally suit me, but I will try to resemble them.
So I suggest the standard labels to be the following: 'PROTEIN', 'NUCLEIC ACID', 'CARBOHYDRATES', 'LIPID', 'MEMBRANE', 'LIGAND', 'ION', 'SOLVENT', 'OTHER'.
If you agree I will commit changes soon.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, it seems like a good idea to define these labels.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I very much like basing on mmCIF. Maybe there is already a JSON representation for mmCIF data?
These labels sound very good. I would just render them in lowercase and describe the use of prefixes for custom labels in the form of
<prefix>:<label>
.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In a fast search I found several mmCIF to JSON parsers and one of them seems to be the official one.
Allright