[5] Writer #4

fgrunewald · 2024-04-17T12:43:11Z

This is a first draft of the writer for the CGSmiles.

To Do:

make squash operator work
make self-cycles work
have option to compress string; currently [#PEO]|3 -> [#PEO][#PEO][#PEO]
fix pysmiles hyrdorgen writing; the pysmiles writer transform for example [$]cc[$] -> [$][cH2][cH2][$], which technically speaking is correct and works but looks ugly.

pckroon

Could you rebase this on master, or at least merge master? I'll go over it again then.
Untill then, some small notes.

cgsmiles/graph_utils.py

cgsmiles/pysmiles_utils.py

cgsmiles/resolve.py

cgsmiles/write_cgsmiles.py

Co-authored-by: Peter C Kroon <[email protected]>

pckroon

Besides the specific remarks, I would also like some more comments in add_bond_descrp

cgsmiles/graph_utils.py

cgsmiles/write_cgsmiles.py

fgrunewald · 2024-10-02T16:05:46Z

@pckroon I think the writer is rather complete now; I noticed that we probably should have a reader that returns the highest resolution molecule and fragment dicts. It is wrapped into the MoleculeResolver, but from a data science point of view, it could be handy to write and write the same stuff. I think it should be a separate PR however.

pckroon

Nice work! I do think I found a few small issues besides some minor formatting though. This also means those cases need to get tested.

I noticed that we probably should have a reader that returns the highest resolution molecule and fragment dicts. It is wrapped into the MoleculeResolver, but from a data science point of view, it could be handy to write and write the same stuff. I think it should be a separate PR however.

Yes and yes. Reading and writing the same thing is super useful, also from a testing point of view.

cgsmiles/read_fragments.py

cgsmiles/tests/test_write_cgsmiles.py

pckroon · 2024-10-04T08:33:05Z

cgsmiles/write_cgsmiles.py

+        if order_symb != '-':
+            bond_str = order_symb
+        bond_str += "["+str(bonding_descrpt[:-1])+"]"


Is it possible to elide aromatic bonds between aromatic atoms?
I prefer string formatting over concatenation personally:

Suggested change

if order_symb != '-':

bond_str = order_symb

bond_str += "["+str(bonding_descrpt[:-1])+"]"

if order_symb != '-':

bond_str += order_symb

bond_str += "[{}]".format(bonding_descrpt[:-1])

pckroon · 2024-10-04T08:41:23Z

cgsmiles/write_cgsmiles.py

+def write_graph(molecule, smiles_format=False, default_element='*'):
+    """
+    Creates a CGsmiles string describing `molecule`.
+    `molecule` should be a single connected component.


Meh. Not too much work to find all connected components and call a (this) function on those is it? And then just join them with a .

😆 this one comes from the pysmiles parser I copied. It still has to be a connected component to be valid. That's simply a CGSmiles requirement. You can have edges with zero bond order though. But they have to be edges.

cgsmiles/write_cgsmiles.py

pckroon · 2024-10-04T09:03:27Z

cgsmiles/write_cgsmiles.py

+    fragment_str = ""
+    for fragname, frag_graph in fragment_dict.items():
+        fragment_str += f"#{fragname}="
+        # format graph depending on resolution
+        fragment_str += write_graph(frag_graph, smiles_format=all_atom) + ","
+    fragment_str = "{" + fragment_str[:-1] + "}"


I would make a list of fragment_str's for the separate fragments, then ','.join them

it doesn't matter does it?

cgsmiles/write_cgsmiles.py

pckroon

Pushed the wrong button, changes are needed.

cgsmiles/tests/test_write_cgsmiles.py

pckroon · 2024-10-08T09:30:19Z

cgsmiles/tests/test_write_cgsmiles.py

+            # we cannot be sure that the atomnames are the same because they
+            # will depend on the order
+            nx.set_node_attributes(frag_dict_out[fragname], None, "atomname")
+            nx.set_node_attributes(frag_dict[fragname], None, "atomname")


Can't we assume anything? For example that the first (two?) character should be the same?

It is just the element plus the atom index. The element is already part of the testing. Therefore, I think it is not required.

Co-authored-by: Peter C Kroon <[email protected]>

fgrunewald changed the title ~~Writer~~ [5] Writer Apr 27, 2024

fgrunewald mentioned this pull request May 2, 2024

[2] Aromatic fragments #5

Closed

pckroon requested changes May 13, 2024

View reviewed changes

fgrunewald added 8 commits July 9, 2024 13:48

first draft write cgsmiles

41ac6ad

implement cgsmiles writer for resgraphs

ac865ed

implement cgsmiles writer for fragments

a51f186

get connections from full resolution molecule

5277c5d

get connections from full resolution molecule

7351354

doc strings

91eafe2

simplify and clean up writer

7ba2ab5

rebase continue

85b1c72

fgrunewald force-pushed the write_cgsmiles branch from 5efc474 to 85b1c72 Compare July 9, 2024 15:21

Apply suggestions from code review

ecfe5e1

Co-authored-by: Peter C Kroon <[email protected]>

pckroon requested changes Sep 6, 2024

View reviewed changes

fgrunewald added 7 commits September 18, 2024 22:06

Merge branch 'master' into write_cgsmiles

e6d8586

address some comments

93f717d

overhaul write cgsmiles

0968dec

fix bug

aad9e4a

update writer

620f538

address comment

45f649e

clean and add some wrappers

f45a6ff

fgrunewald requested a review from pckroon October 2, 2024 16:05

pckroon reviewed Oct 4, 2024

View reviewed changes

pckroon requested changes Oct 4, 2024

View reviewed changes

fgrunewald added 2 commits October 5, 2024 12:46

address comments

7c4a94d

allow higher order rings

db7bd7a

pckroon approved these changes Oct 7, 2024

View reviewed changes

fgrunewald and others added 2 commits October 7, 2024 16:17

Merge branch 'master' into write_cgsmiles

2d43d69

update tests

54ad72b

pckroon approved these changes Oct 7, 2024

View reviewed changes

fgrunewald added 3 commits October 8, 2024 11:17

update tests

4ae7917

bug fix

431b988

remove print

7370d54

pckroon requested changes Oct 8, 2024

View reviewed changes

Update cgsmiles/tests/test_write_cgsmiles.py

116b5e9

Co-authored-by: Peter C Kroon <[email protected]>

pckroon approved these changes Oct 8, 2024

View reviewed changes

fgrunewald merged commit 3465bea into master Oct 8, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[5] Writer #4

[5] Writer #4

fgrunewald commented Apr 17, 2024 •

edited

Loading

pckroon left a comment

pckroon left a comment

fgrunewald commented Oct 2, 2024

pckroon left a comment

pckroon Oct 4, 2024

pckroon Oct 4, 2024

fgrunewald Oct 5, 2024

pckroon Oct 4, 2024

fgrunewald Oct 5, 2024

pckroon left a comment

pckroon Oct 8, 2024

fgrunewald Oct 8, 2024

[5] Writer #4

[5] Writer #4

Conversation

fgrunewald commented Apr 17, 2024 • edited Loading

pckroon left a comment

Choose a reason for hiding this comment

pckroon left a comment

Choose a reason for hiding this comment

fgrunewald commented Oct 2, 2024

pckroon left a comment

Choose a reason for hiding this comment

pckroon Oct 4, 2024

Choose a reason for hiding this comment

pckroon Oct 4, 2024

Choose a reason for hiding this comment

fgrunewald Oct 5, 2024

Choose a reason for hiding this comment

pckroon Oct 4, 2024

Choose a reason for hiding this comment

fgrunewald Oct 5, 2024

Choose a reason for hiding this comment

pckroon left a comment

Choose a reason for hiding this comment

pckroon Oct 8, 2024

Choose a reason for hiding this comment

fgrunewald Oct 8, 2024

Choose a reason for hiding this comment

fgrunewald commented Apr 17, 2024 •

edited

Loading