Implementation of the Martini3 Go-model #550

fgrunewald · 2023-10-08T18:32:35Z

work in progress but almost finished;

To Do

@pckroon I've added the missing Go functionality so far that it can now read a contact map, but not do the contact analysis. Because the functionality missing from the Go code concerns interactions written to the atomtypes/nonbondparams directives it is not a processor in the original sense. Hence I've dumped the files in an extra toplevel directory. Does that make sense or should we rather have it as a processor?

@Lp0lp Can you generate a reference contact map using the CreateGoVirt script for lysozyme? The canonical AA structure can be found here. Unfortunately, it doesn't run on Mac (maybe just my Mac). Do you know if the format of the contact map in terms of columns is always the same?

Also this project has a hard deadline October 10.

pckroon

First of all, great! I'm very happy to have this functionality in vermouth/martinize2.

I do think it should be a Processor (which can live in its own folder, see dssp), the question is what to do with the non-bonded parameter info. I'm tempted to overengineer this, but YAGNI. So maybe just dump them in a format that's convenient for you in either System or Forcefield. I think my preference would be in System (analogous to Molecule.interactions). Add a comment to that to indicate we'll engineer something proper when we need to touch non-bonded parameters and atom types for a second time.

align ouput file names with CreateGoVirt script

This is super optional to me. I'm more than happy to make backward breaking changes wrt that script.

bin/martinize2

vermouth/rcsu/get_go.py

Lp0lp · 2023-10-09T10:36:58Z

Heyo! I've generated the contact map for lysozyme using the old, new, and locally executable contact map generator. Annoyingly, the format is not mega-consistent across the 3 versions. They have pretty different headers (not a big deal, I think) but more annoying is their substantially different column layouts. I think the create_goVirt script can only use the map from the old webserver currently.

On top of that... besides the different file layout the output is also quite different..... but that is a whole other deal...

I've attached below the maps for you to have a look at:
exec.txt
webservernew.txt
webserverold.txt

EDIT: Also note that this is probably why MAD gives different go-models. Since mad uses the locally executable version of the contact map generator, which besides having a different layout seems to also spit out a different map altogether.

Lp0lp · 2023-10-09T10:41:55Z

align ouput file names with CreateGoVirt script

This is super optional to me. I'm more than happy to make backward breaking changes wrt that script.

The file names out of creategovirt are a mess, if we can clean them up now then all the better I think.

Lp0lp · 2023-10-09T11:11:40Z

They have pretty different headers (not a big deal, I think) but more annoying is their substantially different column layouts. I think the create_goVirt script can only use the map from the old webserver currently.

I just revised the contact maps again, the executable and old webservers are actually consistent. I was wrongly looking at the first table when it is the second one (line 5454 and onwards) that the script reads. This residue-residue contact table is in the same format as the executable it just has more stuff printed out before it. IMO, we drop support for the output out of the new webserver for now, and I can send them an email later asking them to make it more consistent perhaps.

fgrunewald · 2023-10-10T15:24:42Z

@pckroon why do we want the atomtypes to be dumped into system or molecules? They can be written to file directly because they are coupled to molecule i.e. two molecules cannot have the same set of atomtypes

pckroon · 2023-10-11T13:40:42Z

Because it fits better into the general workflow imho. It also makes the include statements easier in the top file. I do understand your point though.

fgrunewald · 2023-10-16T07:33:07Z

@Lp0lp Last but not least I'm refactoring the water biasing a little, such that it can be applied without the Go model. I think that's also the original idea of the IDPs. One question I have is whether you think someone would only want to bias IDRs in a structured protein and NOT the rest of the protein (i.e. applying idp fix only). My idea was a CLI like so:

martinize2 -water-bias -> applies the auto water bias
martinize2 -water-bias-eps -> you can set the epsilon for all sec-struc types including an "idr" key for IDPs/IDRs
martinize2 -id-regions -> define which regions are IDRs

In this scenario, any IDR biasing overwrites the default auto biasing. If we have default setting for the sec-struc biases the only tricky part would be to only apply biases to the IDP regions because that would require setting the specific biases to 0.

Any thoughts?

pckroon · 2023-10-16T09:29:47Z

all sec-struc types including an "idr" key for IDPs/IDRs

What do you mean by this? Would it make sense to allow users to set which sec-struct types to apply the bias to as well (with a sane default)?

fgrunewald · 2023-10-16T09:33:49Z

like as follows:

-water-bias-eps H:2.1 C:2.3 idr:2.1

This means applying a bias of 2.1 kJ/mol to helix but 2.3 to coil. All intrinsically disordered regions as defined by the region argument are biased with 2.1. Regions always superseded the other assignments.

pckroon · 2023-10-16T09:38:49Z

Alright, that looks cool.

Lp0lp · 2023-10-16T11:11:47Z

@Lp0lp Last but not least I'm refactoring the water biasing a little, such that it can be applied without the Go model. I think that's also the original idea of the IDPs.

Perfect! Would be helpful to also be able to combine the water biasing with elastic networks for peptides, etc.

One question I have is whether you think someone would only want to bias IDRs in a structured protein and NOT the rest of the protein (i.e. applying idp fix only). My idea was a CLI like so:
martinize2 -water-bias -> applies the auto water bias
martinize2 -water-bias-eps -> you can set the epsilon for all sec-struc types including an "idr" key for IDPs/IDRs
martinize2 -id-regions -> define which regions are IDRs
In this scenario, any IDR biasing overwrites the default auto biasing. If we have default setting for the sec-struc biases the only tricky part would be to only apply biases to the IDP regions because that would require setting the specific biases to 0.

Could be a possibility for some I guess... If I understood correctly you define 4 assignments (helix, coil, strand and idr) with specific eps' and where idr overrides the other 3. What is there to stop you from setting helix coil and strand to 0 with -water-bias-eps and leaving only the IDR with a eps != 0? That would let users only apply the bias to the IDP region, no? Bit ugly CLI wise and writes a few useless BB-W interactions with a bunch of (harmless, I think) zeros though.

pckroon

I'll continue reviewing later today

bin/martinize2

Co-authored-by: Peter C Kroon <[email protected]>

pckroon

I went over the code again and still found some things. Most are very small, some are really just nitpicks.
I also noticed some of my old comments didn't get addressed or resolved, could you also go back and have a look at those.

As for the signature of the GoPipeline, leave it as is for now.

vermouth/gmx/topology.py

pckroon · 2023-12-06T13:12:14Z

vermouth/gmx/topology.py

+    # First we write the atomtypes directive
+    if "atomtypes" in system.gmx_topology_params:
+        _path = itp_paths.pop()
+        write_atomtypes(system, _path, C6C12)
+        include_string += f'\n #include "{_path}"'
+    # Next we write the nonbond_params directive
+    if "nonbond_params" in system.gmx_topology_params:
+        _path = itp_paths.pop()
+        write_nonbond_params(system, _path, C6C12)
+        include_string += f'\n #include "{_path}"\n'


Did we ever reach a conclusion on this? My original comment mentioned generating the _path s based on moltypes. IIRC the outcome was that we can't do that since things like nbparams are system-wide attributes.

vermouth/processors/water_bias.py

vermouth/tests/test_water_bias.py

vermouth/tests/test_vs_generation.py

changes not relating to tests Co-authored-by: Peter C Kroon <[email protected]>

Co-authored-by: Peter C Kroon <[email protected]>

vermouth/tests/rcsu/test_go_utils.py

vermouth/tests/integration_tests/test_integration.py

Co-authored-by: Peter C Kroon <[email protected]>

pckroon

Nice work!

vermouth/gmx/topology.py

fgrunewald and others added 2 commits October 6, 2023 12:47

init draft for Go model

8891cc5

first implementation Go model

ae818e3

pckroon requested changes Oct 9, 2023

View reviewed changes

pckroon added the hacktoberfest-accepted Accepted Hacktoberfest contribution label Oct 9, 2023

fgrunewald added 12 commits October 12, 2023 15:22

implement structural bias processor

d2ed0f6

have some handy utilites for the Go pipeline

ab9303b

add water bias functionality

b315603

use new contact map format

77d7c70

rename get go

96be469

add licensce header

6e05c09

move the go_vs_includes

92fb9b9

add go_pipeline

462d033

implement go_pipeline

01b617b

move logging to processor

f0bc07f

lining

1fd451a

refactor topology writing

e60e120

pckroon mentioned this pull request Oct 13, 2023

Bug: The residue order between the input all-atom model and the output coarse-grained model is different. #551

Closed

fgrunewald added 2 commits October 13, 2023 15:49

some clean up

e01dc30

refactor water bias

b0b4b6c

make water bias workflow worl

5cd2c82

fgrunewald and others added 3 commits December 5, 2023 13:11

Update contact_map.py

6e8382d

Fix sphinx references

806d1f5

Fix sphinx references

6f0f20b

pckroon requested changes Dec 6, 2023

View reviewed changes

bin/martinize2 Outdated Show resolved Hide resolved

bin/martinize2 Show resolved Hide resolved

bin/martinize2 Show resolved Hide resolved

pckroon and others added 2 commits December 6, 2023 11:16

fix CLI typo

0f31eed

Update bin/martinize2

71f19aa

Co-authored-by: Peter C Kroon <[email protected]>

pckroon requested changes Dec 6, 2023

View reviewed changes

fgrunewald and others added 3 commits December 11, 2023 09:15

Apply suggestions from code review

317f813

changes not relating to tests Co-authored-by: Peter C Kroon <[email protected]>

Apply suggestions from code review

484f904

Co-authored-by: Peter C Kroon <[email protected]>

Apply suggestions from code review

372f226

Co-authored-by: Peter C Kroon <[email protected]>

pckroon reviewed Dec 14, 2023

View reviewed changes

vermouth/tests/rcsu/test_go_utils.py Outdated Show resolved Hide resolved

pckroon reviewed Dec 14, 2023

View reviewed changes

vermouth/tests/integration_tests/test_integration.py Outdated Show resolved Hide resolved

pckroon reviewed Dec 14, 2023

View reviewed changes

vermouth/tests/integration_tests/test_integration.py Outdated Show resolved Hide resolved

fgrunewald and others added 13 commits December 14, 2023 13:31

Update vermouth/tests/rcsu/test_go_utils.py

01b32b5

Co-authored-by: Peter C Kroon <[email protected]>

Update vermouth/tests/helper_functions.py

023e457

Co-authored-by: Peter C Kroon <[email protected]>

Update vermouth/tests/rcsu/test_go_structure_bias.py

cf40022

Co-authored-by: Peter C Kroon <[email protected]>

Update vermouth/tests/integration_tests/test_integration.py

aaf8ee2

Co-authored-by: Peter C Kroon <[email protected]>

Update vermouth/tests/integration_tests/test_integration.py

69ae011

Co-authored-by: Peter C Kroon <[email protected]>

put comment

393e1bb

fix docstring go vs includes

72a3817

fix spelling

e2491d0

Merge branch 'master' into Go

012c1c9

make itp_paths dict in write_gmx_topology

2a36e63

fix error type in test

daa792b

fix bug in topology regarding itp_paths

f5f5dea

fix bug in topology regarding itp_paths

2fb2821

pckroon approved these changes Dec 14, 2023

View reviewed changes

vermouth/gmx/topology.py Show resolved Hide resolved

fgrunewald merged commit 71710c5 into master Dec 14, 2023
7 of 8 checks passed

fgrunewald deleted the Go branch December 14, 2023 14:15

pckroon mentioned this pull request Jun 14, 2024

-govs-include no longer present? #603

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation of the Martini3 Go-model #550

Implementation of the Martini3 Go-model #550

fgrunewald commented Oct 8, 2023 •

edited

Loading

pckroon left a comment

Lp0lp commented Oct 9, 2023 •

edited

Loading

Lp0lp commented Oct 9, 2023

Lp0lp commented Oct 9, 2023

fgrunewald commented Oct 10, 2023

pckroon commented Oct 11, 2023

fgrunewald commented Oct 16, 2023

pckroon commented Oct 16, 2023

fgrunewald commented Oct 16, 2023 •

edited

Loading

pckroon commented Oct 16, 2023

Lp0lp commented Oct 16, 2023

pckroon left a comment

pckroon left a comment

pckroon Dec 6, 2023 •

edited

Loading

pckroon left a comment

Implementation of the Martini3 Go-model #550

Implementation of the Martini3 Go-model #550

Conversation

fgrunewald commented Oct 8, 2023 • edited Loading

pckroon left a comment

Choose a reason for hiding this comment

Lp0lp commented Oct 9, 2023 • edited Loading

Lp0lp commented Oct 9, 2023

Lp0lp commented Oct 9, 2023

fgrunewald commented Oct 10, 2023

pckroon commented Oct 11, 2023

fgrunewald commented Oct 16, 2023

pckroon commented Oct 16, 2023

fgrunewald commented Oct 16, 2023 • edited Loading

pckroon commented Oct 16, 2023

Lp0lp commented Oct 16, 2023

pckroon left a comment

Choose a reason for hiding this comment

pckroon left a comment

Choose a reason for hiding this comment

pckroon Dec 6, 2023 • edited Loading

Choose a reason for hiding this comment

pckroon left a comment

Choose a reason for hiding this comment

fgrunewald commented Oct 8, 2023 •

edited

Loading

Lp0lp commented Oct 9, 2023 •

edited

Loading

fgrunewald commented Oct 16, 2023 •

edited

Loading

pckroon Dec 6, 2023 •

edited

Loading