
go parallel #14

Open
thorade opened this issue Jan 27, 2013 · 4 comments

Comments

thorade (Owner) commented Jan 27, 2013

I don't see how to do this in Modelica, but going parallel could speed things up significantly:
All 5 to 14 HelmholtzDerivs can be calculated simultaneously, see HelmholtzDerivs and setHelmholtzDerivs,
and all 12 to 50 terms of each HelmholtzDeriv could be evaluated simultaneously, see e.g. f_r.

Combining these two should give 60 or more independent threads.
This should be investigated in combination with (automatic?) common subexpression elimination, because the f_r etc. function calls have many terms in common! Maybe combine with #26?
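
A minimal Modelica sketch of the term-level independence (simplified, not the library's actual f_r; the coefficient arrays n, d, t are placeholders for the equation-of-state data):

```modelica
function f_residual_sketch "toy residual Helmholtz sum with independent terms"
  input Real delta "reduced density";
  input Real tau "inverse reduced temperature";
  input Real n[:] "placeholder coefficients";
  input Real d[size(n, 1)] "placeholder density exponents";
  input Real t[size(n, 1)] "placeholder temperature exponents";
  output Real f_r;
protected
  Real term[size(n, 1)];
algorithm
  // each term[i] depends only on the inputs, never on another term[j],
  // so all terms could in principle be evaluated concurrently
  for i in 1:size(n, 1) loop
    term[i] := n[i]*delta^d[i]*tau^t[i];
  end for;
  // only this final reduction needs the individual results
  f_r := sum(term);
end f_residual_sketch;
```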

thorade closed this as completed Nov 30, 2015
thorade reopened this Sep 9, 2020
thorade (Owner, Author) commented Sep 9, 2020

There are many other places where two or more functions could be called in parallel:


casella commented Sep 9, 2020

@mahge, do you think we can exploit such a fine-grained parallelism? I'm afraid the overhead could kill any potential speedup.


mahge commented Sep 9, 2020

I cannot say much without looking at it further. However, the design and implementation are intended to be used for fine-grained parallelism, i.e., at the equation level instead of just at the level of strongly connected components. Unfortunately, it will not yet go down into functions and parallelize things there.

The good news is that, if these large functions (computations) are attached to (called from) equations that can be computed independently of each other within a single time step, they should be parallelizable. In other words, consider each instance of a call to these functions from different equations as part of that equation's computation. If, after causalization, one of the assignments does not use the LHS of the other equation, it is all the same to the implementation, and we should be able to run them in parallel.
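
A minimal hypothetical model (not taken from the library) illustrating that situation: after causalization, the assignments to y1 and y2 do not use each other's left-hand sides, so the two function calls could be scheduled in parallel:

```modelica
model IndependentCalls "two function calls that share only their input"
  function g1
    input Real u;
    output Real y;
  algorithm
    y := sin(u) + u^2;
  end g1;

  function g2
    input Real u;
    output Real y;
  algorithm
    y := exp(-u)*cos(u);
  end g2;

  Real x(start = 1, fixed = true);
  Real y1, y2;
equation
  der(x) = -x;
  // neither equation uses the other's left-hand side,
  // so g1(x) and g2(x) are independent within a time step
  y1 = g1(x);
  y2 = g2(x);
end IndependentCalls;
```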

As for the sum() operators and similar data-parallel computations within functions/algorithms, there is another parallelization implementation I did a while back that can handle them, even on GPUs. However, this would require modifications to the library source code, making it unusable with other Modelica tools. Plus, the arrays/computations need to be quite large (by Modelica standards) to see any speedup. We can look at that afterwards if you are interested.
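
For reference, the kind of pattern that implementation targets is a plain array reduction like the sketch below (the tool-specific parallel constructs are not shown, and the array would have to be far larger than typical media code uses to pay off):

```modelica
function largeSum "data-parallel friendly reduction over an array"
  input Real a[:];
  output Real s;
algorithm
  // every a[i]^2 is independent; only the final sum is a reduction
  s := sum(a[i]^2 for i in 1:size(a, 1));
end largeSum;
```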


casella commented Sep 9, 2020

> I cannot say much without looking at it further. However, the design and implementation are intended to be used for fine-grained parallelism, i.e., at the equation level instead of just at the level of strongly connected components.

OK.

> Unfortunately, it will not yet go down into functions and parallelize things there.

I guess this issue could be solved by clever generation of auxiliary variables. We already have some kind of common subexpression elimination on functions, carried out by wrapFunctionCalls, which generates an auxiliary equation $cseNN = f(...); for each function call in the model and uses $cseNN in place of the call inside the equations. Maybe this could be good enough to get separate function calls running in parallel.
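
A rough sketch of the effect (hypothetical model; the $cse name below is only illustrative of what the backend generates internally):

```modelica
model CseSketch
  function f
    input Real u;
    output Real y;
  algorithm
    y := u^3 + sin(u);
  end f;

  Real x(start = 1, fixed = true);
  Real y1, y2;
equation
  der(x) = -x;
  y1 = 2*f(x);
  y2 = f(x) + x;
  // wrapFunctionCalls conceptually rewrites the last two equations as
  //   $cse1 = f(x);   y1 = 2*$cse1;   y2 = $cse1 + x;
  // so the call to f becomes its own assignment that the scheduler
  // could treat as a separate task
end CseSketch;
```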

> The good news is that, if these large functions (computations) are attached to (called from) equations that can be computed independently of each other within a single time step, they should be parallelizable.

Yes, that is the point.

> In other words, consider each instance of a call to these functions from different equations as part of that equation's computation. If, after causalization, one of the assignments does not use the LHS of the other equation, it is all the same to the implementation, and we should be able to run them in parallel.

> As for the sum() operators and similar data-parallel computations within functions/algorithms, there is another parallelization implementation I did a while back that can handle them, even on GPUs. However, this would require modifications to the library source code, making it unusable with other Modelica tools. Plus, the arrays/computations need to be quite large (by Modelica standards) to see any speedup. We can look at that afterwards if you are interested.

Yeah, I guess the arrays are not large enough for us to benefit from that. After all, a double-precision addition takes about one clock cycle on modern CPUs (or even less on superscalar architectures), so if you only need to sum a few dozen numbers, going parallel probably doesn't make sense.
