Implement a new BFGS optimizer, used for geometry relaxation #5467

19hello · 2024-11-12T10:51:46Z

Reminder

Have you linked an issue with this pull request?
Have you added adequate unit tests and/or case tests for your pull request?
Have you noticed possible changes of behavior below or in the linked issue?
Have you explained the changes of codes in core modules of ESolver, HSolver, ElecState, Hamilt, Operator or Psi? (ignore if not applicable)

Linked Issue

Fix #...

Unit Tests and/or Case Tests for my changes

A unit test is added for each new feature or bug fix.

What's changed?

Example: My changes might affect the performance of the application under certain conditions, and I have tested the impact on various scenarios...

Any changes of core modules? (ignore if not applicable)

Example: I have added a new virtual function in the esolver base class in order to ...

dyzheng · 2024-11-13T05:48:27Z

Please rewrite the title of this PR, where is this method from?

dyzheng · 2024-11-13T05:50:35Z

Please raise a issue to describe this method you have implemented, and show some results to prove your code.

QuantumMisaka · 2024-11-13T07:46:37Z

@19hello Apart from the method description, from the users' side we need test results from your BFGS method, and compare test for old BFGS and ASE BFGS

… into mybfgs

mohanchen · 2024-11-16T06:34:47Z

SiH2-3et-relax/H.ccECP.upf

do not add new file, this file is too large

@19hello
for SiH2 example, you can attach it as zip file under the corresponding issue, but not in source-code

I've delete SiH2-3et-relax

@19hello After you resolve this comment, please mark them as resolved

SiH2-3et-relax/Si.ccECP.upf

SiH2-3et-relax/abacus.out

mohanchen · 2024-11-16T06:35:41Z

SiH2-3et-relax/log

we don't need this file

tests/integrate/109_PW_CR/log1

QuantumMisaka · 2024-11-18T23:47:28Z

I'll have test with #3119 and my system

QuantumMisaka · 2024-11-19T03:01:58Z

@19hello After you finish adding your BFGS method, please consider the way user know and use it

Docs about your BFGS method need to be updated at https://abacus.deepmodeling.com/en/latest/advanced/opt.html and https://abacus.deepmodeling.com/en/latest/advanced/input_files/input-main.html
You need to explain your BFGS method and its difference with the old one in this PR or a related issue

mohanchen · 2024-11-19T03:39:39Z

source/module_relax/CMakeLists.txt

@@ -6,6 +6,7 @@ add_library(

    relax_new/relax.cpp
    relax_new/line_search.cpp
+    relax_new/bfgs.cpp



one extra line found here, please delete it

mohanchen · 2024-11-19T03:39:47Z

source/module_relax/CMakeLists.txt

@@ -17,6 +18,7 @@ add_library(
    relax_old/lattice_change_basic.cpp
    relax_old/lattice_change_cg.cpp
    relax_old/lattice_change_methods.cpp
+


one extra line found

mohanchen · 2024-11-19T03:41:35Z

source/module_relax/relax_driver.cpp

    {
-        if (!PARAM.inp.relax_new)
+        if(PARAM.inp.relax_method == "bfgs_trad")


some logic here have problems, please think carefully the function of 'relax_new' parameter

@sunliang98 please help Feiyang check the codes

@YuLiu98 please help checking the logic here

mohanchen · 2024-11-19T03:43:44Z

source/module_relax/relax_driver.cpp

@@ -77,7 +82,12 @@ void Relax_Driver::relax_driver(ModuleESolver::ESolver* p_esolver)

            if (PARAM.inp.calculation == "relax" || PARAM.inp.calculation == "cell-relax")
            {
-                if (PARAM.inp.relax_new)
+                if(PARAM.inp.relax_method == "bfgs_trad")


same logic problem found here

mohanchen · 2024-11-19T03:43:55Z

source/module_relax/relax_driver.cpp

+                {   
+                    stop=bfgs_trad.relax_step(force,GlobalC::ucell);
+                }
+


one extra line found, please delete it

mohanchen · 2024-11-19T03:44:48Z

source/module_relax/relax_new/bfgs.h

+{
+public:
+
+    double alpha;//initialize H,diagonal element is alpha


use doxygen format for notes

mohanchen · 2024-11-19T03:45:16Z

source/module_relax/relax_new/test/bfgs_test.cpp

you need to learn how to add tests.

QuantumMisaka · 2024-11-19T03:49:49Z

@19hello Please remove redundant information in stdout and add standard stdout information
Your print-out in relaxation in stdout

 STEP OF ION RELAXATION : 1
 -------------------------------------------
 START CHARGE      : atomic
 DONE(9.76382    SEC) : INIT SCF
 ITER      TMAG       AMAG        ETOT/eV          EDIFF/eV         DRHO     TIME/s
 GE1      1.00e+00   1.05e+00  -1.05828247e+05   0.00000000e+00   2.0197e-01  27.79
 GE2      1.00e+00   1.03e+00  -1.05974176e+05  -1.45929058e+02   6.3496e-02  26.72
 GE3      1.00e+00   1.05e+00  -1.05975481e+05  -1.30541467e+00   4.0279e-02  26.63
 GE4      1.00e+00   1.03e+00  -1.05975750e+05  -2.69111778e-01   1.6323e-02  26.72
 GE5      1.00e+00   1.04e+00  -1.05975788e+05  -3.79099790e-02   2.5708e-03  26.59
 GE6      1.00e+00   1.04e+00  -1.05975801e+05  -1.32110432e-02   1.6555e-03  26.57
 GE7      1.00e+00   1.04e+00  -1.05975807e+05  -5.65212985e-03   9.8648e-04  26.63
 GE8      1.00e+00   1.04e+00  -1.05975810e+05  -2.82251534e-03   5.1189e-04  26.60
 GE9      1.00e+00   1.04e+00  -1.05975812e+05  -2.36488839e-03   2.8607e-04  26.75
 GE10     1.00e+00   1.04e+00  -1.05975813e+05  -1.23262181e-03   1.6751e-04  27.49
 GE11     1.00e+00   1.04e+00  -1.05975815e+05  -1.12359872e-03   1.2231e-04  26.93
 GE12     1.00e+00   1.04e+00  -1.05975815e+05  -8.95499320e-04   8.2590e-05  26.61
 GE13     1.00e+00   1.04e+00  -1.05975816e+05  -6.48718376e-04   5.4154e-05  26.68
 GE14     1.00e+00   1.04e+00  -1.05975817e+05  -4.14907789e-04   3.2547e-05  26.49
 GE15     1.00e+00   1.04e+00  -1.05975817e+05  -1.32732903e-04   1.7303e-05  26.62
 GE16     1.00e+00   1.04e+00  -1.05975817e+05  -8.18802293e-05   1.1003e-05  26.55
 GE17     1.00e+00   1.04e+00  -1.05975817e+05  -3.53268862e-05   5.9376e-06  26.61
 GE18     1.00e+00   1.04e+00  -1.05975817e+05  -1.76773333e-05   3.7520e-06  26.58
 GE19     1.00e+00   1.04e+00  -1.05975817e+05  -1.47323588e-05   2.2837e-06  26.60
 GE20     1.00e+00   1.04e+00  -1.05975817e+05  -9.94848759e-06   1.3589e-06  26.59
 GE21     1.00e+00   1.04e+00  -1.05975817e+05  -1.20949602e-05   7.7700e-07  26.45
----------------------------------------------------------------
 TOTAL-STRESS (KBAR)                                            
----------------------------------------------------------------
        20.4609576207        -0.3745619029         0.2831022240 
        -0.3745619029        18.6297901586         1.0074961155 
         0.2831022240         1.0074961155        20.2788212288 
----------------------------------------------------------------
 TOTAL-PRESSURE: 19.789856 KBAR

enter Step
-0.084090 0.047495 0.117520 
0.035535 0.080209 -0.086416 
0.033115 0.022912 0.041516 
0.054424 0.085066 -0.015168 
0.034315 -0.100508 0.031721 
-0.024596 0.022252 -0.028070 
0.058134 0.021169 -0.042996 
-0.020403 0.099263 -0.011137 
-0.019789 -0.231395 -0.034845 
0.006322 0.009388 0.109516 
0.005212 0.282777 0.158713 
0.242909 0.120901 -0.020402 
0.183283 -0.025066 -0.032386 
0.155625 -0.012797 -0.031496 
-0.120652 -0.022005 0.289342 
0.104293 0.006106 0.258712 
0.020757 0.190447 -0.224988 
-0.083278 0.187994 0.029553 
-0.067578 -0.255382 0.030215 
0.229501 -0.132157 -0.030133 
-0.076858 -0.110391 -0.159552 
-0.016123 0.260658 0.043143 
-0.111364 -0.271461 0.020970 
-0.057393 -0.147844 -0.179816 
0.069255 -0.033162 -0.090451 
0.098905 0.057314 -0.124832 
(not finished)

Standard

GE176  1.44e+02  1.57e+02  -2.637812e+05  -8.513525e-09  1.058e-07  1.082e+01  
 GE177  1.44e+02  1.57e+02  -2.637812e+05  -1.836348e-08  8.958e-08  1.095e+01  
----------------------------------------------------------------
TOTAL-STRESS (KBAR)                                           
----------------------------------------------------------------
      -11.8425501698         0.4251191391        -0.7651493630
        0.4251191391         4.1133571068         0.9350401936
       -0.7651493630         0.9350401936        -6.5247407538
----------------------------------------------------------------
 ETOT DIFF (eV)       : 0.000e+00
 LARGEST GRAD (eV/A)  : 5.900e-01
 BFGS TRUST (Bohr)    : 5.000e-01
 -------------------------------------------
 STEP OF ION RELAXATION : 2

into mybfgs1

kirk0830 · 2024-11-20T03:36:43Z

I will review this PR carefully (mainly focus on the code structure) once all functionalities are implemented correctly.

QuantumMisaka · 2024-11-20T07:48:48Z

@19hello Now the stdout in BFGS:

 STEP OF ION RELAXATION : 1
 -------------------------------------------
 START CHARGE      : atomic
 DONE(12.0107    SEC) : INIT SCF
 ITER      TMAG       AMAG        ETOT/eV          EDIFF/eV         DRHO     TIME/s
 GE1      1.00e+00   1.05e+00  -1.05828247e+05   0.00000000e+00   2.0197e-01  27.87
 GE2      1.00e+00   1.03e+00  -1.05974176e+05  -1.45929058e+02   6.3496e-02  26.71
 GE3      1.00e+00   1.05e+00  -1.05975481e+05  -1.30541467e+00   4.0279e-02  26.62
 GE4      1.00e+00   1.03e+00  -1.05975750e+05  -2.69111778e-01   1.6323e-02  26.54
 GE5      1.00e+00   1.04e+00  -1.05975788e+05  -3.79099791e-02   2.5708e-03  26.61
 GE6      1.00e+00   1.04e+00  -1.05975801e+05  -1.32110432e-02   1.6555e-03  26.52
 GE7      1.00e+00   1.04e+00  -1.05975807e+05  -5.65212981e-03   9.8648e-04  26.54
 GE8      1.00e+00   1.04e+00  -1.05975810e+05  -2.82251525e-03   5.1189e-04  26.47
 GE9      1.00e+00   1.04e+00  -1.05975812e+05  -2.36488872e-03   2.8607e-04  26.58
 GE10     1.00e+00   1.04e+00  -1.05975813e+05  -1.23262177e-03   1.6751e-04  26.52
 GE11     1.00e+00   1.04e+00  -1.05975815e+05  -1.12359819e-03   1.2231e-04  26.49
 GE12     1.00e+00   1.04e+00  -1.05975815e+05  -8.95499827e-04   8.2590e-05  26.47
 GE13     1.00e+00   1.04e+00  -1.05975816e+05  -6.48718030e-04   5.4154e-05  26.49
 GE14     1.00e+00   1.04e+00  -1.05975817e+05  -4.14907776e-04   3.2547e-05  26.61
 GE15     1.00e+00   1.04e+00  -1.05975817e+05  -1.32733026e-04   1.7303e-05  26.47
 GE16     1.00e+00   1.04e+00  -1.05975817e+05  -8.18803778e-05   1.1003e-05  26.48
 GE17     1.00e+00   1.04e+00  -1.05975817e+05  -3.53268862e-05   5.9376e-06  26.52
 GE18     1.00e+00   1.04e+00  -1.05975817e+05  -1.76772219e-05   3.7520e-06  26.55
 GE19     1.00e+00   1.04e+00  -1.05975817e+05  -1.47325444e-05   2.2837e-06  26.47
 GE20     1.00e+00   1.04e+00  -1.05975817e+05  -9.94843809e-06   1.3589e-06  26.47
 GE21     1.00e+00   1.04e+00  -1.05975817e+05  -1.20947498e-05   7.7700e-07  26.41
----------------------------------------------------------------
 TOTAL-STRESS (KBAR)                                            
----------------------------------------------------------------
        20.4609576207        -0.3745619029         0.2831022240 
        -0.3745619029        18.6297901586         1.0074961155 
         0.2831022240         1.0074961155        20.2788212288 
----------------------------------------------------------------
 TOTAL-PRESSURE: 19.789856 KBAR

-------------------------------------------
 STEP OF ION RELAXATION : 2
 -------------------------------------------
 DONE(603.992246 SEC) : INIT SCF

The LARGEST GRAD information is lost, please fix it

kirk0830

thanks for your contribution, I have roughly reviewed your PR. There are mainly two points that you should consider additionally:

explain about the term "bfgs_trad", in this PR and also update our markdown document in /docs/advanced/input_files/input-main.md. Because not only us, all users should have a way to know the functionality you implemented
there are really few annotation both in you header file and source file. Please use doxygen format to add some anntations.

There are also other aspects in which this BFGS implementation can be significantly improved, but you are not required to do these in this PR. There will be a issue records all possible improvements. Some of them are:

the code structure and interface design. The former is really a time-consuming task, before having a very clear idea, I will not talk too much here. The latter is more specific: because BFGS is a general optimization algorithm, once you implement one with higher efficiency, all other developers can benefit from you code as long as the interface is general enough.
The linear algebra (linalg) operation (op). I notice you use std::vector of std::vector to represent a matrix, and you write some linalg ops on your own. This will not significantly affect performance in small systems, but will be bottle neck when the system is quite large.
Unittest and integrated tests are absent.

source/module_io/read_input_item_relax.cpp

QuantumMisaka · 2024-11-21T02:33:10Z

@19hello Thanks for you contribution!
Please notices:

You need an issue to let developers and users know your development target and background
Your function should have likely print-out format in stdout and running*.log to allow smoothly usage

Besides, there are some other optimizer for geometry relaxation in ASE, including the most recommended BFGSLineSearch : https://wiki.fysik.dtu.dk/ase/ase/optimize.html#bfgslinesearch, optimizer base on MD (MDMin and FIRE), and optimizer from scipy. There are also many types of BFGS, so you need to explain what your BFGS is.

Furthermore, if your BFGS coding structure is beauty, it will benefit your next optimizer implementation. I suggest starting from BFGSLineSearch

QuantumMisaka · 2024-11-21T02:38:18Z

I'll have test with #3119 and my system

In my easy test, this BFGS somehow have worse relaxation performance, which lead to 50 ion step but not converge, but the original BFGS can make it done in 40 ion steps.

But the LARGEST GRAD information is lost in the stdout, and I do not know if the convergence judgement is also lost in your algorithm, please check them,

19hello · 2024-11-21T04:14:52Z

I'll have test with #3119 and my system

In my easy test, this BFGS somehow have worse relaxation performance, which lead to 50 ion step but not converge, but the original BFGS can make it done in 40 ion steps.

But the LARGEST GRAD information is lost in the stdout, and I do not know if the convergence judgement is also lost in your algorithm, please check them,

Thank you for your testing. I will try to fix the problems you mentioned in next PR

QuantumMisaka · 2024-11-21T08:34:40Z

I'll have test with #3119 and my system

In my easy test, this BFGS somehow have worse relaxation performance, which lead to 50 ion step but not converge, but the original BFGS can make it done in 40 ion steps.
But the LARGEST GRAD information is lost in the stdout, and I do not know if the convergence judgement is also lost in your algorithm, please check them,

Thank you for your testing. I will try to fix the problems you mentioned in next PR

I consider the LARGEST GRAD information in stdout should be fixed in this PR

19hello · 2024-11-21T08:59:38Z

I'll have test with #3119 and my system

In my easy test, this BFGS somehow have worse relaxation performance, which lead to 50 ion step but not converge, but the original BFGS can make it done in 40 ion steps.
But the LARGEST GRAD information is lost in the stdout, and I do not know if the convergence judgement is also lost in your algorithm, please check them,

Thank you for your testing. I will try to fix the problems you mentioned in next PR

I consider the LARGEST GRAD information in stdout should be fixed in this PR

I just pushed a new PR. In this PR, I have add 'LARGEST GRAD ` information in stdout. I also changed the logic of obtaining atom pos and add Unittest and integrated tests.

The previous convergence condition was that dpos less than 0.00001, so it can't converge within 50 steps. In the new PR, you can see the 'LARGEST GRAD ` information to judge if it is converged.

Besides, I can't run #3119 on my computer because the files are so large. Would you please run my bfgs_trad again.

QuantumMisaka · 2024-11-21T09:05:47Z

I'll have test with #3119 and my system

In my easy test, this BFGS somehow have worse relaxation performance, which lead to 50 ion step but not converge, but the original BFGS can make it done in 40 ion steps.
But the LARGEST GRAD information is lost in the stdout, and I do not know if the convergence judgement is also lost in your algorithm, please check them,

Thank you for your testing. I will try to fix the problems you mentioned in next PR

I consider the LARGEST GRAD information in stdout should be fixed in this PR

I just pushed a new PR. In this PR, I have add 'LARGEST GRAD ` information in stdout. I also changed the logic of obtaining atom pos and add Unittest and integrated tests.

The previous convergence condition was that dpos less than 0.00001, so it can't converge within 50 steps. In the new PR, you can see the 'LARGEST GRAD ` information to judge if it is converged.

Besides, I can't run #3119 on my computer because the files are so large. Would you please run my bfgs_trad again.

Good, I'll test it, but is there any diffuculty to normalize the convergence condition to the traditional force_thr_ev parameter ?

19hello · 2024-11-21T09:12:36Z

I'll have test with #3119 and my system

In my easy test, this BFGS somehow have worse relaxation performance, which lead to 50 ion step but not converge, but the original BFGS can make it done in 40 ion steps.
But the LARGEST GRAD information is lost in the stdout, and I do not know if the convergence judgement is also lost in your algorithm, please check them,

Thank you for your testing. I will try to fix the problems you mentioned in next PR

I consider the LARGEST GRAD information in stdout should be fixed in this PR

I just pushed a new PR. In this PR, I have add 'LARGEST GRAD information in stdout. I also changed the logic of obtaining atom pos and add Unittest and integrated tests. The previous convergence condition was that dpos less than 0.00001, so it can't converge within 50 steps. In the new PR, you can see the 'LARGEST GRAD information to judge if it is converged.
Besides, I can't run #3119 on my computer because the files are so large. Would you please run my bfgs_trad again.

Good, I'll test it, but is there any diffuculty to normalize the convergence condition to the traditional force_thr_ev parameter ?

I forgot it. I will change the convergence condition in next PR.

QuantumMisaka · 2024-11-23T03:14:12Z

Optimization through this BFGS have been test on #3119 and showing much efficiency (reduce 50% of the number of optimization steps to 0.05 eV/A max force and show no sharp increase of LARGEST GRAD).

Waiting for the force_ev_thr been used and more optimization method added-in

mohanchen · 2024-11-25T12:34:08Z