Inquiry on Parallelization for SSCHA Calculation and GAP machine-learning potential #396
-
|
Dear developers, I am currently working with the SSCHA in combination with a GAP machine-learning potential, and I've encountered an issue related to the computational time required for our system. Our calculation setup requires a large number of configurations, with the max_pop parameter set to 10,000 for stability. However, due to the size of the system, completing the calculation with this setting is extremely slow. We have found that reducing max_pop results in the appearance of imaginary frequencies in the Hessian matrix, making it essential to maintain this value for accurate results. Given this, we are seeking to parallelize our calculations to speed up the process. Could you kindly provide guidance or suggest any modifications to the script that would allow us to parallelize the SSCHA calculation? Take the following script as an example. Thank you very much for your time and support. I look forward to your response. Best regards, |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
Hi, these are very strange settings. My suspect is that you just need to increase nconf and keep max_pop to just 4 or 5, as nconf is what really determines the precision and accuracy of the calculation, while max_pop determines how many iterations are required to go near the solution. In particular for the Hessian, it is required to have a large number of nconf to converge, while max_pop is mostly irrelevant above few tens. Anyway, to parallelize it should be sufficient to run the script with mpirun as Where in this case we are parallelizing over 4 processors. You can see from the output if it is parallelizing: it should print that it is computing the number of configurations divided by the processors. |
Beta Was this translation helpful? Give feedback.
Hi, these are very strange settings. My suspect is that you just need to increase nconf and keep max_pop to just 4 or 5, as nconf is what really determines the precision and accuracy of the calculation, while max_pop determines how many iterations are required to go near the solution. In particular for the Hessian, it is required to have a large number of nconf to converge, while max_pop is mostly irrelevant above few tens.
Anyway, to parallelize it should be sufficient to run the script with mpirun as
Where in this case we are parallelizing over 4 processors.
If you have multiple mpi installed, check that the version you are using match the one employed to…