Commit 70d41c0

Rextilne authored and committed
Wait for clustermgtd heartbeat before requesting for initial count
Signed-off-by: Rex <shuningc@amazon.com>
1 parent 9ceed59 commit 70d41c0

1 file changed (+6, -0 lines)

recipes/head_node_slurm_finalize.rb

Lines changed: 6 additions & 0 deletions
@@ -15,6 +15,12 @@
 # OR CONDITIONS OF ANY KIND, express or implied. See the License for the specific language governing permissions and
 # limitations under the License.
 
+execute "check if clustermgtd heartbeat is available" do
+  command "cat /opt/slurm/etc/pcluster/.slurm_plugin/clustermgtd_heartbeat"
+  retries 30
+  retry_delay 10
+end
+
 ruby_block "submit dynamic fleet initialization jobs" do
   block do
     require 'json'
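
The added `execute` resource relies on Chef's `retries` and `retry_delay` properties to poll for the heartbeat file. As a rough illustration of that behavior outside Chef, the sketch below re-reads a file until it exists. The helper name `wait_for_file` and its `sleeper` parameter are hypothetical and not part of this commit; the sketch assumes that `retries` counts the attempts made after the first failure, so 30 retries means up to 31 attempts in total.

```ruby
# Hypothetical helper, not part of the commit: mimics an `execute` resource
# that runs `cat <path>` with `retries` and `retry_delay` set.
# Returns the number of attempts it took to read the file.
def wait_for_file(path, retries: 30, retry_delay: 10, sleeper: ->(s) { sleep(s) })
  attempts = 0
  begin
    attempts += 1
    File.read(path) # raises Errno::ENOENT while the file is missing, like a failing `cat`
    attempts
  rescue SystemCallError
    # Give up once the first attempt plus all retries have failed.
    raise if attempts > retries
    sleeper.call(retry_delay)
    retry
  end
end
```

With the values used in the diff (`retries 30`, `retry_delay 10`), a loop like this would wait roughly 300 seconds at most before raising, which would fail the recipe run if the heartbeat file never appears.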
