Skip to content

Commit f3915fe

Browse files
Rextilne
authored andcommitted
Add clustermgtd heartbeat path to resume and suspend program configs
* Add clustermgtd heartbeat file path to resume program and suspend program config, so cloud bursting program can have logic to prevent cluster usage when clustermgtd is down * Change permission of /opt/slurm/etc/pcluster/.slurm_plugin from 0644 to 0755 so files in the directory can be listed and read Signed-off-by: Rex <shuningc@amazon.com>
1 parent f4ff0ca commit f3915fe

File tree

3 files changed

+3
-1
lines changed

3 files changed

+3
-1
lines changed

recipes/head_node_slurm_config.rb

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -164,7 +164,7 @@
164164
directory "/opt/slurm/etc/pcluster/.slurm_plugin" do
165165
user 'root'
166166
group 'root'
167-
mode '0644'
167+
mode '0755'
168168
action :create
169169
recursive true
170170
end

templates/default/slurm/parallelcluster_slurm_resume.conf.erb

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,3 +8,4 @@ dns_domain = <%= node['cfncluster']['cfn_dns_domain'] %>
88
use_private_hostname = <%= node['cfncluster']['use_private_hostname'] %>
99
master_private_ip = <%= node['ec2']['local_ipv4'] %>
1010
master_hostname = <%= node['ec2']['local_hostname'] %>
11+
clustermgtd_heartbeat_file_path = /opt/slurm/etc/pcluster/.slurm_plugin/clustermgtd_heartbeat
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1 +1,2 @@
11
[slurm_suspend]
2+
clustermgtd_heartbeat_file_path = /opt/slurm/etc/pcluster/.slurm_plugin/clustermgtd_heartbeat

0 commit comments

Comments
 (0)