Fred Smith
2014-10-08 22:18:42 UTC
I am using slurm latest version 14.03.8 on CentOS 6.2. It's compiled following the instructions in the installation guide for rpmbuild and rpm install. The plugin rpm is installed, nevertheless slurm does not start when using the topology sample from the Topology Guide (the node names are also configured in the slurm.conf NodeName):
SwitchName=s0 Nodes=node-0-[4-5]
SwitchName=s1 Nodes=node-0-[6-7]
SwitchName=s2 Switches=s[0-1]
slurm is configured with TopologyPlugin=topology/tree .
However the start fails with parsing errors:
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 1:
"SwitchName=s0 Nodes=hpc-0-[4-5] "
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 2:
"SwitchName=s1 Nodes=hpc-0-[6-7] "
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 3:
"SwitchName=s2 Switches=s[0-1]"
scontrol: error: "Include" failed in file /etc/slurm/slurm.conf line 85
scontrol: fatal: Unable to process configuration file
the plugin is installed: topology_tree.so in the default location and does indeed include
strings : SwitchName LinkSpeed Nodes Switches. The slurm start script does include the default LIBDIR setting.
Any hint what is missing to get the topology to work?
SwitchName=s0 Nodes=node-0-[4-5]
SwitchName=s1 Nodes=node-0-[6-7]
SwitchName=s2 Switches=s[0-1]
slurm is configured with TopologyPlugin=topology/tree .
However the start fails with parsing errors:
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 1:
"SwitchName=s0 Nodes=hpc-0-[4-5] "
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 2:
"SwitchName=s1 Nodes=hpc-0-[6-7] "
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 3:
"SwitchName=s2 Switches=s[0-1]"
scontrol: error: "Include" failed in file /etc/slurm/slurm.conf line 85
scontrol: fatal: Unable to process configuration file
the plugin is installed: topology_tree.so in the default location and does indeed include
strings : SwitchName LinkSpeed Nodes Switches. The slurm start script does include the default LIBDIR setting.
Any hint what is missing to get the topology to work?