Sergio Iserte Agut
2012-04-23 13:53:06 UTC
Hello,
I'm trying to configure my Slurm-2.3.2 in order to allow me to run multiple
jobs in the same GPU.
These are my configurations:
*slurm.conf*
SchedulerType=sched/backfill
*gres.conf*
*
*
*$ srun -w"compute0" --gres=gpu:1,gpu_mem:250 sleep 100 &*
*$ squeue*
*
*
Thank you.
Regards!
Sergio Iserte.
I'm trying to configure my Slurm-2.3.2 in order to allow me to run multiple
jobs in the same GPU.
These are my configurations:
*slurm.conf*
SchedulerType=sched/backfill
SelectType=select/linear
GresTypes=gpu,gpu_mem
NodeName=enersis CPUs=1 Sockets=1 CoresPerSocket=1 ThreadsPerCore=1
RealMemory=1006 State=UNKNOWN
NodeName=compute0 NodeAddr=10.0.0.2 CPUs=4 RealMemory=7982 Sockets=1
CoresPerSocket=4 ThreadsPerCore=1 State=UNKNOWN gres=gpu:1,gpu_mem:512
NodeName=compute1 NodeAddr=10.0.0.3 CPUs=4 RealMemory=7982 Sockets=1
CoresPerSocket=4 ThreadsPerCore=1 State=UNKNOWN gres=gpu:1,gpu_mem:512
PartitionName=debug Nodes=compute[0-1] Default=YES MaxTime=INFINITE
State=UP
* *GresTypes=gpu,gpu_mem
NodeName=enersis CPUs=1 Sockets=1 CoresPerSocket=1 ThreadsPerCore=1
RealMemory=1006 State=UNKNOWN
NodeName=compute0 NodeAddr=10.0.0.2 CPUs=4 RealMemory=7982 Sockets=1
CoresPerSocket=4 ThreadsPerCore=1 State=UNKNOWN gres=gpu:1,gpu_mem:512
NodeName=compute1 NodeAddr=10.0.0.3 CPUs=4 RealMemory=7982 Sockets=1
CoresPerSocket=4 ThreadsPerCore=1 State=UNKNOWN gres=gpu:1,gpu_mem:512
PartitionName=debug Nodes=compute[0-1] Default=YES MaxTime=INFINITE
State=UP
*gres.conf*
*
*
Name=gpu File=/dev/nvidia0
Name=gpu_mem Count=512
*$ srun -w"compute0" --gres=gpu:1,gpu_mem:250 sleep 100 &*Name=gpu_mem Count=512
*$ srun -w"compute0" --gres=gpu:1,gpu_mem:250 sleep 100 &*
*$ squeue*
*
*
JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON)
68 debug sleep root PD 0:00 1 (Resources)
67 debug sleep root R 0:04 1 compute0
I wonder if to run both jobs sharing the GPU memory is possible.68 debug sleep root PD 0:00 1 (Resources)
67 debug sleep root R 0:04 1 compute0
Thank you.
Regards!
Sergio Iserte.