Jordi Blasco
2014-06-10 11:19:31 UTC
Hi,
we are using Snoopy library (https://github.com/a2o/snoopy) in order to
monitor and collect statistics regarding to the applications used in the
HPC resources.
Since there are more than 30% of the jobs in our database without any
information in this regard, it seems that Snoopy is not capable to track
everything.
Some other tools like PerfMiner or monitor (
http://web.eecs.utk.edu/~mucci/monitor/) are used in several places, but
since it relies on PapiEx (http://icl.cs.utk.edu/~mucci/papiex/), and this
project is no longer supported, I would like to know if there is some other
approach to collect this data.
In addition to that, I would like to know if it can be possible to enforce
to use srun in the submit script. I used a sbatch wrapper before, but maybe
there is now a better way to do it.
Thanks!
Regards,
Jordi
we are using Snoopy library (https://github.com/a2o/snoopy) in order to
monitor and collect statistics regarding to the applications used in the
HPC resources.
Since there are more than 30% of the jobs in our database without any
information in this regard, it seems that Snoopy is not capable to track
everything.
Some other tools like PerfMiner or monitor (
http://web.eecs.utk.edu/~mucci/monitor/) are used in several places, but
since it relies on PapiEx (http://icl.cs.utk.edu/~mucci/papiex/), and this
project is no longer supported, I would like to know if there is some other
approach to collect this data.
In addition to that, I would like to know if it can be possible to enforce
to use srun in the submit script. I used a sbatch wrapper before, but maybe
there is now a better way to do it.
Thanks!
Regards,
Jordi