Plugin_jobid

Date:

03 Dec 2016

NAME

Plugin_jobid - man page for the LDMS jobid plugin

SYNOPSIS

Within ldmsd_controller or in a configuration file
config name=jobid [ <attr>=<value> ]

DESCRIPTION

With LDMS (Lightweight Distributed Metric Service), plugins for the ldmsd (ldms daemon) are configured via ldmsd_controller or a configuration file. The jobid plugin provides jobid info from /var/run/ldms.jobinfo or similar files replaced periodically by resource managers. When files are missing, the value 0 or equivalent is reported.
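
As a rough illustration of the fallback described above, the following sketch reads a JOBID= line from a jobinfo-style file and prints 0 when the file is missing or has no such line. It is not the plugin's implementation: the function name read_jobid is hypothetical, and the KEY=value layout is assumed from the prolog example in EXAMPLES.

```shell
#!/bin/sh
# Print the JOBID recorded in a jobinfo-style file, or 0 if the file is
# missing, unreadable, or has no JOBID line.
# Assumption: the file holds KEY=value lines such as JOBID=1234.
read_jobid() {
    file=${1:-/var/run/ldms.jobinfo}
    if [ ! -r "$file" ]; then
        echo 0
        return 0
    fi
    # Keep only the value of the last JOBID= line, if any.
    id=$(sed -n 's/^JOBID=//p' "$file" | tail -n 1)
    echo "${id:-0}"
}
```

Calling read_jobid with no argument reads /var/run/ldms.jobinfo; passing a path mirrors the file= attribute.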

CONFIGURATION ATTRIBUTE SYNTAX

config
name=<plugin_name> producer=<pname> instance=<set_name> [component_id=<compid> schema=<sname>] [with_jobid=<bool>] [file=<filepath>]
configuration line
name=<plugin_name>

This MUST be jobid.
producer=<pname>

The producer name value.
instance=<set_name>

The name of the metric set.
schema=<sname>

Optional schema name. It is intended that the same sampler on different nodes with different metrics have a different schema. If not specified, will default to `jobid`.
component_id=<compid>

Optional component identifier. Defaults to zero.
with_jobid=<bool>

Set to 1 to look up the job_id for the set, or 0 to skip the lookup. The job_id column always appears, but is populated with zero when the lookup is disabled.
file=<filepath>

Optional path of the job information file to read. The DESCRIPTION names /var/run/ldms.jobinfo as the usual location.

BUGS

No known implementation bugs. Design features you may not like: the plugin relies on site-specific resource manager configuration to produce the file it reads, and it does not query local or remote manager daemons. It may be slow to sample and may generate undesirable filesystem events if filepath is on a networked filesystem instead of a node-local RAM partition, as is usual in clusters.

NOTES

The colname option from the LDMS v2 slurmjobid plugin is no longer supported. The sampler offset for the jobid plugin should be slightly less than that of all other plugins, so that the job information reported for a given time interval is consistent across those plugins. The time interval for the jobid plugin need only be approximately the clock granularity of the resource manager.

Other samplers use the jobid plugin as the jobid data source. If the jobid sampler is not loaded, these samplers will report 0 jobid values.
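
The offset guidance above can be sketched as a small script that writes a configuration fragment in which jobid samples 100 ms ahead of one other sampler. This is an illustration only: the helper name write_sampler_conf, the choice of meminfo as the second sampler, the producer name, and the offset values are all assumptions, not recommendations.

```shell
#!/bin/sh
# Write an ldmsd configuration fragment to the path given as $1.
# jobid uses offset=-100000 (100 ms before the interval boundary) and
# the second sampler uses offset=0, so jobid is sampled first.
# Assumption: a meminfo sampler plugin is available on the node.
write_sampler_conf() {
    conf=$1
    cat > "$conf" <<'EOF'
load name=jobid
config name=jobid producer=vm1_1 instance=vm1_1/jobid
start name=jobid interval=1000000 offset=-100000
load name=meminfo
config name=meminfo producer=vm1_1 instance=vm1_1/meminfo
start name=meminfo interval=1000000 offset=0
EOF
}
```

Both samplers share the same interval, so the only difference in timing is the offset.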

EXAMPLES

Within ldmsd_controller or in a configuration file
load name=jobid
config name=jobid component_id=1 producer=vm1_1 instance=vm1_1/jobid
start name=jobid interval=1000000 offset=-100000


Within ldmsd_controller or in a configuration file
load name=jobid
config name=jobid component_id=1 producer=vm1_1 instance=vm1_1/jobid file=/var/run/rman/node/jobinfo
start name=jobid interval=1000000 offset=-100000

Slurm 2.x installations can populate /var/run/ldms.jobinfo by adding the following lines to slurm.epilog and slurm.prolog, respectively.

echo "JOBID=0" > /var/run/ldms.jobinfo

and

echo JOBID=$SLURM_JOBID > /var/run/ldms.jobinfo
echo UID=$SLURM_UID >> /var/run/ldms.jobinfo
echo USER=$SLURM_JOB_USER >> /var/run/ldms.jobinfo

These slurm files might be found in /etc/nodestate/bin/.
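
This page does not say how the plugin behaves if it samples while the file is only partly written. One way a site might sidestep the question is to write a temporary file in the same directory and rename it over the target, as in the sketch below. The helper name write_jobinfo and its argument order are hypothetical; mktemp is assumed to be installed.

```shell
#!/bin/sh
# Usage: write_jobinfo <target> <jobid> [<uid> <user>]
# Writes the job information to a temporary file next to <target>, then
# renames it into place so a reader sees either the old file or the
# complete new one.
write_jobinfo() {
    target=$1
    jobid=$2
    tmpfile=$(mktemp "${target}.XXXXXX") || return 1
    {
        echo "JOBID=$jobid"
        # UID and USER lines are written only when both are supplied.
        if [ $# -ge 4 ]; then
            echo "UID=$3"
            echo "USER=$4"
        fi
    } > "$tmpfile"
    # mktemp creates the file readable by its owner only; open it up so
    # a daemon running as another user can read it.
    chmod 644 "$tmpfile"
    mv -f "$tmpfile" "$target"
}
```

Under these assumptions a prolog would call write_jobinfo /var/run/ldms.jobinfo "$SLURM_JOBID" "$SLURM_UID" "$SLURM_JOB_USER", and an epilog would call write_jobinfo /var/run/ldms.jobinfo 0.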

SEE ALSO

ldms(7), ldmsd(8), ldmsd_controller(8)