Monitoring Jobs
The Clarive Job Monitor keeps track of all jobs running in the system in a single, integrated interface.
Each job shown in the monitor links to both their Job Dashboard and Job Log interface, for extensive detail on what is being run by the job.
Monitor Fields and Data¶
Job Statuses¶
Here's a list of job statuses and their meaning:
Ready- Job is waiting to be picked-up by the job daemon, which can happen at any moment, except for theRUNstep, which runs at a scheduled date.Running- Job is currently runningWaiting for Approval- Waiting for an approver to approve or reject the job in the monitor.Rejected- Approver has rejected the job. Job is at a stand-still and won't be run unless action is taken.Expired- The current date and time is greater than theMax Start Date, so the jobRUNstep has been canceledAbend- The Job Daemon could not find the job process on the server, so it's marked as an aborted (crashed or killed) processRollback- Job is running a rollback operationFinished- Job finished runningError- Job finished with an error at any of its stepsCanceled- Job was canceled by a user while it was runningTrapped- An error was trapped and it's waiting for user inputTrap Paused- User has decided to pause the trap timeout counter since a longer resolution time is expected.
Always check the Step column to get a sense of where the job is at a given point in time.
Job Steps¶
Job steps indicate which phase of the job is being run (or expected to be run) by the job daemon at a given time.
CHECK- This step is previous to a job being created in the database and is not visible in the monitorINIT- Job has just been created, but the user is still waiting for confirmation. This is actually visible in the monitorPRE- During this step, the job will run all preparation that does not affect target environments, such as building an application or running tests.RUN- This step contains the rule logic that is going to run during the scheduled time.POST- This is the final step in the job pipeline chain. This step runs in the event of success or failure after aPREorRUNsteps
Job Progress¶
Progress is calculated by counting the number of total ops against the ones that have run. It does not include any loop unrolling, so the progress may not be 100% accurate, but gives an idea of how far the job pipeline has advanced.
Job Natures¶
Once the job contents are determined, Clarive parses all revisions and determines which natures are included.
So, this information is not necessarily available after job creation, but only after the PRE step runs.
Job Dates¶
Start Date- The real date-time that the job started itsPREstep.End Date- The real date-time when the job reached itsENDstep.Scheduled- This is when theRUNstep is planned to run.Max Start Date- If the job does not start by this date, it is marked asExpiredautomatically by the job daemon.
Monitor Actions¶
With the job monitor, you can control what happens to each job running, such as starting, canceling, deleting, reruning, etc.
These actions also require that the user have the adequate permissions, discussed further down this section.
Rerun¶
Rerun allows a job to be put in Ready status for a given step.
Normally, jobs are either rerun for PRE or RUN steps, to repeat things like build or deploy phases.
Also POST steps may be run for redoing things like resend notifications or promotions.
NOTE: If you rerun a step, all following steps will also be rerun, with the following behavior:
- If a
PREstep is rerun, theRUNstep still will preserve and wait for its scheduled date to be run. - If a
RUNstep is rerun, the scheduled date can be overruled with theRun Nowoption.
Reschedule¶
Jobs that are in Ready, Expired or Waiting for Approval can be rescheduled, which means setting a new date for the
RUN step to start.
Job Expiration¶
Jobs expire automatically when its Scheduled Date is greater than their Max Start Date. The purpose of expiring
jobs is to prevent them from start a deployment beyond a system outage, in a off-time.
Handling Expired Jobs¶
If a job has expired, it is not going to run. But using the monitor actions, the operator has 2 options:
- Rerun the job at either
PREorRUNsteps - Reschedule the job for another time
Canceling Jobs¶
Jobs in a Running state can be canceled. That will terminate the job immediately in the Clarive server, but does
not prevent processes running.
Permissions¶
action.job.view_monitor- Has access to see jobs for the authorized scopes in the Job Monitoraction.job.approve_all- Approve any job, even if not in the approval listaction.job.restart- Rerun a jobaction.job.cancel- Cancel a jobaction.job.delete- Delete a jobaction.job.create- Can create a job