offmon_2017-01_ver14


Click on each figure to show larger version

Status

Info retrieved: 2017-05-08 10:29:06.0

Job Limit: 0 Total attempts: 3454

Status from SWIF Summary

Numbers are for individual registered jobs, final status.

Undispatched Dispatched Succeeded Failed Problems Canceled Total
0 0 3426 0 4 0 3430


Auger Status

Dependency Pending Staging In Active Staging Out Finishing Dispatched Total
0 0 0 0 0 0 0

Status by Resources

Counts are for each attempt of running a job, if an attempt fails and the job is resubmitted

the same job will appear in other places as other attempts.

Resources Results TOTAL
RAM (GB) time limit (hrs) SUCCESS USER ERROR FAILED TIMEOUT OVER_RLIMIT UNDISPATCHED ACTIVE CANCELLED
20.0 8.0 3426 15 8 0 4 0 0 0 3453
25.0 8.0 0 1 0 0 0 0 0 0 1
TOTAL 3426 16 8 0 4 0 0 0 3454

Number of Attempts For Each Job


Number of jobs in each stage since launch


Time Spent in Dependency


Time Spent in Pending


Number of Jobs Active


Duration of Each Stage


Wall Time, CPU Time, CPU Time vs Wall Time


MAX Memory reported by AUGER


Problem Jobs

Run File RAM (GB) Attempt Problem Resolution SWIF Job ID Auger Job ID Submit Time Complete Time
030411 000 20 1 / 2 SWIF-USER-NON-ZERO retry 5535661 37491239 2017-05-02 16:27:26.0 2017-05-04 23:43:34.0
030450 004 20 1 / 2 SWIF-USER-NON-ZERO retry 5535760 37491338 2017-05-02 16:27:44.0 2017-05-04 22:12:21.0
030463 000 20 1 / 2 SWIF-USER-NON-ZERO retry 5535807 37491385 2017-05-02 16:27:53.0 2017-05-02 17:02:28.0
030470 001 20 1 / 5 SWIF-USER-NON-ZERO retry 5535843 37491421 2017-05-02 16:27:59.0 2017-05-03 22:27:35.0
030470 001 20 2 / 5 SWIF-USER-NON-ZERO retry 5535843 37523159 2017-05-04 11:42:37.0 2017-05-05 00:20:31.0
030470 001 20 3 / 5 AUGER-OVER_RLIMIT retry 5535843 37544476 2017-05-05 09:56:03.0 2017-05-05 12:06:29.0
030470 001 20 4 / 5 SWIF-USER-NON-ZERO retry 5535843 37571657 2017-05-05 16:52:56.0 2017-05-07 14:58:06.0
030470 001 20 5 / 5 SWIF-USER-NON-ZERO 5535843 37714276 2017-05-07 20:32:53.0 2017-05-07 22:34:07.0
030496 000 20 1 / 2 SWIF-USER-NON-ZERO retry 5535932 37491510 2017-05-02 16:28:14.0 2017-05-03 19:46:54.0
030632 000 20 1 / 2 SWIF-USER-NON-ZERO retry 5536158 37491736 2017-05-02 16:28:48.0 2017-05-02 22:49:38.0
030804 003 20 1 / 2 AUGER-FAILED retry 5536578 37492160 2017-05-02 16:29:58.0 2017-05-05 01:34:16.0
030805 004 20 1 / 2 AUGER-FAILED retry 5536583 37492165 2017-05-02 16:29:59.0 2017-05-05 01:34:16.0
030888 000 20 1 / 2 AUGER-FAILED retry 5536770 37492352 2017-05-02 16:30:25.0 2017-05-05 01:34:16.0
030898 004 20 1 / 4 SWIF-USER-NON-ZERO retry 5536801 37492383 2017-05-02 16:30:29.0 2017-05-05 02:18:23.0
030898 004 20 2 / 4 SWIF-USER-NON-ZERO retry 5536801 37544480 2017-05-05 09:56:03.0 2017-05-05 11:26:06.0
030898 004 20 3 / 4 SWIF-USER-NON-ZERO retry 5536801 37571658 2017-05-05 16:52:56.0 2017-05-07 11:21:34.0
030898 004 20 4 / 4 AUGER-OVER_RLIMIT 5536801 37714277 2017-05-07 20:32:53.0 2017-05-07 22:03:43.0
030954 004 20 1 / 2 AUGER-FAILED retry 5536889 37492478 2017-05-02 16:30:45.0 2017-05-04 16:43:24.0
030955 004 20 1 / 2 AUGER-FAILED retry 5536891 37492480 2017-05-02 16:30:45.0 2017-05-04 16:43:25.0
030448 007 20 1 / 2 SWIF-USER-NON-ZERO retry 5567383 37548372 2017-05-05 10:36:25.0 2017-05-06 13:19:09.0
030451 008 20 1 / 2 AUGER-OVER_RLIMIT modify 5567396 37548386 2017-05-05 10:36:28.0 2017-05-07 00:49:05.0
030451 008 25 2 / 2 SWIF-USER-NON-ZERO 5567396 37714284 2017-05-07 20:32:55.0 2017-05-07 21:31:24.0
030459 005 20 1 / 2 SWIF-USER-NON-ZERO retry 5567421 37548413 2017-05-05 10:36:33.0 2017-05-06 18:03:08.0
030598 005 20 1 / 2 AUGER-FAILED retry 5567682 37548692 2017-05-05 10:37:21.0 2017-05-07 15:00:02.0
030745 006 20 1 / 2 AUGER-FAILED retry 5568113 37549151 2017-05-05 10:39:09.0 2017-05-07 15:00:02.0
030754 008 20 1 / 2 AUGER-FAILED retry 5568120 37549158 2017-05-05 10:39:10.0 2017-05-07 15:00:02.0
030783 006 20 1 / 2 SWIF-USER-NON-ZERO retry 5568150 37549188 2017-05-05 10:39:17.0 2017-05-07 18:23:40.0
030783 006 20 2 / 2 AUGER-OVER_RLIMIT 5568150 37714283 2017-05-07 20:32:54.0 2017-05-08 00:04:02.0