Workflow Summary

Events Units
category total read written total
unmasked Units which are not masked by lumi_mask. If no mask has been specified, all units are unmasked. For more information see the documentation.
written merged
stuck Units which cannot be attempted because they are either failed or skipped, or their input is a unit in a parent workflow that failed or was skipped. If you want to increase skipping/failure thresholds so that the parent units are attempted again, run lobster configure /my/working/directory and increase threshold_for_failure and/or threshold_for_skipping in the lobster.core.config.AdvancedOptions section. If the parent unit finishes successfully, the stuck units will automatically be attempted. For more information see the documentation on updating your configuration after the start of a run, and advanced configuration options.
failed Units for which the executable has not exited successfully more than threshold_for_failure.

If you want to increase the failure threshold so that these units are attempted again, run lobster configure /my/working/directory and increase threshold_for_failure in the lobster.core.config.AdvancedOptions section. For more information see the documentation on updating your configuration after the start of a run, and advanced configuration options.
skipped Units for which accessing the input file has failed more than threshold_for_skipping.

If you want to increase the skipping threshold so that these units are attempted again, run lobster configure /my/working/directory and increase threshold_for_skipping in the lobster.core.config.AdvancedOptions section. For more information see the documentation on updating your configuration after the start of a run, and advanced configuration options.
left Units which are available for processing but haven't been attempted yet:

left = unmasked - running - written - failed - skipped - stuck
Progress Merged JSON
mAOD_step_tllq4fNoSchanWNoHiggs0p_HanV4Model16DttllScanpointsXQCUT0MatchOff_run1 0 5 000 000 5 000 000 10 000 10 000 10 000 10 000 0 0 0 0 100.0 % 100.0 % processed
mAOD_step_ttHJet_HanV4ttXJetStartPtChecks_run0 0 6 450 524 6 450 524 40 000 40 000 40 000 40 000 0 0 0 0 100.0 % 100.0 % processed
mAOD_step_ttllNuNuJetNoHiggs_HanV4ttXJetStartPtChecks_run2 0 5 653 664 5 653 664 40 000 40 000 40 000 40 000 0 0 0 0 100.0 % 100.0 % processed

Task Summary

Efficiency

Transfer Protocol Summary

Protocol Stage-in Success Stage-in Failure Stageout Success Stageout Failure
hdfs 0 88183 0 36795
root 95526 0 0 36795
gsiftp 0 0 11784 0

Task Resources Utilized

Task Resources Allocated

Task Timing

statisticsprofile show timeline breakdown

Task Resources

Task Timing

statisticsprofile show timeline breakdown

A mapping of the exit codes can be found in the documentation.

Failure Modes

Task Resources

Task Logs

Exit code Count Samples
-11 77 281167 281026 280436 280156 280017 280013 279998 279948 279814 279813
-4 218 281173 281170 281064 281025 280996 280995 280491 280487 280468 280463
143 9 264385 263908 252070 247964 233687 231190 218593 216864 138661
8020 260 281177 280501 280494 280484 280483 280482 280406 280400 280280 280227
8021 917 281175 281174 281171 281169 281130 281113 281107 281082 281075 281043
8022 1 192615
10001 306 280977 280976 280823 280799 280791 280489 280481 280469 280272 280195
10040 1 238078

Hosts with Task Failures

Hostname Exit Codes
All 8021 10001 -4 8020 -11 143
d64cepyc002.crc.nd.edu 49 18 14 13 3 1 0
d64cepyc003.crc.nd.edu 44 18 13 5 3 5 0
d64cepyc001.crc.nd.edu 41 18 9 10 3 1 0
q16copt100.crc.nd.edu 22 11 2 4 4 1 0
q16copt090.crc.nd.edu 21 6 4 6 4 0 1
q16copt051.crc.nd.edu 20 4 8 3 4 1 0
q16copt088.crc.nd.edu 19 6 5 1 5 2 0
d12chas326.crc.nd.edu 9 5 1 0 3 0 0
q16copt050.crc.nd.edu 9 6 0 0 2 1 0
d12chas327.crc.nd.edu 8 3 1 2 2 0 0