CIDuty:QuarantineInstances: Difference between revisions

Added info about logs and added pictures
(work_in_progress)
(Added info about logs and added pictures)
Line 3: Line 3:
= When and How to quarantine taskcluster instances =
= When and How to quarantine taskcluster instances =


Choose the worker types you wish to investigate. You can find them [https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/ here.]
1. '''Choose the worker types you wish to investigate. You can find them [https://tools.taskcluster.net/provisioners/releng-hardware/worker-types/ here.]'''


Check which instances show an exception (orange) task state and investigate each of them separately.
2. '''Check which instances show an exception (orange) or failed (red) task state and investigate each of them separately.'''


Quarantine the ones whose last 10+ tasks resolved as exception (orange), and accept the default 1000-year quarantine; RelOps can lift the quarantine once the instances are fixed.
[[File:Worker List.png|left]]


File a bug in Bugzilla under RelOps similar to this [https://bugzilla.mozilla.org/show_bug.cgi?id=1441820 bug example.]
3. '''If the last 5-6 or more tests are problematic, inspect a few of them.'''
 
[[File:Test name.png]]
 
 
'''Check public/logs/live_backing.log for errors on a few of the latest tests by going to <Test-Name> -> Run Artifacts -> public/logs/live_backing.log, as shown below:'''
 
[[File:Log location.png]]
 
 
4. '''Judging by the error logs, we can tell whether the machine is faulty (quarantine it if so). There is no black-and-white answer for this; it comes from experience. So far, we treat a machine as faulty when the above conditions are met and the error log terminates with error code -1 and a message like:'''
 
[[File:Error log.png]]
 
 
5. '''Quarantine all the instances for which all of the above is true by pressing the Quarantine button and leaving the default 1000 years as the expiration date, as shown in this [https://irccloud.mozilla.com/file/JTN97Erw/image.png image.]'''
 
 
6. '''File a bug in Bugzilla under RelOps, e.g.:''' https://bugzilla.mozilla.org/show_bug.cgi?id=1441820
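
The first check in the steps above (are a worker's most recent tasks all exception or failed?) can be sketched in Python. This is only an illustration under assumptions: the function name, the worker names, the sample data, and the threshold of 5 are hypothetical, and the task states are assumed to be listed newest first. It does not call any Taskcluster API, and it does not replace reading live_backing.log before quarantining.

```python
from typing import Sequence

# States treated as problematic in the steps above: exception (orange), failed (red).
PROBLEM_STATES = {"exception", "failed"}


def needs_investigation(recent_states: Sequence[str], threshold: int = 5) -> bool:
    """Return True if the newest `threshold` tasks all ended in a problem state.

    `recent_states` is assumed to be ordered newest first. The default
    threshold of 5 is an assumption based on the "5-6+" guidance.
    """
    if len(recent_states) < threshold:
        # Not enough history to judge the worker.
        return False
    return all(state in PROBLEM_STATES for state in recent_states[:threshold])


# Hypothetical sample data: worker name -> recent task states, newest first.
workers = {
    "worker-a": ["exception", "failed", "exception", "exception", "failed", "completed"],
    "worker-b": ["completed", "exception", "completed", "completed", "completed"],
}

candidates = [name for name, states in workers.items() if needs_investigation(states)]
print(candidates)
```

Workers returned here are only candidates; the log inspection in step 4 still decides whether a machine is actually faulty.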