QA/Execution/Web Testing/roles/buildmaster: Difference between revisions

From MozillaWiki
Jump to navigation Jump to search
(Initial creation of page)
 
 
(46 intermediate revisions by 5 users not shown)
Line 1: Line 1:
=Build Master=
=Introduction=
The buildmaster role is assigned to a team member in rotation, each lasting two weeks. The role is intended to identify failures early, and to take action towards resolving the failures. This action will vary depending on the skills and knowledge of the current buildmaster, or the level of commitment the team has to the affected project.


==Role Definition==
=Monitoring Failures=
* edit jenkins desc with your name as buildmaster
Failures can be monitored in the following ways:
* build master is point of contact for open issues/bugs
* [https://webqa-ci.mozilla.com/ Jenkins dashboard]
* role includes filing bugs/issues, sending out emails, investigating issues
* [https://webqa-ci.mozilla.com/rssFailed RSS feed]
* does NOT include escalation paths, prioritizing fixes, following up with other teams
* [[QA/Execution/Web_Testing/Automation/Build_Notifications|Mailing list]]
* send daily email with list of generally prioritized github issues to be fixed, blocking bugs that were filed
* [http://widget.mibbit.com/?settings=0a187437b06ca9bcb0cff2a717e985c5&server=irc.mozilla.org&channel=mozwebqa&noServerTab=false&autoConnect=true IRC]
* check builds at least once per day (there are several methods for doing this: view the web dashboard, subscribe to RSS feeds, read e-mail alerts, watch IRC notifications. We could consider others using Jenkins plugins)
** investigate failures
** If it is a locator issue, if you have a question, if you wonder if the test is still valid or important then file a GitHub issue
** File bugs on projects based on info below, then contact the noted team members


==Support Tiers==
=Roles=
==Owner==
* in Jenkins, on the failing job's output, you'll see "This build was not claimed. Claim it."  Click on Claim it and add your relevant details.
* you'll now be the Assignee, and you and your investigation details should now show up next to the failing build, on https://webqa-ci.mozilla.com/claims/
* if you know or suspect the failure's root cause does or will affect other jobs/projects, send an email to either the public (preferred) or internal mailing list, as appropriate
* once/when the build passes, your claim and the failing job will automatically be removed from the [https://webqa-ci.mozilla.com/claims/ Claims view]


===Tier 1===
==Co-owner==
* Marketplace
* serves as a "backup" in case of vacation/PTO
* AMO
* both Owner and Co-owner should coordinate to help ensure continued coverage
* Mozilla.org
* prompt the Owner with an offer for help, when there doesn't appear to be traction/visibility


===Tier 2===
=Rotation=
* SUMO
{{:QA/Execution/Web Testing/roles/buildmaster/Rotation}}
* Socorro


===Tier 3===
=Known Issues=
* BIDPOM
For the latest known issues check [https://public.etherpad-mozilla.org/p/webqa-buildmaster this etherpad]. If you are the current buildmaster please try to keep this pad updated. It helps when it comes to hand the role onto the next buildmaster, and can avoid duplicating effort investigating failures.
* Affiliates
* Mozillians
* Moztrap
* One and Done


==Projects==
=New Failures=
When a new failure is found, details should be added to [https://public.etherpad-mozilla.org/p/webqa-buildmaster this etherpad] and an issue should be raised in the appropriate place (see projects section for details). This is the minimum expected of the buildmaster, however if you have the time and skills (or time to learn the skills), it's a good idea to try to replicate the failure locally. If the failure is clearly something that should be fixed in the test suite, then it's also appreciated if you can submit a pull request with a fix.


===SUMO===
=Projects=
* low priority
 
* fails: Mythmon, rdalal, r1cky [who is leaving SUMO]
===Add-ons (AMO)===
* contact: IRC sumodev
* Priority: High
* IRC: [irc://irc.mozilla.org/#remora #remora]
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?product=addons.mozilla.org&resolution=--- Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?product=addons.mozilla.org Raise Bug]
* Contacts:
*# Krupa Raj [:krupa]
*# Mathieu Agopian [:magopian]
*# Wil Clouser [:clouserw]
 
===bidpom===
* towards low priority
* fail: john morrison [jrgm] if infrastructure related (time outs, buttons not loading, etc.); bob or davehunt are the ones to fix
* IRC: [irc://irc.mozilla.org/#identity #identity]
* known bug file bug and also need info him; esp if you know who checked in the change who made it fail
* known bug file bug and also need info him; esp if you know who checked in the change who made it fail
** https://bugzilla.mozilla.org/enter_bug.cgi?product=support.mozilla.org
** https://github.com/mozilla/bidpom/issues
 
===Bouncer===
* IRC: [irc://irc.mozilla.org/#bouncer #bouncer]
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?product=Webtools&component=Bouncer&resolution=---&list_id=12662997 Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?product=Webtools&component=Bouncer Raise Bug]
* Contacts:
*# Jeremy Orem [:oremj]


===One and Done===
===Hello (Loop)===
* low priority
* IRC: [irc://irc.mozilla.org/#loop #loop]
* bob
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?product=Loop&component=Client&resolution=---&list_id=12223283 Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?product=Loop&component=Client Raise Bug]
* https://bugzilla.mozilla.org/enter_bug.cgi?component=One%20and%20Done&product=Mozilla%20QA
* Releases: https://github.com/mozilla/loop-client
* Contacts:
*# Mark Banner [:Standard8]
*# Dan Mosedale [:dmose]


===Socorro===
===mozillians.org===
* low priority, no rush
* medium
* tests will be pared down to match less upcoming features
* IRC: [irc://irc.mozilla.org/#commtools #commtools]
* fails: mbrandt and rhelmer,
* Contacts: nemo-yiannis
* contact: IRC breakpad
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?order=Importance&list_id=12247087&resolution=---&classification=Other&query_format=advanced&bug_status=UNCONFIRMED&bug_status=NEW&bug_status=ASSIGNED&bug_status=REOPENED&component=Phonebook&product=Participation%20Infrastructure Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?product=Participation%20Infrastructure&component=Phonebook Raise Bug]
* known bug file bug and also need info him; esp if you know who checked in the change who made it fail
** https://bugzilla.mozilla.org/enter_bug.cgi?product=Socorro


===QMO===
===MozTrap===
* low priority
* lower priority
* tests not in jenkins
* IRC: [irc://irc.mozilla.org/#moztrap #moztrap]
* could make a public job for contributors to workon or fix
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?product=Mozilla%20QA&component=MozTrap&resolution=---&list_id=12235937 Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?component=MozTrap&product=Mozilla%20QA Raise Bug]
* https://bugzilla.mozilla.org/enter_bug.cgi?product=quality.mozilla.org
* Contacts:
*# Peter Bengtsson [:peterbe]
*# Cameron Dawson [:camd]


===mozwebqa dashboard===
===mozwebqa dashboard===
* low priority
* low priority
* fail: bob
* fail: bsilverberg / (justinpotts?)
* IRC: mozwebqa
* IRC: mozwebqa
* https://github.com/mozilla/mozwebqa-dashboard/issues
* https://github.com/mozilla/mozwebqa-dashboard/issues


===bidpom===
===One and Done===
* towards low priority
* low priority
* fail: john morrison [jrgm] if infrastructure related (time outs, buttons not loading, etc.); bob or davehunt are the ones to fix
* bsilverberg
* IRC: mozwebqa
* https://bugzilla.mozilla.org/enter_bug.cgi?component=One%20and%20Done&product=Mozilla%20QA
* known bug file bug and also need info him; esp if you know who checked in the change who made it fail
 
** https://github.com/mozilla/bidpom/issues
===Crash Stats (Socorro)===
* Priority: Low
* IRC: [irc://irc.mozilla.org/#breakpad #breakpad]
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?product=Socorro&resolution=--- Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?product=Socorro Raise Bug]
* Contacts:
*# Matt Brandt [:mbrandt]
*# Peter Bengtsson [:peterbe]
 
===Snippets===
* IRC: [[irc://irc.mozilla.org/#snippets #snippets]
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?product=Snippets&resolution=---&list_id=12663032 Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?product=Snippets Raise Bug]
* Contacts:
*# Giorgos Logiotatidis [:giorgos]
 
===Support (SUMO)===
* Priority: Low
* IRC: [irc://irc.mozilla.org/#sumodev #sumodev]
* Bugs: [https://bugzilla.mozilla.org/buglist.cgi?product=support.mozilla.org&resolution=--- Open Bugs], [https://bugzilla.mozilla.org/enter_bug.cgi?product=support.mozilla.org Raise Bug]
* Contacts:
*# Rebecca Billings [:rbillings]
*# Michael Cooper [:mythmon]
*# Rehan Dalal [:rdalal]
 
=FAQ=


===moztrap===
===Which bugs are currently open that correspond to known test failures?===
* lower priority
The buildmaster maintains an [https://etherpad.mozilla.org/webqa-buildmaster etherpad] which lists bugs that currently impact jobs.
* fails: peterbe/camd
* IRC: moztrap
* file a bug
*** https://bugzilla.mozilla.org/enter_bug.cgi?component=MozTrap&product=Mozilla%20QA


===mozilla.org===
===Who do I contact if the issue is related to Persona?===
* medium-high priority
If you trace an issue to Persona (the sign-on service) you should contact :jrgm in #persona. You can also raise issues in the project's [https://github.com/mozilla/persona/issues GitHub repository].
* fails: pmac/jgmize
* IRC: www
* file bug
** https://bugzilla.mozilla.org/enter_bug.cgi?product=www.mozilla.org


===marketplace===
===Who do I contact if the issue is related to Firefox Accounts?===
* high priority
If you trace an issue to Firefox Accounts you should contact one of the following in #fxa: Shane Tomlinson (stomlinson), Zachary Carter (zaach), or Vladislav Filippov (vladikoff).
* fails: krupa 1st / 2nd david durst EST/ wil clouser 3rd
* IRC: #marketplace
* file a bug - if we know there's been a regression, if you can reproduce manually and you can correlate with what you see in automation, or if it's infrastructure related
** https://bugzilla.mozilla.org/enter_bug.cgi?product=Marketplace


===amo===
===Why is the failure only happening on Sauce Labs?===
* high priority
It could be that the failure is only presenting itself on specific browser window sizes. Sauce Labs uses virtual machines with screen resolutions that may differ from our internal Selenium Grid. You could try specifying a browser window size to make the results consistent, or at least consider how the size of the browser might affect the tests that are failing.
* fails: 1st krupa / 2nd mathieu agopian FRANCE / 3rd wil clouser
* file a bug
** https://bugzilla.mozilla.org/enter_bug.cgi?product=addons.mozilla.org

Latest revision as of 17:49, 21 July 2016

Introduction

The buildmaster role is assigned to a team member in rotation, each lasting two weeks. The role is intended to identify failures early, and to take action towards resolving the failures. This action will vary depending on the skills and knowledge of the current buildmaster, or the level of commitment the team has to the affected project.

Monitoring Failures

Failures can be monitored in the following ways:

Roles

Owner

  • in Jenkins, on the failing job's output, you'll see "This build was not claimed. Claim it." Click on Claim it and add your relevant details.
  • you'll now be the Assignee, and you and your investigation details should now show up next to the failing build, on https://webqa-ci.mozilla.com/claims/
  • if you know or suspect the failure's root cause does or will affect other jobs/projects, send an email to either the public (preferred) or internal mailing list, as appropriate
  • once/when the build passes, your claim and the failing job will automatically be removed from the Claims view

Co-owner

  • serves as a "backup" in case of vacation/PTO
  • both Owner and Co-owner should coordinate to help ensure continued coverage
  • prompt the Owner with an offer for help, when there doesn't appear to be traction/visibility

Rotation

The Web QA Buildmaster Rotation page contains the past and upcoming schedule.

These entries are in reverse chronological order.

  • 2016-05-05 - 2016-05-19 - stephend
  • 2016-04-21 - 2016-05-05 - mbrandt
  • 2016-04-07 - 2016-04-21 - davehunt
  • 2016-03-24 - 2016-04-07 - rbillings
  • 2016-03-10 - 2016-03-24 - krupa

Known Issues

For the latest known issues check this etherpad. If you are the current buildmaster please try to keep this pad updated. It helps when it comes to hand the role onto the next buildmaster, and can avoid duplicating effort investigating failures.

New Failures

When a new failure is found, details should be added to this etherpad and an issue should be raised in the appropriate place (see projects section for details). This is the minimum expected of the buildmaster, however if you have the time and skills (or time to learn the skills), it's a good idea to try to replicate the failure locally. If the failure is clearly something that should be fixed in the test suite, then it's also appreciated if you can submit a pull request with a fix.

Projects

Add-ons (AMO)

  • Priority: High
  • IRC: #remora
  • Bugs: Open Bugs, Raise Bug
  • Contacts:
    1. Krupa Raj [:krupa]
    2. Mathieu Agopian [:magopian]
    3. Wil Clouser [:clouserw]

bidpom

  • towards low priority
  • fail: john morrison [jrgm] if infrastructure related (time outs, buttons not loading, etc.); bob or davehunt are the ones to fix
  • IRC: #identity
  • known bug file bug and also need info him; esp if you know who checked in the change who made it fail

Bouncer

Hello (Loop)

mozillians.org

MozTrap

mozwebqa dashboard

One and Done

Crash Stats (Socorro)

Snippets

Support (SUMO)

  • Priority: Low
  • IRC: #sumodev
  • Bugs: Open Bugs, Raise Bug
  • Contacts:
    1. Rebecca Billings [:rbillings]
    2. Michael Cooper [:mythmon]
    3. Rehan Dalal [:rdalal]

FAQ

Which bugs are currently open that correspond to known test failures?

The buildmaster maintains an etherpad which lists bugs that currently impact jobs.

Who do I contact if the issue is related to Persona?

If you trace an issue to Persona (the sign-on service) you should contact :jrgm in #persona. You can also raise issues in the project's GitHub repository.

Who do I contact if the issue is related to Firefox Accounts?

If you trace an issue to Firefox Accounts you should contact one of the following in #fxa: Shane Tomlinson (stomlinson), Zachary Carter (zaach), or Vladislav Filippov (vladikoff).

Why is the failure only happening on Sauce Labs?

It could be that the failure is only presenting itself on specific browser window sizes. Sauce Labs uses virtual machines with screen resolutions that may differ from our internal Selenium Grid. You could try specifying a browser window size to make the results consistent, or at least consider how the size of the browser might affect the tests that are failing.