Confirmed users
1,031
edits
Nnethercote (talk | contribs) |
m (name fixup) |
||
(34 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
'''Project Uptime ran from April 2016 to June 2017.''' | |||
''Preventing and fixing crashes remains an important task, and there is ongoing work there. But this work is no longer being coordinated under Project Uptime. This page is being kept in place as a historical record.'' | |||
Project Uptime's goal is to reduce the crash rate of Firefox (desktop and mobile) and keep it down. This project is a Platform Engineering initiative that aims to extend and complement existing work relating to stability within Mozilla. | Project Uptime's goal is to reduce the crash rate of Firefox (desktop and mobile) and keep it down. This project is a Platform Engineering initiative that aims to extend and complement existing work relating to stability within Mozilla. | ||
Line 182: | Line 185: | ||
=== Improve pre-release coverage === | === Improve pre-release coverage === | ||
* Get the number of Nightly users growing again [pascal, marcia] | * Get the number of Nightly users growing again [pascal, marcia] | ||
* Provide test exercises for QA people and keen Nightly users to run regularly [marcia] | * <s>Provide test exercises for QA people and keen Nightly users to run regularly [marcia]</s> (One and Done has been [https://blog.mozilla.org/webqa/2016/09/28/one-and-done-is-being-decommissioned-october-2016/ decommissioned].) | ||
* ? Improve earlier channel populations to be more representative of the Release population [lonnen?] | * ? Improve earlier channel populations to be more representative of the Release population [lonnen?] | ||
Line 202: | Line 205: | ||
'''Extra:''' A [https://docs.google.com/presentation/d/1j-w1Mxgh7xQBPa57gdx_PSP2yf9yMJh8Oqu_7izReAY/edit#slide=id.p cross-variate analysis of FHR data], by Brendan Colloran, which may have useful techniques. | '''Extra:''' A [https://docs.google.com/presentation/d/1j-w1Mxgh7xQBPa57gdx_PSP2yf9yMJh8Oqu_7izReAY/edit#slide=id.p cross-variate analysis of FHR data], by Brendan Colloran, which may have useful techniques. | ||
=== Improve understanding of OOM causes === | === {{mdone|}} Improve understanding of OOM causes === | ||
* {{mdone|}} Do a large-scale analysis of memory reports from OOM crashes [njn] | * {{mdone|}} Do a large-scale analysis of memory reports from OOM crashes [njn] | ||
* Show important data from memory reports in crash-stats [njn] | * {{mdone|}} Show important data from memory reports in crash-stats [njn] | ||
* ? Discuss common OOM cases with partners [digitarald?] | * ? Discuss common OOM cases with partners [digitarald?] | ||
Line 215: | Line 218: | ||
</bugzilla | </bugzilla | ||
> | > | ||
=== Catch more crashes with static and dynamic analysis === | === Catch more crashes with static and dynamic analysis === | ||
* Add additional, regular ASan, TSan, and Valgrind jobs to TreeHerder [jseward, sfink, tcole] | * Add additional, regular ASan, TSan, and Valgrind jobs to TreeHerder [jseward, sfink, tcole] | ||
Line 251: | Line 253: | ||
> | > | ||
=== Prevent common C++ defects with better types and annotations === | === Prevent common C++ defects with better types and annotations === | ||
* Finish mozilla::Result<T,E> and use in SpiderMonkey [ | * Finish mozilla::Result<T,E> and use in SpiderMonkey [jandem] | ||
* Use MOZ_MUST_USE outside SpiderMonkey [njn] | * Use MOZ_MUST_USE outside SpiderMonkey [njn, wcpan] | ||
'''Extra:''' Bugs blocking {{bug|1289662}}: [Uptime] Low-level defect prevention and detection | '''Extra:''' Bugs blocking {{bug|1289662}}: [Uptime] Low-level defect prevention and detection | ||
Line 262: | Line 264: | ||
</bugzilla | </bugzilla | ||
> | > | ||
== H2 2016 Backlog == | == H2 2016 Backlog == | ||
Line 270: | Line 273: | ||
* Use [https://github.com/vvuk/crystalball Crystal Ball] (or other means) to identify how pre-Release crashes will manifest on Release | * Use [https://github.com/vvuk/crystalball Crystal Ball] (or other means) to identify how pre-Release crashes will manifest on Release | ||
'''Extra:''' Bugs blocking {{bug|1291174}}: [Uptime] Crash cluster ranking | |||
<bugzilla> | |||
{ | |||
"blocked": "1309114", | |||
"include_fields": "id, priority, summary, keywords, assigned_to" | |||
} | |||
</bugzilla | |||
> | |||
'''Extra:''' Relevant research papers. | '''Extra:''' Relevant research papers. | ||
* http://ieeexplore.ieee.org/document/5711013/ | * http://ieeexplore.ieee.org/document/5711013/ | ||
Line 341: | Line 352: | ||
</bugzilla | </bugzilla | ||
> | > | ||
= Data Sources and Useful Links = | = Data Sources and Useful Links = | ||
Line 352: | Line 364: | ||
* [https://docs.google.com/document/d/14nnlEiyl80qyirhIqzCHCDz9zo0R72wDorvmB8jctgs/ Fuzzing Explained - A Guide for Software Developers] | * [https://docs.google.com/document/d/14nnlEiyl80qyirhIqzCHCDz9zo0R72wDorvmB8jctgs/ Fuzzing Explained - A Guide for Software Developers] | ||
* Crash rate dashboards | * Crash rate dashboards | ||
** [https://crash-analysis.mozilla.com/release-mgmt/crash-report-tools/longtermgraph/ Crash rate history]: old ADI-based view | |||
** [https://telemetry.mozilla.org/crashes/ Stability dashboard]: new view, intended to replace arewestableyet.com [chutten] | |||
** [https://health.graphics/crashes ADI & Telemetry Crash Rate (Release)] [harald] | ** [https://health.graphics/crashes ADI & Telemetry Crash Rate (Release)] [harald] | ||
** [https://health.graphics/crashes/beta Telemetry Crash Rate (Beta)] [harald] | ** [https://health.graphics/crashes/beta Telemetry Crash Rate (Beta)] [harald] | ||
** [https://bsmedberg.github.io/telemetry-dashboard/crashes/office-dashboard.html Crashes per 1000 usage hours (Beta, DevEd, Nightly)] [bsmedberg] | ** [https://bsmedberg.github.io/telemetry-dashboard/crashes/office-dashboard.html Crashes per 1000 usage hours (Beta, DevEd, Nightly)] [bsmedberg] | ||
** {{bug|1324528}} - Bug about removing old dashboards. | |||
** {{bug|1324526}} - Bug about the new dashboards we want. | |||
* [https://dataviz.mozilla.org/views/PlatformVersionFirefoxADI/DesktopADIbyPlatform Firefox ADI dashboard] (requires Tableau/dataviz privileges to view) | * [https://dataviz.mozilla.org/views/PlatformVersionFirefoxADI/DesktopADIbyPlatform Firefox ADI dashboard] (requires Tableau/dataviz privileges to view) | ||
Line 372: | Line 388: | ||
North America (Pacific) | North America (Pacific) | ||
* Andrew McCreight, platform engineering | * Andrew McCreight, platform engineering | ||
* | * Lonnen, Socorro | ||
* David Baron, platform engineering | * David Baron, platform engineering | ||
Line 390: | Line 406: | ||
* Julian Seward, dynamic analysis | * Julian Seward, dynamic analysis | ||
* Sylvestre Ledru, static analysis, release management & stability | * Sylvestre Ledru, static analysis, release management & stability | ||
* Gabriele Svelto, Firefox engineering | |||
Europe (Eastern) | Europe (Eastern) | ||
Line 396: | Line 413: | ||
= Meetings = | = Meetings = | ||
We held meetings for several months, then switched to email updates because we deemed that to be more effective use of everybody's time. | |||
Here are the minutes of the meetings we have had. | |||
* [https://public.etherpad-mozilla.org/p/uptime20170410 2017-04-10 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20170327 2017-03-27 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20170313 2017-03-13 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20170227 2017-02-27 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20170213 2017-02-13 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20161219 2016-12-19 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime_hawaii_general Hawaii minutes: general] | |||
* [https://public.etherpad-mozilla.org/p/uptime_hawaii_windows Hawaii minutes: Windows third-party crashes] | |||
* [https://public.etherpad-mozilla.org/p/uptime_hawaii_generation Hawaii minutes: client-side crash report generation] | |||
* [https://public.etherpad-mozilla.org/p/uptime_hawaii_analysis Hawaii minutes: crash report analysis] | |||
* [https://public.etherpad-mozilla.org/p/uptime_hawaii_metrics Hawaii minutes: crash metrics] | |||
* [https://public.etherpad-mozilla.org/p/uptime20161121 2016-11-21 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20161107 2016-11-07 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20161024 2016-10-24 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20161010 2016-10-10 minutes] | |||
* [https://public.etherpad-mozilla.org/p/uptime20160926 2016-09-26 minutes] | * [https://public.etherpad-mozilla.org/p/uptime20160926 2016-09-26 minutes] | ||
* [https://public.etherpad-mozilla.org/p/uptime20160912 2016-09-12 minutes] | * [https://public.etherpad-mozilla.org/p/uptime20160912 2016-09-12 minutes] | ||
* [https://public.etherpad-mozilla.org/p/uptime20160829 2016-08-29 minutes] | * [https://public.etherpad-mozilla.org/p/uptime20160829 2016-08-29 minutes] | ||
Here is the meeting minutes template. | |||
* [https://public.etherpad-mozilla.org/p/uptime-template Minutes template] | |||
= Communication channels = | = Communication channels = | ||
* IRC: #uptime | * IRC: #uptime | ||
* Vidyo: [https://v.mozilla.com/flex.html?roomdirect.html&key=tGTDjguBXn29Ldaww7BCeVhp4M Uptime] | * Vidyo: [https://v.mozilla.com/flex.html?roomdirect.html&key=tGTDjguBXn29Ldaww7BCeVhp4M Uptime] | ||
* Email: There are two email lists, both of which are low-volume. | |||
** Public: The [https://mail.mozilla.org/listinfo/stability stability] list. This list should be used for the vast majority of discussion. Anybody can join this list. | |||
** Private: <code>projectuptime-team@mozilla.com</code>. This is only for private discussion, e.g. discussions relating to commercially-sensitive information. Please email nnethercote if you want to join this list. | |||
= Nightly crash triage = | = Nightly crash triage = | ||
''This documentation has been moved to [[NightlyCrashTriage]].'' | |||