Devops/monitoring-alerting: Difference between revisions

From MozillaWiki
Mozilla Foundation applications are monitored and measured in a number of systems:
* Opsview, a Nagios clone with a much friendlier interface.
:: * Monitors and alerts when servers in load balancers are unhealthy
:: * Monitors and alerts on uptime/downtime of overall endpoints, such as https://webmaker.org
:: * Monitors and alerts on database utilization and downtime.


::  '''Important Opsview Links'''
:: [http://opsview.mofoprod.net:3000/viewport Public Status Page]
:: [http://opsview.mofoprod.net:3000/status/service?filter=unhandled&order=state_desc&order=host&order=service&includeunhandledhosts=1 Current Unhandled Alerts (Login required)]
:: [http://opsview.mofoprod.net:3000/event Recent Alerts in Opsview]


* New Relic monitoring (Login Required)
:: * Watching application response time on both the browser and server side
:: * Watching database and web server utilization, transactions, timings, and throughput
:: * Watching load balancer (ELB) metrics
:: * Performing server-side and client-side tracing of long-running transactions
:: * Overall endpoint monitoring, such as https://webmaker.org
:: * Watching cache server utilization and metrics
:: * Watching Elasticsearch server utilization and metrics
:: * Watching Mongo server utilization and metrics
:: * Marks and compares new/old deployed versions of software


::  '''Important New Relic Links'''
:: [https://rpm.newrelic.com/accounts/255689/custom_dashboards/1695/pages New Relic Dashboards]
:: [https://rpm.newrelic.com/accounts/255689/incidents Recent New Relic Alerts]
:: [https://rpm.newrelic.com/accounts/255689/applications New Relic Applications Overview]
:: [https://rpm.newrelic.com/accounts/255689/recent_events?scope=applications&type=deployment Recent Deployments]
:: [https://rpm.newrelic.com/accounts/255689/browser Browser / Front-end Performance Overview]


* Log monitoring with [https://loggins.mofoprod.net Loggins (Kibana) (Login Required)]

Revision as of 04:57, 31 May 2014

Mozilla Foundation Monitoring & Alerting

===== TLDR =====

Monitoring

Mozilla Foundation applications are monitored and measured in a number of systems:

  • Opsview, a Nagios clone with a much friendlier interface.
      * Monitors and alerts when servers in load balancers are unhealthy
      * Monitors and alerts on uptime/downtime of overall endpoints, such as https://webmaker.org
      * Monitors and alerts on database utilization and downtime.

      Important Opsview Links
      Public Status Page
      Current Unhandled Alerts (Login required)
      Recent Alerts in Opsview

  • New Relic monitoring (Login Required)
      * Watching application response time on both the browser and server side
      * Watching database and web server utilization, transactions, timings, and throughput
      * Watching load balancer (ELB) metrics
      * Performing server-side and client-side tracing of long-running transactions
      * Overall endpoint monitoring, such as https://webmaker.org
      * Watching cache server utilization and metrics
      * Watching Elasticsearch server utilization and metrics
      * Watching Mongo server utilization and metrics
      * Marks and compares new/old deployed versions of software

      Important New Relic Links
      New Relic Dashboards
      Recent New Relic Alerts
      New Relic Applications Overview
      Recent Deployments
      Browser / Front-end Performance Overview
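
The endpoint uptime/downtime monitoring listed above (for example against https://webmaker.org) can be illustrated with a small Nagios-style check. This is only a sketch, not the check Mozilla Foundation actually runs: the function names `classify`, `check_endpoint`, and `main`, and the 2-second and 5-second thresholds, are assumptions made for illustration. The exit codes follow the standard Nagios plugin convention (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN), which Nagios-compatible systems such as Opsview use to decide whether to alert.

```python
import time
import urllib.error
import urllib.request

# Nagios plugin exit codes (standard plugin convention).
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def classify(status, elapsed, warn_after=2.0, crit_after=5.0):
    """Map an HTTP status and response time to a (state, message) pair.

    status is None when the request failed before any HTTP response arrived.
    The thresholds are illustrative assumptions, not values from this page.
    """
    if status is None:
        return CRITICAL, "no HTTP response"
    if status >= 500:
        return CRITICAL, f"HTTP {status}"
    if status >= 400:
        return WARNING, f"HTTP {status}"
    if elapsed >= crit_after:
        return CRITICAL, f"HTTP {status} in {elapsed:.2f}s"
    if elapsed >= warn_after:
        return WARNING, f"HTTP {status} in {elapsed:.2f}s"
    return OK, f"HTTP {status} in {elapsed:.2f}s"


def check_endpoint(url, timeout=10.0):
    """Fetch url once and classify the result."""
    start = time.monotonic()
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            status = resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with a 4xx/5xx status.
        status = err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout, and similar.
        status = None
    return classify(status, time.monotonic() - start)


def main(argv):
    """Print a one-line status and return the Nagios exit code."""
    target = argv[1] if len(argv) > 1 else "https://webmaker.org"
    state, message = check_endpoint(target)
    print(f"{LABELS[state]} - {target}: {message}")
    return state
```

To use this as a plugin script, one would import `sys` and end the file with `sys.exit(main(sys.argv))`, so that the process exit code carries the state back to the monitoring system. Keeping `classify` separate from the network call makes the alerting rules easy to test without touching a live endpoint.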