Sheriffing/TBPL/DeveloperDocs
About the service
(what does it do, what is the purpose?)
Development documentation
(is there a design doc for development?)
Logical diagram
Physical diagram
Hardware
(what is going to be used here, physically?)
OS
(self-explanatory; note any exceptions)
Interface settings and IP allocations
VLANs
Private interfaces
Public interfaces
Network flows
(firewall needs?)
Load Balancing / Caching
Load balancing
(round robin? VIP? GLB?)
Health checks
(how will the app be checked for validity from the lb?)
Front end caching
(http caching)
Back end caching
(memcache etc)
Database
(what database server(s) - rw & ro, db name(s), db username)
Other requirements
(sphinx, redis, celery, etc)
File storage
(internally or externally mounted filesystems? where will static data for this service live?)
Automation
Cron jobs
(if the cron jobs run from an admin machine, please specify where they will run)
Puppet
(what modules/classes will be used?)
Monitoring
Nagios
(what checks will need to be set for this service?)
Ganglia
Update / Push procedure
(how to update the code, db, etc)
Common Troubleshooting
Backup / DR
(where are backups stored, if any? how can someone else fix this site in a disaster?)
Staging site
Deployment Bugs
Admin Contacts
Primary Admin:
Secondary Admin: