Backup procedures
From Wikitech
This page is an inventory of our backups for both clusters data and office data.
Some definitions:
- On-site
- here means the backup is in the same physical location as the master, but on a different machine.
- Off-site
- means the backup is in a physically separate location. In most cases it's nice to have both!
- Red cells
- denote a lacking area which we want/need.
Contents |
Service backup state
| Service | Master | On-site | Off-site | Notes | Approx. Space Needed |
|---|---|---|---|---|---|
Wiki primary data | |||||
| Wiki main databases | PMTPA db* | PMTPA slave replication | EQIAD db slaves & snapshots / ESAMS toolserver replication | http://noc.wikimedia.org/dbtree/ | |
| External storage | PMTPA db* | PMTPA slave replication | EQIAD ES1004 replicates ms03; ES1001-3 is a copy of rc1 & cl1-cl21 /ESAMS toolserver replication | ||
| Wiki data dumps | PMTPA dataset2 | Gluster storage (labstore1 under /publicdata-project directory) Last 5 good only. | EQIAD: dataset1001 (via rsync). All public data: dumps.wikimedia.your.org. Last 5 good dumps: wikipedia.c3sl.ufpr.br (ftp/http/rsync). | See Dumps/Mirror status for more info. | 6T (last 5 good)/ 29T (all) |
Images and media | |||||
| Uploads | PMTPA ms7 | PMTPA ms8 | EQIAD ms1002 / ESAMS ms6 (partial) | 18T | |
| Thumbs | PMTPA ms5 | EQIAD ms1004 | 8T | ||
Software development and configuration | |||||
| MediaWiki config | PMTPA NFS | tridge | Subversion | ||
| Apache config | PMTPA NFS | tridge | Automatic | rsync | |
| Subversion | Formey | tridge | 3 GB | ||
| Gerrit | Manganese | Formey | Databases on db9/10 | 3 GB | |
| Bugzilla data | PMTPA db9 | db10 replication | EQIAD db1008/db1025 & snapshots | 1 GB | |
| Bugzilla config | PMTPA db9 | db10 replication | EQIAD db1008/db1025 & snapshots | 1.5 GB | |
| Bugzilla frontend | PMTPA isidore | daily rsync to Tridge | |||
| Wikitech wiki | linode | PMTPA tridge; ESAMS ? | |||
Communications | |||||
| OTRS data | PMTPA db9 | db10 replication | EQIAD db1008/db1025 & snapshots | 57 GB | |
| OTRS config | PMTPA bart | 19 GB | |||
| blogs frontend data | PMTPA singer | Daily backups to Tridge | |||
| bugzilla | PMTPA isidore | Daily backups to Tridge | |||
| IMAP mail | PMTPA sanger | PMTPA mchenry, Daily backups to tridge | see Mail#Backups | 32 GB | |
| Mailing lists | ESAMS lily - decommissioned | EQIAD sodium | PMTPA mchenry | See Mailing lists#Backups | 50 GB |
| DNS | PMTPA ns0, ns1 | checkin to sockpuppet SVN | ESAMS ns2 | In SVN now | |
| Google docs | Google :P | ad-hoc downloads | |||
Soft data | |||||
| Search databases | PMTPA various | regeneratable | |||
| HTTP logs | PMTPA locke | emery | /a/squid | ||
| MediaWiki logs | PMTPA NFS | tridge | /home/wikipedia/logs | ||
Fundraising | |||||
| Fundraising Front-End | EQIAD aluminium | software:svn logs:storage3->tridge |
puppet:files/misc/scripts/offhost_backups | 5GB | |
| Fundraising Databases | EQIAD db1008 | db1025 replication + snapshots | PMTPA storage3 replication, snapshots, dumps tridge daily rsync of dumps |
puppet:files/misc/scripts/dump_fundraisingdb | 1TB (15GB/day) |
| Payment Processing | software:svn db+logs:PMTPA |
db:mysql replication db+logs:encrypt+copy daily to tridge |
db+logs:replicate to silicon | payments cluster | 100GB(?) |
| Impression Logs | PMTPA storage3 | hume 90 days max see puppet:private/files/misc/fundraising/impression_log_rotator |
1.5TB year | ||
Misc data | |||||
| Office fileserver | OFFICE fileserver | weekly rsync to USB drive | rsync to tridge not running | 2 GB(?) | |
| Office workstations | OFFICE * | currently no backups done (can be enabled, if needed; MacOS only) | |||
| Office laptops | OFFICE * | Time Machine backups to usb drive on Imac (MacOS only) | |||
| Server home directories | PMTPA nfs1 / nfs2 | tridge | Daily backups to Tridge | 30 GB | |