Dumps/Snapshot hosts

From Wikitech

Revision as of 18:06, 19 January 2012

Snapshot (XML dumps generation) cluster information

Hardware

We have two mini snapshot clusters.

In Tampa:

  • snapshot1: operational, PowerEdge 1950, Ubuntu 10.04, 8GB RAM, 2 quad-core Xeons, 80GB HD
  • snapshot2: operational, PowerEdge 1950, Ubuntu 10.04, 8GB RAM, 2 quad-core Xeons, 80GB HD
  • snapshot3: operational, PowerEdge 1950, Ubuntu 10.04, 8GB RAM, 2 quad-core Xeons, 80GB HD
  • snapshot4: operational, PowerEdge R815, Ubuntu 10.04, 8GB RAM, 4 8-core Opterons, 2 80GB HDs

In D.C.:

  • snapshot1001: base install done, PowerEdge R815, Ubuntu 10.04, 64GB RAM, 4 8-core Opterons, 2 80GB HDs
  • snapshot1002: base install done, PowerEdge R410, Ubuntu 10.04, 16GB RAM, 2 6-core Xeons, 500GB HD
  • snapshot1003: waiting for drac issue fix, PowerEdge R410, Ubuntu 10.04, 16GB RAM, 2 6-core Xeons, 500GB HD
  • snapshot1004: waiting for drac issue fix, PowerEdge R410, Ubuntu 10.04, 16GB RAM, 2 6-core Xeons, 500GB HD

Ordinarily only one cluster runs dump jobs at a time; the other is on standby in case of failure. The two beefier servers (with 4 8-core CPUs) are dedicated machines for the English Wikipedia dumps; as with the other hosts, one of them is in operation and the other is on standby.

Currently running

  • snapshot1 -- running 3 worker processes for bigger wikis out of /backups-atg, via
    python ./worker wikidump.conf.bigwikis
  • snapshot2 -- running 4 processes for small wikis out of /backups-atg, via
    ./worker
  • snapshot3 -- runs adds/changes dumps from cron
  • snapshot4 -- running en wiki dumps via
    python ./worker.py --configfile wikidump.conf.enwiki enwiki
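
The invocations above can be sketched as a small launcher. This is only an illustration, not the script in use on these hosts: the helper names worker_command and start_workers are hypothetical, and it assumes that worker.py can be started several times from the same directory and that it picks a wiki on its own when none is named. Only the --configfile flag and the enwiki argument are taken from the snapshot4 command line.

```python
import subprocess


def worker_command(configfile, wiki=None):
    """Build the argument list for one dump worker.

    Mirrors the snapshot4 invocation; passing no wiki name is an
    assumption about how worker.py behaves, not something stated above.
    """
    cmd = ["python", "./worker.py", "--configfile", configfile]
    if wiki is not None:
        cmd.append(wiki)
    return cmd


def start_workers(configfile, count, wiki=None, dry_run=True):
    """Start `count` workers, or just return their commands if dry_run is set."""
    commands = [worker_command(configfile, wiki) for _ in range(count)]
    if dry_run:
        return commands
    # Each worker runs as its own process, started from the current directory.
    return [subprocess.Popen(cmd) for cmd in commands]
```

With dry_run left on, start_workers("wikidump.conf.bigwikis", 3) only returns the three command lists, so they can be checked before anything is started on a host.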