Server admin log/Archive 21

From Wikitech

May 15

  • 22:34 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Enable CentralAuth logging to file'
  • 22:13 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php
  • 21:09 logmsgbot: raindrift synchronizing Wikimedia installation... : PageTriage update
  • 21:05 logmsgbot: raindrift synchronizing Wikimedia installation... : PageTriage update
  • 19:40 logmsgbot: aaron synchronized wmf-config/swift.php 'disabled new hook for now.'
  • 19:35 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Add wfDebugLog call'
  • 19:31 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Add wfDebugLog call'
  • 19:30 logmsgbot: reedy synchronized wmf-config/CommonSettings.php '$wgDebugLogGroups[updateTranstagOnNullRevisions] = udp://10.0.5.8:8420/updateTranstagOnNullRevisions'
  • 19:26 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Only conditionally disable updateTranstagOnNullRevisions hook. Debugging to come'
  • 17:45 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/SwiftFileBackend.php
  • 17:41 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/SwiftFileBackend.php
  • 17:35 logmsgbot: aaron synchronized wmf-config/swift.php 'Use new thumb purge hook for testwikis'
  • 16:54 RobHalsell: updated apache config for wiki-pedia.org, seems the bot doesn't spam that anymore =[
  • 16:36 mutante: srv app servers max. uptime with older kernel down to ~120 days after another bunch of upgrades
  • 16:34 RobHalsell: updating dns for wiki-pedia.org
  • 12:20 hashar: deployment-prep replaced most occurrences of /mnt/upload to /mnt/upload6
  • 10:37 apergos: on db39 dropped triggers pt_osc_elwiki_recentchanges ins, del, upd, they were preventing all elwiki edits except bot edits with the complaint Table 'elwiki._recentchanges_new' doesn't exist ... binasher, doublecheck me please?
  • 09:24 mutante: srv278 - still has issues as in reopened RT #24 - upgrading kernel anyway
  • 03:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'update wgUploadNavigationUrl on all cs wikis'
  • 02:35 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Tue May 15 02:35:53 UTC 2012
  • 02:23 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Tue May 15 02:23:47 UTC 2012
  • 01:09 logmsgbot: asher synchronized wmf-config/db.php 'returning db31 as an s4 slave'
  • 01:05 logmsgbot: aaron synchronized php-1.20wmf3/extensions/SwiftCloudFiles/php-cloudfiles-wmf/cloudfiles.php 'deployed f20e752630575f8384083f0ad0401e250c8babf5'
  • 01:00 binasher: shutting down mysql on db31, then rebooting
  • 00:59 logmsgbot: asher synchronized wmf-config/db.php 'pulling db31 from s4 for kernel upgrade'
  • 00:58 binasher: new s4 master position - MASTER_LOG_FILE='db51-bin.000114', MASTER_LOG_POS=1772578
  • 00:57 logmsgbot: asher synchronized wmf-config/db.php 'new s4 master'
  • 00:55 logmsgbot: asher synchronized wmf-config/db.php 'setting s4 to read-only, switching master to db51'
  • 00:54 binasher: preparing to rotate s4 master from db31 to db51
  • 00:48 logmsgbot: reedy synchronized wmf-config/CommonSettings.php
  • 00:48 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bring in numerous shell requests from gerrit'
  • 00:48 binasher: rebooting db51 for kernel upgrade, prior to promoting to s4 master
  • 00:47 logmsgbot: asher synchronized wmf-config/db.php 'pulling db51 from s4 for kernel upgrade'
  • 00:01 binasher: just completed an online schema change for commonswiki.recentchanges in prod. woo!

May 14

  • 22:02 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
  • 21:08 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Live hack out updateTranstagOnNullRevisions'
  • 20:08 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mediawikiwiki to 1.20wmf3
  • 19:38 logmsgbot: reedy synchronizing Wikimedia installation... : Running scap to make sure everything is ok...
  • 19:36 logmsgbot: reedy synchronized php-1.20wmf3/cache/l10n/ 'Resync localisation cache'
  • 19:26 logmsgbot: reedy synchronized live-1.5/ 'Push live-1.5 new symlinks'
  • 19:24 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki to 1.20wmf3
  • 19:16 logmsgbot: reedy synchronized php-1.20wmf3/cache/trusted-xff.cdb
  • 19:14 logmsgbot: reedy synchronized wmf-config/ExtensionMessages-1.20wmf3.php
  • 19:13 logmsgbot: reedy synchronized php-1.20wmf3/extensions/ 'Push extensions out properly'
  • 19:11 binasher: resyncing cluster22 from es1002 to es1004
  • 19:02 logmsgbot: reedy synchronized php-1.20wmf3/LocalSettings.php 'Use newer version'
  • 19:01 logmsgbot: reedy synchronized php-1.20wmf2/LocalSettings.php 'Use newer version'
  • 18:58 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: test2wiki to 1.20wmf3
  • 18:57 logmsgbot: reedy synchronized php-1.20wmf3/cache/l10n/ 'Syncing localisation cache files'
  • 18:49 Ryan_Lane: added OATHAuth to components list for MediaWiki Extensions product in bugzilla
  • 18:43 Ryan_Lane: switching sessions back to memcached for labsconsole
  • 18:42 Ryan_Lane: adding OATHAuth to labsconsole
  • 18:40 Ryan_Lane: completed upgrade to 1.20wmf2 on labsconsole
  • 18:30 Ryan_Lane: upgrading labsconsole to 1.20wmf2
  • 18:26 logmsgbot: reedy synchronized php-1.20wmf3 'Initial pushing of php-1.20wmf3 files to apaches'
  • 18:12 Reedy: Killing old php-1.20wmf1 directories from apaches to save full disks
  • 13:48 mutante: copying outdated wikiversions.dat/.cdb files from /home to /usr/local on spence, which fixes check_job_queue (thanks jeremyb)
  • 13:07 mutante: opening a bz bug for check_job_queue issue related to CommonSettings.php BZ:36835
  • 07:43 mutante: still upgrading/rebooting a couple srv (API) application servers with long uptime
  • 06:22 apergos: restarted lucene search on search1016; it had stopped doing anything useful (see ganglia graphs, also nothing written to logs)
  • 02:22 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Mon May 14 02:22:09 UTC 2012

May 13

  • 02:24 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Sun May 13 02:24:51 UTC 2012

May 12

  • 02:22 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Sat May 12 02:22:18 UTC 2012

May 11

  • 22:10 logmsgbot: preilly synchronized wmf-config 'add wikimedia to zero image disable list fix header'
  • 21:52 logmsgbot: preilly synchronized wmf-config 'add wikimedia to zero image disable list'
  • 19:49 Reedy: ran apache-graceful-all
  • 19:42 RobH: apache restarted by puppet run on srv286
  • 19:31 RobH: shutting down srv286 and srv286 for power rebalancing
  • 19:23 RobH: srv260 and srv261 back in business
  • 19:10 RobH: srv260 & srv261 shutting down for power rebalancing within the rack
  • 18:33 notpeter: shutting down search 13-20 for hd upgrades
  • 18:05 maplebed: swift: deleting the unsharded version of all sharded containers
  • 18:03 logmsgbot: catrope synchronized php-1.20wmf2/extensions/UploadWizard/ 'Deploy 4b5df1a1151ac80e309d396102e5e2a8d0c27ccb'
  • 17:46 maplebed: deleted wikipedia-de-local-thumb container from swift. the sharded version is currently being used.
  • 15:33 mutante: adding DNS entries for analytics hosts in new vlan 1121 (10.64.21.0/24), hosts starting at .101 to match names analytics1001 = .101 and ++
  • 15:03 mutante: mw62 -unless somebody was on that right now it died. mgmt also just Create Instance Error
  • 14:06 mutante: kernel upgrading / rebooting srv servers where uptime > 200 d order by uptime desc limit 1
  • 13:12 mutante: installing package upgrades on pdf1-3 (and installed requested indic fonts via new puppet role class)
  • 11:39 mutante: starting ms-be swift-container-auditors every once in a while
  • 11:35 mutante: stat1 - installed new kernel, but waiting to reboot. schedule with aotto
  • 11:24 mutante: upgrading packages/kernel on hooper, rebooting (Blog,Etherpad,Racktables)
  • 09:21 mutante: ekrem was close to running out of disk again. logrotated apache logs, changed config to: size 512M,rotate 3
  • 08:58 mutante: package upgrades on ekrem (IRC server, WAP, Apple dict...)
  • 08:51 mutante: rebooting marmontel (blog)
  • 08:48 mutante: upgrading apache/mysql/kernel on marmontel (blog)
  • 02:20 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Fri May 11 02:20:39 UTC 2012
  • 02:00 RoanKattouw: Started Apache back up on srv200, done debugging
  • 01:58 logmsgbot: catrope synchronized php-1.20wmf2/extensions/UserDailyContribs/UserDailyContribs.hooks.php 'Deploy 3c45831ffe1817f3dc18f06644db46b1b74173e7'
  • 01:17 RoanKattouw: Stopping Apache on srv200 so I can use it as my guinea pig for segfault debugging
  • 00:56 logmsgbot: tstarling synchronized php-1.20wmf2/includes/User.php 'header log'
  • 00:49 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
  • 00:48 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
  • 00:40 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
  • 00:30 Tim: restarted socat on fenari so that fatal.log is reopened
  • 00:29 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php 'removed logging hack tweaks.'
  • 00:29 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php 'logging hack tweaks.'
  • 00:27 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php 'removed some temp logging'
  • 00:27 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
  • 00:16 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
  • 00:00 binasher: pulling cp1044 from lvs for testing

May 10

  • 23:38 logmsgbot: reedy synchronized php-1.20wmf2/extensions/LiquidThreads/classes/Hooks.php 'Updating to master'
  • 22:38 logmsgbot: catrope synchronized php-1.20wmf1/.git 'Make Special:Version show the correct commit now that I have fixed the weird repo state'
  • 22:37 logmsgbot: catrope synchronized php-1.20wmf2/.git 'Make Special:Version show the correct commit now that I have fixed the weird repo state'
  • 22:36 RoanKattouw: Cleaned up weird git repo states on fenari in php-1.20wmf1 and php-1.20wmf2
  • 22:04 maplebed: swift: deleting the unsharded wikipedia-de thumb container contents (the sharded version is currently serving traffic)
  • 19:51 notpeter: rebooting db29 to do a test install of precise
  • 19:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36420 - Wikipedia namespace alias for sr.wp'
  • 19:02 logmsgbot: catrope synchronizing Wikimedia installation... : ArticleFeedbackv5 updates
  • 18:57 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36694 - Set wgSitename on srwikisource'
  • 18:43 LeslieCarr: restarting mobile varnish
  • 18:33 LeslieCarr: reloaded and purged cache of mobile varnish
  • 18:03 notpeter: starting innobackupex from db10 to blondel
  • 17:39 notpeter: pushing out new zone files. only minor changes
  • 16:47 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'showupdatemarker on enwiki tooooo'
  • 03:04 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable show update markers on dewiki'
  • 02:14 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Thu May 10 02:14:03 UTC 2012
  • 01:07 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'https://gerrit.wikimedia.org/r/#/c/7133/'
  • 00:11 logmsgbot: catrope synchronized php-1.20wmf2/extensions/UploadWizard 'Deploy b45437b6e09018dacfc78c8e4fa822a917858b2d / 62631485ba36f973c0d4a850ef494a8f84c4c86b'
  • 00:11 logmsgbot: catrope synchronized php-1.20wmf1/extensions/UploadWizard 'Deploy b45437b6e09018dacfc78c8e4fa822a917858b2d / 62631485ba36f973c0d4a850ef494a8f84c4c86b'
  • 00:06 logmsgbot: preilly synchronized wmf-config 'remove MF passwords'
  • 00:01 logmsgbot: preilly synchronized wmf-config 'remove MF passwords'

May 9

  • 23:33 notpeter: taking down search20 to do precise test-install
  • 23:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable Translate on outreachwiki'
  • 23:25 Reedy: Created Translate tables on outreachwiki
  • 22:49 Reedy: ExtensionDistributor fixed
  • 22:32 Reedy: Debugging ExtensionDistributor being broken. Likely to show more debug output on mw.org if you attempt to use it (though, it wouldn't give you what you wanted anyway)
  • 22:15 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
  • 21:53 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/file/LocalFile.php 'deployed b9ac85cbf304a65d900cda00fafe53bf82d7a227'
  • 21:53 logmsgbot: aaron synchronized php-1.20wmf2/includes/SiteStats.php 'deployed b9ac85cbf304a65d900cda00fafe53bf82d7a227'
  • 20:52 LeslieCarr: done
  • 20:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bump memory limit to 128MB'
  • 19:39 Ryan_Lane: updating OpenStackManager on virt0 to master again
  • 19:16 Ryan_Lane: updating OpenStackManager on virt0 to master
  • 18:54 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/file/LocalFile.php 'deployed fa1a8d5119e1174f7458eb9516287f4867c46484'
  • 18:50 RobH: dns update for db61 and db62
  • 18:25 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: 295 other wikipedias over to 1.20wmf2
  • 18:20 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: jawiki to 1.20wmf2
  • 18:16 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: ruwiki to 1.20wmf2
  • 18:12 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: frwiki to 1.20wmf2
  • 18:11 notpeter: turning db30 back on
  • 18:07 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: dewiki to 1.20wmf2
  • 17:51 cmjohnson1: shutting down storage3
  • 16:58 LeslieCarr: restarted mobile varnish instances
  • 16:58 LeslieCarr: flushed mobile varnish cache
  • 16:54 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Make sure Swift backend will have journaling too.'
  • 16:31 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Removed backend config conditional now that everything was switched over.'
  • 14:06 mutante: started container-auditor on ms-be1
  • 09:24 mutante: started container-auditor on ms-be3 and 4
  • 02:37 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Wed May 9 02:37:02 UTC 2012
  • 02:19 Reedy: Running cleanupUploadStash.php over all wikis
  • 02:13 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Wed May 9 02:13:10 UTC 2012
  • 01:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36506 - Site logo for Tsonga Wikipedia -- ts.wikipedia.org'
  • 01:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36522 - Upload link should lead to UploadWizard instead of commons:Special:Upload'
  • 01:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36663 - Please allow bureaucrats to add and remove autoreviewer status on pt.wiki'
  • 01:30 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wgShowUpdatedMarker enabled on anything that isn't enwiki or dewiki'
  • 01:27 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36533 - Set sitename to Telugu Wiktionary'
  • 01:25 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36595 - Please enable Extention:NewUserMessage on ml.wikipedia'
  • 01:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36595 - Please enable Extention:NewUserMessage on ml.wikipedia'
  • 01:21 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36571 - Please lock wikimania2011 wiki'
  • 01:21 logmsgbot: reedy synchronized closed.dblist 'Closing wikimania2011wiki'
  • 00:11 maplebed: started process to delete objects that don't exist in the container listings on all swift backends

May 8

  • 23:44 K4-713: synchronized payments cluster to r115155, DonationInterface ccfbb304
  • 23:34 LeslieCarr: purged varnish mobile cache
  • 23:25 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobilefrontend resource version again'
  • 23:25 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/ 'd43f5f19ff3599f16200d247b6838cfb04ef1473'
  • 23:25 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/ 'd43f5f19ff3599f16200d247b6838cfb04ef1473'
  • 23:22 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobilefrontend resource version'
  • 23:11 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
  • 23:11 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/ '2b1e8573fdbcab0feb3a2481167b68fb96abf663'
  • 23:10 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/ '2b1e8573fdbcab0feb3a2481167b68fb96abf663'
  • 22:53 RoanKattouw: Actually fixed it now with chmod -R g+w /h/w/conf/httpd
  • 22:47 RoanKattouw: Fixed permissions in /h/w/conf/httpd by running find -group wikidev -not -perm 020 -exec chmod g+w \{\} \;
  • 22:38 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/stylesheets/sections.css 'Live hack to live test broken interface on ICS devices on very large articles'
  • 22:34 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Enable mobile url transformation on testwiki'
  • 22:13 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'bumping MobileFrontend resource version number'
  • 22:13 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/ 'd828a8196d8bc877afdbd1559e8e6d639b51cef7'
  • 22:12 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/ 'd828a8196d8bc877afdbd1559e8e6d639b51cef7'
  • 21:53 binasher: rebooting db1018 one more time
  • 21:47 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/file/LocalFile.php 'deployed 43aa35016b03935b27d439afe9a6b3f1aad1aa8b'
  • 21:45 Ryan_Lane: adding adminbot to the repo
  • 21:32 binasher: rebooting eqiad core db slaves for kernel upgrade
  • 21:29 logmsgbot: aaron synchronized wmf-config/swift.php 'Added new thumbnail purge/import hooks handlers that use the swift backend class; unused atm.'
  • 21:23 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Added swift backend config; unused atm.'
  • 21:15 logmsgbot: asher synchronized wmf-config/db.php 'returning db45 to service'
  • 21:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Matched wikimania2013wiki configuration to that of wikimania2012wiki'
  • 21:13 maplebed: deployed container sharding for thumbnails to swift for 'dewiki', 'fiwiki', 'frwiki', 'hewiki', 'huwiki', 'idwiki', 'itwiki', 'jawiki', 'rowiki', 'ruwiki', 'thwiki', 'trwiki', 'ukwiki', 'zhwiki' (in addition to existing sharding for commons and enwiki)
  • 21:13 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Move wikimania2013wiki to php-1.20wmf2
  • 21:12 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Matched wikimania2013wiki configuration to that of wikimania2012wiki'
  • 21:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Matched wikimania2013wiki configuration to that of wikimania2012wiki'
  • 21:10 binasher: shutting down mysql across all eqiad core db slaves
  • 20:59 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'logo for wikimania2013wiki'
  • 20:57 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'remove w'
  • 20:56 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable translate on wikimania2013wiki'
  • 20:56 logmsgbot: aaron synchronized wmf-config/swift.php 'Switching purge hook to use new sharding scheme.'
  • 20:54 Reedy: Created translate related tables for wikimania2013wiki
  • 20:31 logmsgbot: reedy synchronized php-1.20wmf1/extensions/AntiBot/
  • 20:30 logmsgbot: reedy synchronized php-1.20wmf2/extensions/AntiBot/
  • 20:14 maplebed: creating sharded containers for swift for 'dewiki','fiwiki', 'frwiki', 'hewiki', 'huwiki', 'idwiki', 'itwiki', 'jawiki', 'rowiki', 'ruwiki', 'thwiki', 'trwiki', 'ukwiki', 'zhwiki'
  • 19:54 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Moved remaining wikis over to new backend config'
  • 19:34 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/
  • 19:12 LeslieCarr: flushed mobile varnish cache
  • 19:11 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/
  • 19:10 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/
  • 18:37 LeslieCarr: reenabled services on fpc5 of cr1-eqiad
  • 18:16 cmjohnson1: updating md1000 controller card firmware on storage3
  • 18:14 LeslieCarr: turned off fpc5 on cr1-eqiad to swap
  • 18:05 LeslieCarr: powering on fpc 5 on cr1-eqiad
  • 18:03 LeslieCarr: powering off fpc5 on cr1-eqiad in order for RobH to physically reseat the card
  • 17:48 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Enable TranslationNotifications on meta, incubator and wikimania2012'
  • 17:44 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Enable TranslationNotifications on mediawikiwiki'
  • 17:42 LeslieCarr: switching all masterships over to cr2-eqiad in preparation to reseat cr1 linecard
  • 17:25 LeslieCarr: flushed the mobile cache
  • 17:24 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuilding localisation cache files before TranslationNotification deploy
  • 17:18 logmsgbot: reedy synchronized php-1.20wmf2/extensions/MobileFrontend/ 'Pushing out head'
  • 17:16 logmsgbot: reedy synchronized php-1.20wmf1/extensions/MobileFrontend/ 'Pushing out head'
  • 17:14 RobH: asw-c1-eqiad connected to both cr1 and cr2
  • 15:16 cmjohnson1: shutting down storage3 to replace raid card
  • 12:40 pp-pdf1: updated mwlib to 0.13.7
  • 12:39 pp-pdf2: updated mwlib to 0.13.7
  • 12:36 pp-pdf3: updated mwlib to 0.13.7
  • 11:59 mutante: merging CSS fix for broken mobile site table layout
  • 02:18 RoanKattouw: Removed and recloned /var/lib/l10nupdate/mediawiki/extensions , it was in a weird state because magic extension submodules work now but my hacky workaround for them not working was still in place
  • 02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed
  • 01:12 logmsgbot: tstarling synchronized php-1.20wmf2/includes/api/ApiMain.php
  • 01:10 logmsgbot: tstarling synchronized php-1.20wmf1/includes/api/ApiMain.php
  • 00:44 binasher: rebooted db1034
  • 00:42 logmsgbot: tstarling synchronized php-1.20wmf2/includes/Exception.php
  • 00:42 logmsgbot: tstarling synchronized php-1.20wmf2/includes/DefaultSettings.php
  • 00:37 logmsgbot: tstarling synchronized php-1.20wmf1/includes/Exception.php
  • 00:36 logmsgbot: tstarling synchronized php-1.20wmf1/includes/DefaultSettings.php
  • 00:20 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Switched enwiki to new backend config.'

May 7

  • 23:52 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'bumping mobilefrontend resource version #'
  • 23:44 logmsgbot: reedy synchronized php-1.20wmf2/extensions/PageTriage/
  • 23:43 logmsgbot: reedy synchronized php-1.20wmf1/extensions/PageTriage/
  • 23:35 logmsgbot: reedy synchronized php-1.20wmf2/extensions/PageTriage/includes/PageTriageUtil.php
  • 23:31 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wgShowExceptionDetails to true for testwiki and test2wiki'
  • 23:25 logmsgbot: awjrichards synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07, take 3
  • 23:15 logmsgbot: reedy synchronized php-1.20wmf1/extensions/PageTriage/includes/PageTriageUtil.php
  • 23:15 logmsgbot: reedy synchronized php-1.20wmf2/extensions/PageTriage/includes/PageTriageUtil.php
  • 23:00 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'wgShowExceptionDetails = false'
  • 22:57 Ryan_Lane: restarting glusterd processes on virt1-5
  • 22:56 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobile resource version'
  • 22:54 Ryan_Lane: upgrading glusterfs on virt1-5
  • 22:49 Ryan_Lane: upgrading glusterfs on labstore1-4
  • 22:48 binasher: running an osc against plwiktionary.recentchanges on master
  • 22:40 paravoid: deleting 14k tmp files from spence's /home/nagios
  • 22:35 logmsgbot: awjrichards synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07
  • 22:34 logmsgbot: awjrichards synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07
  • 22:24 RoanKattouw: chmod 775 /usr/local/apache/common-local/php-1.20wmf2/extensions/PageTriage with dsh as root
  • 22:19 logmsgbot: raindrift synchronized php-1.20wmf1/resources/startup.js 'touch'
  • 22:18 binasher: rebooting nfs2 to new kernel
  • 22:16 logmsgbot: raindrift synchronized wmf-config/InitialiseSettings.php 'enabling PageTriage on enwp'
  • 22:14 logmsgbot: raindrift synchronized php-1.20wmf2/extensions/PageTriage 'Syncing PageTriage to enwp, a la carte'
  • 22:14 logmsgbot: raindrift synchronized php-1.20wmf1/extensions/PageTriage 'Syncing PageTriage to enwp, a la carte'
  • 21:59 mutante: was still upgrading/rebooting amssq* and knsq* hosts on the side (slow,b/c upload squids). expect temp. nagios squid reports tomorrow as well. out for now.
  • 21:44 binasher: moved default resolution for upload from eqiad to pmtpa
  • 21:29 cmjohnson1: shutting down storage3 for troubleshooting
  • 20:37 binasher: attempting a live online schema change for zuwiktionary.recentchanges on the prod master
  • 20:22 LeslieCarr: (above) restarted nagios-wm on spence
  • 20:20 LeslieCarr: restarted irc bot
  • 20:15 binasher: rebooting db45
  • 20:11 binasher: rebooting db1019
  • 18:46 logmsgbot: reedy synchronized php-1.20wmf1/extensions/Collection/Collection.session.php 'head'
  • 18:45 logmsgbot: reedy synchronized php-1.20wmf2/extensions/Collection/Collection.session.php 'head'
  • 18:25 logmsgbot: reedy synchronized php-1.20wmf2/extensions/GlobalBlocking/GlobalBlocking.class.php
  • 18:24 logmsgbot: reedy synchronized php-1.20wmf1/extensions/GlobalBlocking/GlobalBlocking.class.php
  • 18:07 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: enwiki to 1.20wmf2
  • 16:16 cmjohnson1: shutting down storage3 to reseat RAID card
  • 15:58 cmjohnson1: Going to power cycle storage3 several times to troubleshoot hardware issue
  • 15:15 RobH: updating firmware on storage3
  • 14:20 Jeff_Green: stopped cron jobs on storage3 because of RAID failure
  • 12:49 mutante: pushing out virtual host for wikimania2013 wiki. sync / apache-graceful/all
  • 11:18 mutante: continuing with upgrades/reboots in amssq* on the side during the day
  • 11:09 mutante: squids - sq* done. all latest kernel and 0 pending upgrades.
  • 09:27 mutante: rebooting bits varnish sq68-70 one by one..
  • 08:00 mutante: upgrading/rebooting the last couple sq* servers
  • 07:20 binasher: power cycled db45 (crashed dewiki slave)
  • 07:05 logmsgbot: asher synchronized wmf-config/db.php 'db45 is down'
  • 02:25 Tim: on locke: introduced 1/100 sampling for banner impressions, changed filename to bannerImpressions-sampled100.log
  • 02:12 Tim: on locke: moved fundraising logs back where they were
  • 02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed
  • 01:38 Tim: on locke: compressing bannerImpressions.log
  • 01:35 Tim: on locke: moved bannerImpressions.log to archive and restarted udp2log
  • 01:26 Tim: on locke: moved fundraising logs from /a/squid/fundraising/logs to /a/squid so that they will be processed by logrotate

May 6

  • 07:03 apergos: manually rotated udplogs on locke, copying destined_for_storage3 off to hume:/archive/emergencyfromlocke/ (jeff, this note's for you in particular)
  • 06:36 apergos: bringing up storage3 with neither /a nor /archive mounted, saw "The disk drive for /archive is not ready yet or not present" etc on boot, waited a long time, finally skipped them
  • 06:12 apergos: and powercycling the box instead. grrrr
  • 06:05 apergos: rebooting storage3: we have messages like May 6 05:45:12 storage3 kernel: [465081.410025] Filesystem "dm-0": xfs_log_force: error 5 returned. in the log, and the raid is inaccessible, megacli doesn't run either
  • 02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed

May 5

  • 09:37 mutante: squids - upgrading in the sq5x range (upload)
  • 08:53 apergos: disabling mod_compress temporarily for lighty on dataset2 (live hack), let's see what that does as far as it dying. could be issue similar to http://redmine.lighttpd.net/issues/2391
  • 06:45 mutante: squids - upgrading sq44,48 (upload)
  • 05:23 mutante: squids - finishing a couple reboots in the sq7x range
  • 03:04 binasher: rebooting db1006 as well
  • 03:04 binasher: rebooting db1038, kernel uptime scheduler chaos
  • 02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed
  • 00:21 logmsgbot: reedy synchronized php-1.20wmf2/extensions/GlobalBlocking/GlobalBlocking.class.php

May 4

  • 23:46 logmsgbot: reedy synchronized php-1.20wmf2/extensions/GlobalBlocking/GlobalBlocking.class.php
  • 23:45 logmsgbot: reedy synchronized php-1.20wmf1/extensions/GlobalBlocking/GlobalBlocking.class.php
  • 22:35 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/backend/FSFileBackend.php 'deployed a807624'
  • 22:34 LeslieCarr: clearing varnish cache and reloading varnish on mobile
  • 21:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php
  • 21:13 logmsgbot: reedy ran sync-common-all
  • 20:18 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Fix typo (cswikquote vs cswikiquote)'
  • 20:06 logmsgbot: asher synchronized wmf-config/db.php 'setting s2 writable'
  • 20:05 binasher: performing mysql replication steps for s2 master switch to db52
  • 20:04 logmsgbot: asher synchronized wmf-config/db.php 'setting s2 read-only, db52 (still ro) as master, db13 removed'
  • 19:49 logmsgbot: asher synchronized wmf-config/db.php 'setting db52 weight to 0 in prep for making new s2 master'
  • 19:32 binasher: powering off db24
  • 18:08 LeslieCarr: reloaded mobile varnish caches and purged them
  • 18:02 Ryan_Lane: gerrit upgrade is done
  • 17:55 Ryan_Lane: starting gerrit
  • 17:32 Ryan_Lane: installing gerrit package on manganese
  • 17:28 Ryan_Lane: adding gerrit 2.3 package to the repo
  • 17:25 Ryan_Lane: shutting down gerrit so that everything can be backed up
  • 16:45 apergos: lighty on dataset2 is running under gdb in screen session as root, if it dies please leave that alone (or look at it if you want to investigate)
  • 16:26 notpeter: turning off db30 (former s2 db, still on hardy, will ask asher what to do with it) to test noise in DC
  • 15:50 mutante: rebooting sq67 (bits)
  • 15:42 mutante: going through sq7x servers (text), full upgrades
  • 15:32 notpeter: removing srv281 from rendering pool until we figure out what's going on with it
  • 15:23 notpeter: putting srv224 back into pybal pool
  • 15:09 notpeter: removing srv224 from pybal pool for repartitioning
  • 14:56 notpeter: putting srv223 back into pybal pool
  • 14:50 mutante: going through sq6x (text), full upgrades
  • 14:08 notpeter: removing srv223 from pybal pool for repartitioning
  • 14:02 notpeter: putting srv222 back into pybal pool
  • 13:50 notpeter: removing srv222 from pybal pool for repartitioning
  • 13:43 notpeter: putting srv221 back into pybal pool
  • 13:30 notpeter: removing srv221 from pybal pool for repartitioning
  • 13:16 mutante: going through sq80 to sq86 (upload), full upgrade & reboot
  • 12:56 mutante: maximum uptime in the sq* group down to 171 days, so we have like a month now for the rest. stopping upgrades for the time being.
  • 12:54 notpeter: starting script to move /usr/local/apache to /a partition on all remaining non-imagescaler apaches
  • 12:47 mutante: (just) new kernels & reboot - sq45,sq49 (upload)
  • 12:30 mark: Sending ALL non-european upload traffic to eqiad
  • 12:23 mutante: (just) new kernels & reboot - sq63 to sq66 (209 days up)
  • 12:06 mutante: dist-upgrade & kernel & reboot - sq42,sq43 - rebooting upload squids one by one
  • 11:48 mutante: powercycling srv266 one more time, but now creating RT for it, once already showed CPU issue before it was reinstalled recently
  • 11:13 apergos: restarted lighty on dataset2 ... about ... half an hour ago. stupid case sensitivity
  • 10:02 apergos: tossed knsq1 through 7 from squid_knams dsh nodegroups file, prolly lots more cleanup where that came from
  • 09:34 mutante: dist-upgrade/kernel/reboot: sq37, sq41. rebooting upload squid sq41
  • 08:49 mutante: dist-upgrade & new kernel & reboot: sq33, sq36
  • 07:47 mutante: preemptive rebooting of sq* servers identified as having > 200 days of uptime
  • 02:22 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Fri May 4 02:22:42 UTC 2012
  • 02:13 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Fri May 4 02:13:58 UTC 2012
  • 00:20 logmsgbot: raindrift synchronizing Wikimedia installation... :
  • 00:18 logmsgbot: raindrift synchronizing Wikimedia installation... : Syncing the PageTriage extension, but only enabling on testwiki
  • 00:08 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Adding 'fr' to language codes for mobile feedback'
  • 00:06 maplebed: moved ms1-3 from the production cluster to the test cluster

May 3

  • 23:29 LeslieCarr: restarting networking on sq55
  • 23:29 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 23:27 LeslieCarr: restarting networking on sq54
  • 23:24 LeslieCarr: restarting networking on sq53
  • 23:21 LeslieCarr: restarting networking on sq52
  • 23:16 LeslieCarr: restarting networking on sq51
  • 21:30 notpeter: removing srv220 from pybal pool for repartitioning
  • 21:29 LeslieCarr: switching asw-a4-sdtpa from single uplink to lag
  • 21:19 notpeter: putting srv219 back into pybal pool
  • 21:14 logmsgbot: asher synchronized wmf-config/db.php 'setting wgDefaultExternalStore to cluster23'
  • 21:09 logmsgbot: asher synchronized wmf-config/db.php 'reverting cluster23 change'
  • 21:05 logmsgbot: asher synchronized wmf-config/db.php 'setting wgDefaultExternalStore to cluster23'
  • 21:02 binasher: about to move ES writes to cluster23
  • 20:47 notpeter: removing srv219 from pybal pool for repartitioning
  • 20:37 logmsgbot: reedy synchronized php-1.20wmf2/extensions/Collection/Collection.templates.php
  • 20:37 logmsgbot: reedy synchronized php-1.20wmf1/extensions/Collection/Collection.templates.php
  • 19:50 binasher: restarted profiling collector post parser.php livehack and stats.db removal
  • 19:45 notpeter: starting script to move /usr/local/apache to /a partition on all non-imagescaler, non-jobrunner apaches
  • 19:42 logmsgbot: aaron synchronized php-1.20wmf2/includes/parser/Parser.php 'live-hack out template profiling...again.'
  • 19:40 logmsgbot: aaron synchronized php-1.20wmf1/includes/parser/Parser.php 'live-hack out template profiling...again.'
  • 19:31 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Revert $wgDefaultUserOptions[enotifwatchlistpages] = 1'
  • 19:20 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 36316 - Set Add pages I edit to my watchlist to true by default for new users'
  • 19:15 logmsgbot: reedy synchronized wmf-config/CommonSettings.php '$wgDefaultUserOptions[enotifwatchlistpages] = 1'
  • 19:09 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable show update markers for some more of the larger wikis'
  • 19:00 paravoid: powercycling all of sq51-sq62, hung due to 209 days uptime
  • 18:46 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36092 - Activation of flood flag on vec.wikipedia.org'
  • 18:43 paravoid: powercycling sq59; inaccessible via either SSH or serial due to load
  • 18:41 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36183 - Fix namespace alias on Hindi Wikipedia'
  • 18:38 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36171 - Imports from Wikibooks'
  • 18:34 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36344 - Remove file upload facility on Gujarati wikipedia'
  • 18:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36344 - Remove file upload facility on Gujarati wikipedia'
  • 18:31 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36344 - Remove file upload facility on Gujarati wikipedia'
  • 18:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36386 - cswikiquote user group changes'
  • 18:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36480 - Create namespace Comments: in Greek Wikinews'
  • 17:44 RobH: db1029 ssd test items removed, can go back to normal service via asher
  • 17:43 notpeter: returning mw58 to pool
  • 17:34 RobH: shutting down db1029 for ssd card testing removal per rt 2766
  • 17:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36320 - Set $wgShowUpdatedMarker back to true on ptwiki'
  • 17:18 notpeter: removing mw58 from pool for more testin'
  • 17:16 LeslieCarr: reloaded and purged varnish cache for mobile in eqiad
  • 17:03 notpeter: mwm59 out of apache pool. using it for some testing
  • 16:16 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36359 - Add namespace 102 to $wgContentNamespaces on ptwiki Bug 36360 - Add namespace 102 to $wgNamespacesToBeSearchedDefault on ptwiki'
  • 16:08 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 36460 - Enable chunked uploads as opt-in user preference'
  • 16:06 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 31406 - Set $wgUseMathJax = true on Wikimedia wikis'
  • 15:12 notpeter: chris is taking down search1-12 to replace with new search nodes
  • 15:05 mutante: powercycling srv266
  • 13:49 mark: Built new wikimedia-base 1.00 package, stripped of most stuff now handled by Puppet, and inserted it into the lucid-wikimedia and precise-wikimedia APT repositories
  • 10:33 mutante: starting container-auditor on ms-be3
  • 08:42 logmsgbot: ariel synchronized php-1.20wmf2/LocalSettings.php 'job runners don't have /home mounted'
  • 08:16 Nemo_bis: siebrand: job queue stuck, on en.wiki jumped from 0 to 37k in the last ~36h
  • 04:52 jeremyb: fixed complaints of beta simplewiki appearing in #cvn-simplewikis on freenode on the labs side.
  • 04:00 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 02:47 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 02:38 Tim: fixed scap, was failing on the remote side due to mwversionsinuse exiting with status 1 due to /home/wikipedia/common not existing on apaches
  • 02:21 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 02:21 Tim: aborted scap and re-ran with fanout=5 instead of 30, since nfs1 CPU was maxed out
  • 02:14 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 02:04 logmsgbot: aaron synchronized multiversion/activeMWVersions 'deployed r115116'
  • 02:00 logmsgbot: LocalisationUpdate failed (php-1.20wmf2) at Thu May 3 02:00:13 UTC 2012
  • 02:00 logmsgbot: LocalisationUpdate failed (php-1.20wmf1) at Thu May 3 02:00:12 UTC 2012

May 2

  • 23:56 logmsgbot: aaron synchronized multiversion/ 'deployed svn HEAD'
  • 23:41 maplebed: started swift old-object-deleter on ms-be3
  • 23:28 maplebed: update - roan takes the blame
  • 23:28 logmsgbot: raindrift synchronized wmf-config/InitialiseSettings.php 'Aborting todays PageTriage deployment'
  • 23:22 maplebed: swift is recovered; ~20 minutes of impaired service. cause unknown, but the swiftcleaner looks likely.
  • 23:18 RoanKattouw_away: Scap tried to push two new source trees to php-1.20wmf1-* and php-1.20wmf2-* , causing full disks. Cleaning up now
  • 23:13 LeslieCarr: restarting nagios bot
  • 22:59 logmsgbot: raindrift synchronizing Wikimedia installation... :
  • 22:49 logmsgbot: preilly synchronized php-1.20wmf2/extensions/MobileFrontend/ 'contact us change'
  • 22:48 logmsgbot: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/ 'contact us change'
  • 21:43 logmsgbot: asher synchronized wmf-config/db.php 's2: pulling db30, raising weights on new hosts'
  • 21:02 ^demon: finished database maintenance on db9.reviewdb
  • 20:24 hashar: updated TestSwarm to distribute tests to Firefox 12 users.
  • 20:12 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Re-pushing for srv219 and srv220
  • 20:07 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files:
  • 20:04 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved special, wikimedia, wikiquote, wikiversity, and wiktionary wikis to 1.20wmf2
  • 19:59 logmsgbot: asher synchronized wmf-config/db.php 'adding dbs 52,53,57 to s2 at lower weights'
  • 19:55 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files:
  • 19:47 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: metawiki to 1.20wmf2
  • 19:40 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: commonswiki to 1.20wmf2
  • 19:36 preilly: fix for PHP Warning: in_array() expects parameter 2 to be array, string given in /usr/local/apache/common-local/php-1.20wmf1/extensions/MobileFrontend/skins/SkinMobile.php on line 156
  • 19:36 logmsgbot: preilly synchronized php-1.20wmf2/extensions/MobileFrontend/skins/SkinMobile.php 'fix php notice for in_array'
  • 19:35 logmsgbot: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/skins/SkinMobile.php 'fix php notice for in_array'
  • 19:34 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved all wikibooks to 1.20wmf2
  • 19:25 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved enwikibooks to 1.20wmf2
  • 19:21 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved sourceswiki to 1.20wmf2
  • 19:20 logmsgbot: preilly synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
  • 19:11 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved all wikisource sites to 1.20wmf2
  • 19:03 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved all wikinews sites to 1.20wmf2
  • 19:00 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'only remove images for DIGI'
  • 18:51 logmsgbot: asher synchronized wmf-config/db.php 'added ES cluster23 to templateOverridesByCluster but not activating'
  • 18:48 binasher: creating a blobs_cluster23 ES shard table for all active projects
  • 18:31 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikisource to 1.20wmf2
  • 18:25 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikinews to 1.20wmf2
  • 18:24 RobH: updating dns
  • 18:20 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikinews to 1.20wmf2
  • 18:09 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'only remove images for DIGI'
  • 17:57 logmsgbot: preilly synchronized php-1.20wmf2/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
  • 17:56 logmsgbot: preilly synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
  • 17:46 K4-713: updated production civicrm to r1726
  • 17:36 logmsgbot: aaron synchronized php-1.20wmf2/includes/specials/SpecialContributions.php 'Deployed 799998c3a160ef6dd3b926b7d6fec223682b788c'
  • 17:30 logmsgbot: preilly synchronized php-1.20wmf2/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
  • 17:28 logmsgbot: preilly synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
  • 17:14 logmsgbot: catrope synchronized php-1.20wmf2/skins/vector/ 'Deploying 7260cc5fe4071e03241378ba1a48bc0b6f188948'
  • 16:51 RoanKattouw: Changing docroot/bits/skins-1.19 and other 1.19 symlinks to point to the 1.20wmf1 tree instead. This is needed because we're still getting requests for magnify-clip.png at the 1.19 URL from cached HTML
  • 16:16 notpeter: starting innobackupex from db1040 to db1022 for new s6 snapshot slave
  • 15:31 notpeter: no nagios bot, kicking nagios on spence
  • 15:04 RobH: shutting down mw64 for hw test per rt 1890
  • 15:03 RobH: bellin crashed, unresponsive to ssh or serial console
  • 14:43 mark: Built varnish for precise as 3.0.2-2wm5 and imported it into APT repository precise-wikimedia
  • 11:52 mark: Started distribution upgrade of server stafford from Lucid to Precise
  • 10:41 mutante: refreshLinks.php - started it once again in a screen on hume, just for s1. last cron failed with "mwscript command not found"?? well now it is there again and running
  • 10:09 mark___: Started distribution upgrade of server sockpuppet from Lucid to Precise
  • 09:20 mutante: upgrading bugzilla to 4.0.6
  • 08:43 mutante: kaulen: installing various upgrades (apache,mysql,cron,php-wikidiff2,...)
  • 08:40 logmsgbot: hashar synchronized php-1.20wmf2/includes/GitInfo.php 'Fix Special:Version for 1.20wmf2 (commit ae12df0 , bug 36361 )'
  • 08:20 hashar: cherry-picked ae12df0 commit to 1.20wmf2 since there are mobilefrontend commits pending.
  • 02:35 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Wed May 2 02:35:51 UTC 2012
  • 02:32 K4-713: updated production civicrm to r1723
  • 02:13 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Wed May 2 02:13:30 UTC 2012
  • 01:01 notpeter: starting innobackupex from db57 to db53 for new s2 slave for the one zillionth time

May 1

  • 22:28 logmsgbot_: asher synchronized wmf-config/db.php 'returning db45'
  • 22:23 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db45, last coredb on prior fb mysql build'
  • 22:17 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Enable doublepage on test2wiki'
  • 22:11 binasher: upgraded percona-toolkit on coredbs to 2.1.1 - now with the potential to run online schema changes on tables without single column unique keys!!
  • 21:39 binasher: created an ops db on all core mysql shards
  • 21:00 notpeter: reinstalling db53. this time with correct raid!
  • 20:40 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Fixing mailto links on mobilefrontend feedback form to properly populate subject lines'
  • 19:32 LeslieCarr: reverting vrrp mastership of row a to cr2-eqiad
  • 19:29 LeslieCarr: switching vrrp mastership of row a to cr1-eqiad
  • 18:32 logmsgbot_: awjrichards synchronized wmf-config/InitialiseSettings.php 'Make testwiki use mobile domain for URLs'
  • 18:28 LeslieCarr: making routing change, higher risk
  • 17:51 Ryan_Lane: make that virt0
  • 17:51 Ryan_Lane: switching the session cache back to filesystem on virt1, since it isn't working properly with memcache
  • 17:29 maplebed: kicking nagios to check a change to fix the mobile LVS alert
  • 17:25 logmsgbot_: nikerabbit synchronized php-1.20wmf2/extensions/TranslationNotifications/ 'Deploying TranslationNotifications code'
  • 17:08 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki to 1.20wmf2
  • 16:27 notpeter: starting innobackupex from db1034 to db53 for new s2 slave
  • 16:27 notpeter: starting innobackupex from db57 to db52 for new s2 slave
  • 16:03 notpeter: rebuilding db52 and db53 as s2 slaves
  • 15:47 logmsgbot_: asher synchronized wmf-config/db.php 's1: raising db59,60 weights, pulling db52/53 for reuse'
  • 09:23 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'hewiki account creation high throttle limits'
  • 04:04 Tim: on all apaches, running "chmod -R a+rX /usr/local/apache/common-local/" to clean up after killed rsyncs which left files unreadable
  • 02:23 logmsgbot_: LocalisationUpdate completed (1.20wmf2) at Tue May 1 02:23:29 UTC 2012
  • 02:21 logmsgbot_: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/specials/SpecialMobileFeedback.php
  • 02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Tue May 1 02:14:06 UTC 2012
  • 02:06 logmsgbot_: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/specials/SpecialMobileOptions.php
  • 01:51 Ryan_Lane: bringing up all labs instances with a 60 second lag
  • 01:40 Ryan_Lane: rebooting virt0
  • 01:35 Ryan_Lane: rebooting virt3
  • 01:33 logmsgbot_: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/HtmlFormatter.php
  • 01:26 Ryan_Lane: rebooting virt5
  • 01:18 Ryan_Lane: rebooting virt4
  • 01:03 Ryan_Lane: rebooting virt2
  • 00:51 LeslieCarr: restarted swift-container-auditor on ms-be5
  • 00:38 logmsgbot_: tstarling synchronizing Wikimedia installation... :
  • 00:26 Tim: removed large syslogs from mw60 and ran sync-common
  • 00:18 Tim: on mw60 there was an actual directory at /usr/local/apache/common/php where a symlink should have been. fixed

April 30

  • 23:58 logmsgbot_: aaron synchronized php
  • 23:44 RoanKattouw: Started Apache back up on mw60
  • 23:39 RoanKattouw: Running scap-1 on the Apaches with dsh
  • 23:38 RoanKattouw: Moved /home/catrope/php-1.19 to /home/wikipedia/lazy-backups/php-1.19
  • 23:38 Reedy: mediawiki.org to 1.20wmf2
  • 23:37 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mw.org to 1.20wmf2
  • 23:35 RoanKattouw: Strike that, instead moving /home/w/common/php-1.19 to /home/catrope/php-1.19
  • 23:34 RoanKattouw: Removing /home/w/common/php-1.19 , NFS might freak out a bit
  • 23:31 RoanKattouw: Removed php-1.19 from mw60 , synced it, and restarted Apache
  • 23:28 RoanKattouw: Synced docroot and purged varnish for static-1.20wmf2, bits seems to be working for 1.20wmf2 now
  • 23:27 RoanKattouw: mw60 has full disk, stopping Apache for now
  • 22:50 Ryan_Lane: rebooting virt5
  • 22:42 Ryan_Lane: rebooting virt3
  • 22:35 Ryan_Lane: rebooting virt4
  • 22:28 Ryan_Lane: rebooting virt1
  • 22:23 Ryan_Lane: bringing down all instances (yay gluster)
  • 21:12 pgehres: re-enabled Jenkins jobs on Aluminium after db1008 reboot
  • 21:11 pgehres: CiviCRM back to normal after db1008 reboot
  • 21:07 Jeff_Green: db1008 gets kernel update and reboot
  • 21:00 pgehres: put CiviCRM on Aluminium in maintenance mode for db1008 reboot
  • 20:59 logmsgbot_: reedy synchronized php-1.20wmf2/resources/startup.js 'touch'
  • 20:57 pgehres: disabled all Jenkins jobs on Aluminium in prep for db1008 reboot
  • 20:50 Jeff_Green: db1025 and storage3 get new kernels and reboot
  • 20:28 notpeter: restarting, once again, innobackupex from db1034 to db57 for new s2 slave after fenari crash killed my screen
  • 20:24 Reedy: Running ddsh -F30 -cM -g mediawiki-installation -o -oSetupTimeout=10 '/usr/bin/scap-1' in the hope it syncs all the files that would be nice to be on the app servers
  • 20:18 logmsgbot_: reedy synchronized php-1.20wmf2/cache/ 'Synching whole cache directory'
  • 19:59 notpeter: restarting nagios to get rid of some old checks
  • 19:57 Jeff_Green: payments cluster gets kernel updates and reboots
  • 19:55 logmsgbot_: reedy synchronizing Wikimedia installation... : Rebuild l10n for 1.20wmf2
  • 19:49 logmsgbot_: reedy synchronized wmf-config/ExtensionMessages-1.20wmf2.php 'Syncing file'
  • 19:49 logmsgbot_: reedy synchronized php-1.20wmf2/LocalSettings.php 'Pushing LocalSettings.php'
  • 19:48 paravoid: upgraded & rebooted ssl3001, ssl3002, ssl3003
  • 19:45 logmsgbot_: reedy synchronizing Wikimedia installation... : Pushing out new symlinks etc, moving test2wiki to 1.20wmf2
  • 19:30 logmsgbot_: reedy synchronized php-1.20wmf2 'Syncing 1.20wmf2 live hack revisions'
  • 19:28 logmsgbot_: reedy synchronized php-1.20wmf2 'Syncing 1.20wmf1 live hack revisions'
  • 19:26 logmsgbot_: reedy synchronized php-1.20wmf2 'Syncing 1.20wmf2 for deployment'
  • 19:18 Reedy: Syncing php-1.20wmf2 files from NFS to apaches. Likely to upset NFS (or the uplink for the switch nfs is on) for a little while...
  • 19:14 paravoid: rebooting ssl1004
  • 19:06 paravoid: rebooting ssl1003
  • 19:00 paravoid: rebooting ssl1002
  • 18:59 notpeter: starting innobackupex from db1034 to db57 for new s2 slave
  • 18:50 paravoid: rebooting ssl1001
  • 18:42 Jeff_Green: grosley gets new kernel + reboot
  • 18:35 Jeff_Green: aluminium gets kernel update, yayyyyyyy!
  • 18:34 paravoid: pooled back ssl1; depooling ssl3 and rebooting
  • 18:29 binasher: rebooting mw45 for kernel upgrade
  • 18:27 Jeff_Green: power cycling aluminium which faceplanted
  • 18:22 binasher: rebooting mw45
  • 18:21 notpeter: rebuilding db57 again, this time with more correct raid level!
  • 18:19 logmsgbot_: asher synchronized wmf-config/db.php 'adding db59,60 to s1 with low weights'
  • 18:16 paravoid: depooled & rebooting ssl1
  • 18:09 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Sanity run after script changes.
  • 18:00 logmsgbot_: aaron synchronized multiversion
  • 17:58 logmsgbot_: reedy synchronized php-1.20wmf1/includes/MagicWord.php 'https://gerrit.wikimedia.org/r/6135'
  • 17:44 logmsgbot_: aaron synchronized wikiversions.cdb
  • 17:43 AaronSchulz: updating multiversion code
  • 08:34 mutante: reinstalling srv266
  • 08:08 mutante: upgraded mw1,mw2,mw35
  • 07:59 mutante: reinstalling srv206
  • 07:50 mutante: upgrading mw36
  • 07:37 apergos: powercycling srv266, had this message on mgmt console: Severity: Non Recoverable, SEL:CPU Machine Chk: Processor sensor, transition to non-recoverable was asserted
  • 07:22 mutante: installing upgrades on srv212
  • 07:19 apergos: reinstalled srv284, seems to be up now
  • 07:17 mutante: powercycled mw8
  • 02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Mon Apr 30 02:13:59 UTC 2012

April 29

  • 20:13 apergos: srv206 won't run puppet, see syslog, clearing out the yaml file didn't help, since it's not urgent I'm leaving it for tomorrow
  • 19:51 Ryan_Lane: depooling ssl3004
  • 19:51 Ryan_Lane: removed the ipv6 addresses from maerlant and added them to ssl3001, then restarted nginx
  • 19:50 Ryan_Lane: repooling ssl3001
  • 19:46 apergos: powercycled mw60, same reason as the rest
  • 19:12 apergos: power cycled mw48 and mw52 (hung just like the others)
  • 18:05 apergos: ssl3002 and 3003 were rebooted and are the entire ssl esams pool right now
  • 18:02 apergos: ok the ssl300x situation: ssl3001 is now disabled in the pybal conf file on fenari; it is picking up the ipv6and4labs template and I don't know if that's right, anyways nginx doesn't want to bind to one of those addresses. ssl3004 isn't reachable or pingable even via mgmt but at least lvs sees it's gone
  • 16:34 apergos: powercycling the ssl300x.esams hosts. 212 days of uptime... (and 3001 had gone out to lunch)
  • 12:34 mutante: and finally mw1, so just leaving mw1102 and mw60 for having other issues for a while (->Nagios)
  • 12:22 mutante: check_all_memcached recovered, but still same treatment for mw10 and 11 (8 and 15h ago)
  • 12:15 mutante: powercycling mw32,mw33,mw44,mw46 one by one, they were all frozen and went down between like 17 and 24 hours ago approx.
  • 12:07 mutante: powercycling mw30
  • 02:56 paravoid: rebooting ssl2 (has 214 days uptime)
  • 02:47 paravoid: powercycled ssl3
  • 02:13 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sun Apr 29 02:13:58 UTC 2012

April 28

  • 22:53 Reedy: Job queue logs on gdash seem to have stopped on the 26th...
  • 22:29 logmsgbot_: reedy synchronized php-1.20wmf1/includes/EditPage.php 'https://gerrit.wikimedia.org/r/6088'
  • 21:52 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php
  • 21:51 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php
  • 21:12 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php
  • 21:10 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php
  • 21:09 logmsgbot_: reedy synchronized common/php-1.20wmf1/extensions/cldr/LanguageNames.body.php 'more debugging'
  • 20:51 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php 'Add debugging'
  • 20:49 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'Add debuglog group for language code not being a string'
  • 19:04 logmsgbot_: reedy synchronized php-1.20wmf1/includes/ExternalEdit.php 'https://gerrit.wikimedia.org/r/6077'
  • 19:03 logmsgbot_: reedy synchronized php-1.20wmf1/includes/api/ApiParse.php 'https://gerrit.wikimedia.org/r/6076'
  • 02:24 Ryan_Lane: all mediawiki boxes that have uptimes affected by the bug are being rebooted at 8 minute intervals
  • 02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sat Apr 28 02:14:14 UTC 2012
  • 01:33 paravoid: powercycled mw29
  • 01:21 paravoid: powercycled mw38
  • 00:17 notpeter: db12 is sooooo sloooooow, starting innobackupex from db1017 to db60 for new s1 slave

April 27

  • 22:15 paravoid: upgraded ssl4 to nginx 0.7.65-5wmf1 and added it back to the pool
  • 21:45 paravoid: rebooting ssl4 after upgrading (incl. a kernel update)
  • 20:00 notpeter: starting innobackupex from db1040 to db1022 for new eqiad s6 snapshot slave, again
  • 19:59 notpeter: starting innobackupex from db12 to db60 for new s1 slave, again
  • 19:58 notpeter: starting innobackupex from db1017 to db59 for new s1 slave, again
  • 19:49 paravoid: de-pooling ssl4
  • 19:30 mutante: test - added new gerrit interwiki prefix for SAL/wikitech - gerrit:6002
  • 19:14 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Fix rights for afttest and afttest-hide groups'
  • 18:25 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'Cleanup enotif related settings'
  • 18:24 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Set wgEnotifWatchlist to true for all wikis. Leaving wgShowUpdatedMarker set to false for all the big wikis'
  • 16:50 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'Simplify enotif code'
  • 16:45 notpeter: starting innobackupex from db1040 to db1022 for new eqiad s6 snapshot slave
  • 16:45 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'wgEnotifWatchlist defaulting to true. Big wikis explicitly set to false'
  • 12:25 mutante: fixing integration.mw testswarm and applying fixed erb template by hashar
  • 04:35 Tim: added an account for myself on observium
  • 04:22 logmsgbot_: tstarling synchronized wmf-config/mc.php 'increased wgMemCachedTimeout from 500ms to 3000ms for bug 35900'
  • 02:13 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Fri Apr 27 02:13:51 UTC 2012
  • 00:12 Ryan_Lane: upgrading gluster on all instances
  • 00:09 Ryan_Lane: upgrading gluster on labstore1-4

April 26

  • 23:46 logmsgbot_: asher synchronized wmf-config/db.php 'raising db58 weight'
  • 23:09 Reedy: Recreated resources directory symlinks in bits docroot
  • 21:21 LeslieCarr: started deletion script on ms-be4
  • 19:20 notpeter: restarting puppet on db59
  • 19:18 Ryan_Lane: made LiquidThreads disabled by default on labsconsole, now users must add the special string to a page to enable it there.
  • 19:18 Ryan_Lane: enabled NewUserMessage on labsconsole
  • 19:06 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Add group permissions settings for AFTv5'
  • 18:33 logmsgbot_: catrope synchronizing Wikimedia installation... : Deploy AFTv5 updates
  • 17:17 LeslieCarr: reloaded varnish on mobile caches
  • 14:19 notpeter: cleaned log space on search1017 and search1018 and started lucene
  • 14:04 notpeter: stopping lucene on search1017 and 1018 to take that out of the equation
  • 13:57 mutante: installing some (security) upgrades on fenari (apt,cron,samba,...)
  • 13:54 notpeter: restarting lucene on search1017 and search1018
  • 13:27 logmsgbot_: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Narayam on tewiki bug 33480'
  • 13:23 logmsgbot_: nikerabbit synchronized php-1.20wmf1/extensions/Narayam/ 'Updating Narayam'
  • 13:03 notpeter: (re)starting innobackupex from db1017 to db59 for new s1 slave
  • 12:56 mark: Created precise-wikimedia APT distribution
  • 08:27 mark: Power cycled mw40
  • 06:57 binasher: restarted pybal on amslvs1 with bgp disabled
  • 06:57 binasher: restarted pybal on amslvs2 with bgp enabled
  • 06:47 binasher: restarting pybal on amslvs2
  • 06:26 binasher: shifting all traffic out of esams
  • 02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Thu Apr 26 02:14:03 UTC 2012
  • 01:42 Ryan_Lane: starting mysql on db46
  • 01:40 Tim: on professor: restarted udpprofile collector
  • 01:37 Ryan_Lane: powercycling db46
  • 01:33 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db46, host down'
  • 00:44 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php

April 25

  • 22:14 LeslieCarr: restarted swift-container-auditor on ms-be3
  • 21:55 RobH: pushing dns update for scs-c1-eqiad and ps1-c#-eqiad
  • 21:22 LeslieCarr: reloading varnish on mobile caches cp1041 cp1042 cp1043 cp1044
  • 21:21 LeslieCarr: clearing mobile varnish cache
  • 19:38 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'Attempted fatal fix'
  • 19:33 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/Math/ 'Deploying 4c9e7dbe761c798ce15d7e2acef829a1582c058b'
  • 19:14 notpeter: starting innobackupex from db12 to db59 for new s1 slave, per mr. feldman's directions
  • 18:56 notpeter: starting innobackupex from db1017 to db60 for new s1 slave
  • 18:49 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/FeaturedFeeds/SpecialFeedItem.php 'Deployed 4fb14a7b2ca9be715b820a9847d999f21c7d2cfc'
  • 18:36 logmsgbot_: aaron synchronized php-1.20wmf1/img_auth.php 'Deployed f7e49bd71bd8356751242c5ce1cbae076a27cf7a'
  • 18:10 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moving all remaining wikis to php-1.20wmf1
  • 17:07 LeslieCarr: reloaded mobile varnish configs
  • 17:06 LeslieCarr: purging mobile cache
  • 16:40 LeslieCarr: starting delete script on ms-be3
  • 16:14 RobH: done moving mgmt connections and serial connections in s8-eqiad for now
  • 16:05 RobH: reshuffling cables in eqiad for serial and mgmt connections in a8, this may affect all eqiad mgmt and serial connections for the next 5 minutes
  • 15:29 hashar: gallium: MySQL had issues most probably because of the mysql configuration snippets. https://gerrit.wikimedia.org/r/5796 might solve that.
  • 14:03 mutante: gallium - don't start puppet unless the erb template fix for mysql has been merged
  • 13:52 mutante: gallium stopped puppet, moved log_slow_queries config, re-setting up mysql again
  • 13:41 mutante: gallium/testswarm - back up after mysql upgrade and issue starting the service
  • 13:36 mutante: gallium - dpkg-reconfigure mysql-server-5.1, mysql does not start right
  • 13:27 mutante: running apt-get upgrade on gallium
  • 12:29 mark: Sending US, Brazil, Indian traffic to upload.eqiad
  • 11:39 mutante: running authdns-update to add analytics100x and labsdb100x mgmt names
  • 05:35 paravoid: powercycled lvs6, was dead and not responding to serial
  • 03:43 logmsgbot_: asher synchronized wmf-config/db.php 'adding db58 to s7 as a new slave with a low weight'
  • 03:24 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db58'
  • 03:23 logmsgbot_: asher synchronized wmf-config/db.php 'adding db58 to s7 as a new slave with a low weight'
  • 02:28 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Wed Apr 25 02:28:47 UTC 2012
  • 02:14 logmsgbot_: LocalisationUpdate completed (1.19) at Wed Apr 25 02:14:46 UTC 2012
  • 00:02 binasher: profiling collector was pegged at 100% cpu and graphs were turned to swiss cheese due to a bad stats call in 1.20, now fixed

April 24

  • 23:59 binasher: powering off db16
  • 23:55 binasher: streaming hot backup of db1041 to db58 (building a new s7 slave)
  • 23:48 logmsgbot_: aaron synchronized php-1.19/includes/Setup.php 'Hacked out session request stats.'
  • 23:46 logmsgbot_: aaron synchronized php-1.20wmf1/includes/Setup.php 'Deployed 42fcd43299246ecd1b265fcfcdd01a60319cf378'
  • 23:19 AaronSchulz: Running 'mwscriptwikiset maintenance/populateRevisionSha1.php all.dblist' on hume
  • 22:43 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Enabled file change journal on wikis using the new backend config.'
  • 22:20 AaronSchulz: Tables added
  • 22:18 binasher: rebooting db16 with updated kernel. it's probably still hopeless (dimm errors)
  • 22:18 AaronSchulz: Creating the filejournal table on all wikis
  • 21:59 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Switched commonswiki to the new backend config format.'
  • 21:48 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db16, memory errors'
  • 20:13 apergos: re-enabled replication via cron on ms7, it should catch up within an hour or so
  • 20:10 binasher: reimaged db58 with fixed raid setup, imaging db59
  • 19:51 notpeter: starting innobackupex from db1034 to db57 for new s2 slave
  • 19:50 Ryan_Lane: repooling ssl3001
  • 19:28 Ryan_Lane: depooling ssl3001
  • 18:18 LeslieCarr: deploying to frontend
  • 17:48 notpeter: deploying new squid conf to cp1001 frontend. is just a udp2log port change.
  • 17:19 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Using newer backend for shared repos for testwiki, test2wiki, and mediawikiwiki.'
  • 16:55 logmsgbot_: nikerabbit synchronized wmf-config/CommonSettings.php 'Translate extension configuration changes'
  • 11:54 apergos: after much cursing and kicking zfs, a manual snapshot replication is running in screen as root on ms7 to ms8, expect it to take at least a day
  • 11:44 mark: Sending all non-european upload traffic back to pmtpa to prepare for eqiad varnish storage rework
  • 08:56 mutante: updated blog theme per guillaume (April commits)
  • 08:05 apergos: temporarily disabled automatic zfs replication from ms7 -> ms8, cleared out space on ms8, catching up by hand
  • 04:00 Ryan_Lane: powercycling ssl1
  • 02:47 logmsgbot_: aaron synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'Fixed notice spam.'
  • 02:45 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/MobileFrontend/MobileFrontend.body.php 'Fixed notice spam.'
  • 02:37 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Restructured filerepo config a bit; nothing changed yet.'
  • 02:28 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Tue Apr 24 02:28:47 UTC 2012
  • 02:15 logmsgbot_: LocalisationUpdate completed (1.19) at Tue Apr 24 02:15:00 UTC 2012
  • 00:15 logmsgbot_: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/stylesheets/common.css '0be2dc1288361c51f91533f1f77e78d9279b86e0'
  • 00:13 logmsgbot_: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/common.css 'r115019'

April 23

  • 23:35 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
  • 23:07 logmsgbot_: awjrichards synchronizing Wikimedia installation... :
  • 23:02 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Add code for new URL scheme based on version_compare() logic'
  • 22:51 logmsgbot_: awjrichards synchronizing Wikimedia installation... : MobileFrontend updates per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#23_April.2C_2012
  • 22:33 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'bumping mobile frontend resource version'
  • 21:49 logmsgbot_: catrope synchronized php-1.20wmf1/resources/mediawiki/mediawiki.js 'Deploy 6e55a770b26b17b8fc9b5b4fe943dcc2867df4f3'
  • 21:27 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'Deploy 93d470b'
  • 20:41 mutante: neon - upgraded libssl, started icinga after adding monitor group
  • 20:32 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo/FileRepo.php 'Disabled write checks in the cleanDir() function.'
  • 20:31 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo/FileRepo.php 'Disabled write checks in the quickImport/quickPurge functions.'
  • 19:43 logmsgbot_: catrope synchronized php-1.20wmf1/includes/specials/SpecialListgrouprights.php 'Deploy 047543b6805a268c8d689a7a1ce12ec545ef79a9'
  • 18:43 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'touch'
  • 18:43 logmsgbot_: reedy synchronized flaggedrevs.dblist 'Seems I never added ukwiki to the dblist... Oh well'
  • 18:32 logmsgbot_: aaron synchronized wikiversions.dat
  • 18:32 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved enwiki to 1.20wmf1
  • 18:28 logmsgbot_: aaron synchronized php-1.20wmf1/includes/specials/SpecialContributions.php 'Deployed 72969cf8c9a403430c8c93fc20ab3118328c4d9c'
  • 17:06 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Made mediawikiwiki use the newer backend config.'
  • 14:33 notpeter: stopping puppet on cp1041 as well
  • 14:17 notpeter: temp stopping puppet on cp1042-1044
  • 13:09 mutante: powercycling frozen mw25, looks like mw21 above but no console output to paste here
  • 13:07 mutante: fix puppet run on spence by removing searchidx1 resources from db9 (was in weird state being in site but also decommissioned)
  • 11:23 mutante: powercycling mw21 - it died with this http://etherpad.wikimedia.org/mw21
  • 10:55 mutante: force-reload ircecho on manganese to make gerrit-wm rejoin #mediawiki
  • 10:48 hashar: banned CIA bots from #mediawiki IRC channel. It started spamming us with notifications from KDE and mandriva projects. See http://permalink.gmane.org/gmane.science.linguistics.wikipedia.technical/60905
  • 10:30 mutante: searchidx1 was in site.pp and decom.pp at the same time. breaks puppet runs on spence. cannot override local resource. removing from site
  • 10:27 mutante: killed a couple morebots processes on wikitech and it came back by itself :p

April 21

  • 02:29 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sat Apr 21 02:29:40 UTC 2012
  • 02:15 logmsgbot_: LocalisationUpdate completed (1.19) at Sat Apr 21 02:15:20 UTC 2012

April 20

  • 22:03 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Switched test2wiki to use the new LocalRepo config style.'
  • 22:01 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Switched testwiki to use the new LocalRepo config style.'
  • 21:52 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Added NFS backends for local/shared repos; they are not used yet.'
  • 21:12 LeslieCarr: starting swift delete script on ms-be2
  • 20:02 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo/file/LocalFile.php 'deployed c77fbd394cda701758ad4523113f567bff7ede66'
  • 19:45 apergos: powercycled mw4, it was unresponsive to pings and via mgmt
  • 18:48 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/ArticleFeedbackv5 'Apply https://gerrit.wikimedia.org/r/5449'
  • 18:48 logmsgbot_: catrope synchronized php-1.19/extensions/ArticleFeedbackv5 'Apply https://gerrit.wikimedia.org/r/5449'
  • 18:07 notpeter: restarting nginx on ssl1002 and ssl1004 as they are not back up
  • 18:01 logmsgbot_: awjrichards synchronizing Wikimedia installation... :
  • 17:31 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Remove wgArticleFeedbackv5OversightEmails override that was messing things up'
  • 17:15 notpeter: stopping puppet on locke and emery. just to be safe...
  • 17:11 RoanKattouw: Fixed ownership of /h/w/common/php-1.20wmf1/cache/l10n , should be owned by l10nupdate but was owned by reedy
  • 17:01 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36124 - Deploy ProofreadPage extension on test2'
  • 17:00 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Giving test2wiki moar namespaces'
  • 16:11 mutante: add missing memcached servicegroup to nagios, restarted
  • 15:10 mutante: apache error log on stafford has ruby exceptions re: phusion_passenger
  • 15:01 mark: Converted OSPF directly connected redistributed routes from type 2 to type 1
  • 14:51 mutante: starting swift-container-auditor on ms-be1
  • 14:30 mark: Disabled down-pref of Tampa AS2828 routes
  • 13:14 logmsgbot_: demon synchronized php-1.20wmf1/maintenance/backupTextPass.inc 'Pushing out Idb58ce27 for Ariel/Chris for dumps'
  • 13:10 mark: Sending India upload traffic to upload-lb.eqiad
  • 12:40 mark: Disabled iptables firewalls on internal prod swift cluster servers as it's dropping packets
  • 12:22 mutante: restarted pdns on ns2
  • 11:19 mark: Sending US upload traffic to eqiad as well
  • 10:27 mark: Sending Brazil upload traffic to eqiad
  • 08:39 hashar: Gave up running the l10nupdate script; it has some file permission issues. Opened bug 36119 and bug 36120
  • 08:36 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Fri Apr 20 08:36:53 UTC 2012
  • 08:27 logmsgbot_: LocalisationUpdate completed (1.19) at Fri Apr 20 08:27:36 UTC 2012
  • 08:13 hashar: rerunning l10nupdate for bug 34938
  • 08:02 hashar: running l10nupdate for bug 34938
  • 06:27 pgehres: re-enabled PayPal on donatewiki and wmfwiki and resumed queue consumer on Aluminium
  • 05:32 LeslieCarr: flushing mobile varnish cache
  • 04:56 pgehres: disabled paypal on donatewiki and disabled queue consumer for duration of PayPal outage
  • 02:33 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Fri Apr 20 02:33:02 UTC 2012
  • 02:23 logmsgbot_: LocalisationUpdate completed (1.19) at Fri Apr 20 02:23:57 UTC 2012
  • 01:47 logmsgbot_: awjrichards synchronizing Wikimedia installation... : r114983 on wikis still running 1.19

April 19

  • 23:33 binasher: powercycled es1004
  • 21:08 Jeff_Green: changed nagios contactgroup fundraising from tfinc/awrichards --> jgreen
  • 21:03 RoanKattouw: Scap is broken in some weird way, it just stops running after the scap1-skins step. Doesn't run scap-1 (which does the actual sync), doesn't log "sync done", doesn't update graphite
  • 21:01 logmsgbot_: catrope synchronizing Wikimedia installation... : Running scap again, AFTv5 is acting up
  • 19:34 logmsgbot_: catrope synchronizing Wikimedia installation... : ArticleFeedbackv5 updates
  • 19:29 RoanKattouw: Running scap to deploy AFTv5 updates, and running AFTv5 schema changes on enwiki at the same time
  • 18:50 logmsgbot_: catrope synchronized wmf-config/InitialiseSettings.php 'Set wmgArticleFeedbackv5OversightEmails for enwiki'
  • 18:25 notpeter: nothing obvious in logs on db1005, starting mysql
  • 18:15 notpeter: rebooting db1005. it's dead, jim.
  • 17:52 RoanKattouw: Running schema changes for AFTv5 on testwiki
  • 17:51 Jeff_Green: discovered nfs1 had ~1K redundant iptables rules, removed extras and reloaded
  • 17:42 Jeff_Green: discovered sanger had ~7K redundant iptables rules, removed extras and reloaded
  • 13:56 mutante: adding refreshLinks cron jobs to hume per RT-2355 (via puppet). if there should be any performance issues, schedule can be changed like <cluster>@<hour> in mediawiki.pp (and/or remove mediawiki::refreshlinks from hume and clear out the jobs of user mwdeploy)
  • 08:35 mutante: emery - "udp2log_age" says some squid logfiles have not been written to in 6 hours, but from the filenames it looks like this isn't a reason to worry, right?
  • 07:49 mutante: stat1 - this also needs udp2log stuff fixed. currently Could not find class misc::udp2log::udp-filter
  • 07:47 mutante: gilman - what's up with it? closes SSH, does not like mgmt pass, was running jenkins but broken
  • 07:43 mutante: owa[1-3] They don't have real puppet freshness issues, it's rather firewalling and the snmp traps
  • 02:30 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Thu Apr 19 02:30:33 UTC 2012
  • 02:21 logmsgbot_: LocalisationUpdate completed (1.19) at Thu Apr 19 02:21:31 UTC 2012

April 18

  • 22:55 LeslieCarr: updating exim4.conf on mchenry to not allow old ranges
  • 21:03 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files:
  • 20:47 logmsgbot_: catrope synchronized php-1.20wmf1/resources/startup.js 'touch'
  • 20:46 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/SyntaxHighlight_GeSHi/ 'Deploying GeSHi fix https://gerrit.wikimedia.org/r/#change,4949'
  • 20:04 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: specieswiki and foundationwiki to 1.20wmf1
  • 19:56 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/LiquidThreads/classes/Hooks.php 'Avoid fatals on invalid title in API'
  • 19:51 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All *wiki wikis to 1.20wmf1
  • 19:25 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikiquote and wikiversity projects to 1.20wmf1
  • 19:22 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikibooks to 1.20wmf1
  • 19:18 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikinewses to 1.20wmf1
  • 19:07 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikisources to 1.20wmf1
  • 19:07 logmsgbot_: catrope synchronized wmf-config/mc.php 'Swap out 10.0.2.251 (down) with 10.0.11.24 (spare). This is the last spare, there are now NO SPARES LEFT in mc.php'
  • 19:00 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wiktionaries to 1.20wmf1
  • 18:57 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/LiquidThreads/classes/Dispatch.php 'Added type hint for better fatals'
  • 18:44 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikiversity to 1.20wmf1
  • 18:43 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikiquote to 1.20wmf1
  • 18:41 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikibooks to 1.20wmf1
  • 18:40 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikinews to 1.20wmf1
  • 18:39 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwiktionary to 1.20wmf1
  • 18:32 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikisource to 1.20wmf1
  • 17:20 logmsgbot_: catrope synchronized docroot/bits/ 'Remove static-1.00 again'
  • 16:57 logmsgbot_: catrope synchronized docroot/bits 'Add docroot/bits/static-1.00 for testing'
  • 16:41 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Set wmfUseRevSha1Columns to true for enwiki'
  • 13:30 mutante: applied a patch to etherpad that allows admins to delete pads
  • 12:53 mutante: restarting/fixing etherpad issue
  • 11:08 mark: Sending European bits traffic back to esams
  • 02:30 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Wed Apr 18 02:30:50 UTC 2012
  • 02:21 logmsgbot_: LocalisationUpdate completed (1.19) at Wed Apr 18 02:21:49 UTC 2012
  • 02:13 logmsgbot_: catrope synchronized php-1.20wmf1/README 'Dummy sync to capture which hosts time out on sync-file'
  • 00:52 K4-713: updated production civi to r1631
  • 00:41 Ryan_Lane: adding interface for per-project sudo on OpenStackManager

April 17

  • 23:36 K4-713: updated production civi to r1628
  • 23:12 logmsgbot_: catrope synchronized wmf-config/InitialiseSettings.php 'Fixes for cswiktionary changes per Danny B'
  • 22:49 RoanKattouw: That was bug 34885 of course
  • 22:43 logmsgbot_: catrope synchronized php-1.19/extensions/WikiEditor/ 'Deploy fix for bug 348885'
  • 22:38 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/WikiEditor/ 'Deploy fix for bug 348885'
  • 22:05 K4-713: updated prod civi to r1625
  • 21:51 logmsgbot_: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero needed for carrier testing'
  • 21:42 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'use $wmgUseMathJax'
  • 21:41 logmsgbot_: aaron synchronized wmf-config/InitialiseSettings.php 'use $wmgUseMathJax'
  • 21:38 K4-713: queue consumer re-enabled
  • 21:35 K4-713: updated prod civi to r1623
  • 21:32 logmsgbot_: aaron synchronized wmf-config/InitialiseSettings.php
  • 21:29 logmsgbot_: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/templates/ApplicationTemplate.php 'ec7c5cc'
  • 21:28 logmsgbot_: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r114947'
  • 21:24 logmsgbot_: aaron synchronized wmf-config/InitialiseSettings.php 'Enabled $wgUseMathJax on mediawikiwiki'
  • 20:33 logmsgbot_: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.flagging.php
  • 20:26 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/VisualEditor/ 'Deploy VisualEditor beta warning'
  • 19:52 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Bump mobile resource version'
  • 19:52 logmsgbot_: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/
  • 19:51 logmsgbot_: awjrichards synchronized php/extensions/MobileFrontend/
  • 19:50 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'bumping mobile frontend resource version'
  • 19:01 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php
  • 18:55 logmsgbot_: reedy synchronized php-1.19/includes/api/ApiQueryBlocks.php 'r114941'
  • 18:53 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
  • 18:47 binasher: returning sq68
  • 18:36 binasher: pulling sq68 from pybal for a bit
  • 18:29 RoanKattouw: Did a graceful restart of all job runners using dsh about 15 mins ago
  • 18:29 RoanKattouw: Restarted morebots
  • 07:44 apergos: morebots test
  • 07:44 apergos: restarted varnish service manually a bit ago on sq67 and sq70, the cron job didn't seem to have gone off. restarted morebots too while I was at it
  • 03:37 Jeff_Green: dist-upgrade arsenic
  • 03:29 LeslieCarr: restarting varnish on arsenic again
  • 03:12 maplebed: started a script to delete old objects on ms-be1 for swift truncated object cleaning
  • 02:53 Jeff_Green: dist-upgrade on strontium
  • 02:43 LeslieCarr: restarted varnish on arsenic
  • 02:26 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Tue Apr 17 02:26:40 UTC 2012
  • 02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Tue Apr 17 02:17:24 UTC 2012
  • 01:44 LeslieCarr: restarting varnish on niobium
  • 00:52 LeslieCarr: reloading amslvs4
  • 00:27 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo 'deployed 552ff0f482f3e65e9795fe304dd810e9ae1b03fb'

April 16

  • 23:31 logmsgbot_: catrope synchronizing Wikimedia installation... : Now with a touch of the specific WikiEditor.i18n.php file
  • 23:11 logmsgbot_: catrope synchronizing Wikimedia installation... : Hopefully fixing the WikiEditor messages this time, now with MessagesEn.php touch
  • 23:07 logmsgbot_: catrope synchronizing Wikimedia installation... : Hopefully fixing the WikiEditor messages this time
  • 22:58 logmsgbot_: awjrichards synchronizing Wikimedia installation... : Updating MobileFrontend to r114934
  • 22:49 logmsgbot_: catrope synchronizing Wikimedia installation... : Need to run scap for this WikiEditor change, contains i18n changes
  • 22:39 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/WikiEditor/ 'Deploy WikiEditor revert'
  • 20:53 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/WikiEditor/ 'Actually deploy the recent WikiEditor fixes'
  • 18:58 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Commons Wiki to 1.20wmf1
  • 18:47 logmsgbot_: reedy synchronized php-1.20wmf1/resources/mediawiki/mediawiki.js
  • 18:46 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/WikiEditor
  • 18:37 mutante: manually added iptables nat rules on nfs2
  • 18:13 notpeter: upgrade of udp2log on nfs1/2 complete. should be operating normally now.
  • 17:41 mutante: LDAP on nfs2 warnings - opendj was _just_ started there when puppet was fixed with an unrelated issue
  • 17:38 mutante: restarting opendj on nfs2 because it refused connections
  • 17:08 logmsgbot_: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ 'zero and mobile changes'
  • 16:07 notpeter: upgrading and restarting udp2log on nfs1/2
  • 15:04 mutante: puppet fresh on nfs[12] after removing nonexistent misc::mediawiki-logger class
  • 14:46 mark: Shutdown db24 for memory testing by Chris
  • 13:27 mark: Sending European bits traffic back to pmtpa
  • 12:24 mark: Sending European bits traffic back to esams
  • 12:06 mark: Testing sess_leak_fix2 patch with a snapshot varnish build on cp3001
  • 11:56 Reedy: Ran ddsh -cM -g mediawiki-installation -o -oSetupTimeout=30 -- "cd /usr/local/apache/common && sudo -u mwdeploy ln -s php php-1.18" to create symlink for php-1.18
  • 11:51 Reedy: Killing php-1.18 again
  • 11:48 mutante: sq34 - System halted! Error: Internal Storage Slot, powered down, -> RT
  • 11:45 logmsgbot_: reedy synchronized php-1.18/ 'Symlink php-1.18 back to php (our current main running version) as lots of requests on bits are for 1.18 resources'
  • 11:44 mutante: sq34 was broken and died when connecting to mgmt, powercycling
  • 11:37 mutante: nfs1 - Could not find class misc::mediawiki-logger for nfs1
  • 10:57 Krinkle: bits.wikimedia.org back up, mark fixed it.
  • 10:33 Krinkle: bits.wikimedia.org serving Error 503 Service Unavailable on all load.php requests for mediawiki.org and nl.wikipedia.org, maybe more
  • 09:45 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Set wgEnableJavaScriptTest to true for test2wiki'
  • 02:26 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Mon Apr 16 02:26:58 UTC 2012
  • 02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Mon Apr 16 02:17:57 UTC 2012

April 15

  • 17:35 logmsgbot_: reedy synchronized php-1.20wmf1/includes/api '/me whistles'
  • 17:20 logmsgbot_: reedy synchronized php-1.20wmf1/includes/api
  • 02:25 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sun Apr 15 02:25:58 UTC 2012
  • 02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Sun Apr 15 02:17:19 UTC 2012

April 14

  • 18:14 mark: Shifting European bits traffic back from esams to pmtpa, session leak is still there
  • 17:08 mark: Shifting European bits traffic back from pmtpa to esams
  • 15:31 mark: Reverted varnish to 3.0.2-2wm4 on cp3001; the race condition patch did not fix the problem
  • 14:56 mark: Sending European bits traffic to pmtpa for testing
  • 13:52 mark: Backported varnish bug #897 patch to varnish 3.0.2, testing a snapshot build on cp3001
  • 11:37 mark: Raised session_max to 300000 (runtime) on cp3001/cp3002
  • 05:58 K4-713: re-enabled the queue consumer on aluminium
  • 02:26 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sat Apr 14 02:26:55 UTC 2012
  • 02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Sat Apr 14 02:17:34 UTC 2012
  • 02:16 K4-713: updated prod civi to r1616
  • 01:36 K4-713: turned off queue consumption on prod civicrm
  • 01:36 K4-713: updated production civicrm to r1614

April 13

  • 20:53 mark: Rebooting cp3002
  • 20:37 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114889'
  • 17:54 Jeff_Green: created new repo operations/debs/wikimedia-search-qa to stay within package naming conventions
  • 17:31 notpeter: upgrading udplog on locke to 1.8-2 and restarting, etc
  • 17:27 Jeff_Green: created new operations/debs/search-qa repo for packaging search qa scripts
  • 17:17 notpeter: restarting udp2log on emery
  • 12:53 notpeter: restopping puppet on locke/emery
  • 12:09 mark: Deploying varnish 3.0.2-2wm4 and enabling persistent storage on all even numbered eqiad upload varnish hosts
  • 11:46 mark: Imported varnish 3.0.2-2wm4 into the Wikimedia APT repository
  • 02:48 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 02:39 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Fri Apr 13 02:39:01 UTC 2012
  • 02:20 logmsgbot: LocalisationUpdate completed (1.19) at Fri Apr 13 02:20:35 UTC 2012
  • 01:40 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/templates/ApplicationTemplate.php 'fix robots file'
  • 01:18 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/ 'zero and mobile changes'
  • 01:06 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'fix html formatter'
  • 00:56 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 00:08 Ryan_Lane: rebooting ssl1004

April 12

  • 23:39 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 23:08 logmsgbot: preilly synchronizing Wikimedia installation... : zero rated mobile access changes and mobile frontend updates
  • 21:27 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34923 - namespace required for PORTAL'
  • 19:46 notpeter: stopping puppet on locke and emery
  • 18:41 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying ArticleFeedbackv5 updates
  • 18:22 Reedy: Ran namespaceDupes against bewiki
  • 18:17 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34024 - Install ArticleFeedback on es.wikinews'
  • 18:15 logmsgbot: reedy synchronized php-1.19/resources/startup.js 'touch'
  • 18:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34024 - Install ArticleFeedback on es.wikinews'
  • 18:11 Reedy: Created AFT tables on eswikinews
  • 17:54 RoanKattouw: Running schema updates for ArticleFeedbackv5 on enwiki
  • 17:46 RoanKattouw: Deploying ArticleFeedbackv5 updates to testwiki and rebuilding localization cache
  • 16:54 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Allow bnwiki crats to grant/remove import'
  • 16:49 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35258 - Allow bureaucrats to remove sysop rights on fr.wikipedia'
  • 16:46 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix imports for wm2012'
  • 16:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35917 - allow transwiki imports on wikimania2012'
  • 16:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35666 - Renaming Namespace Wikisource:Author in gu.wikisource'
  • 16:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35694 - Add enotif on page changes in watchlist (guwiki and source)'
  • 16:27 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35818 - Change of Armenian Wikipedia namespace'
  • 16:20 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35905 - Change namespaces configuration - pl.wikipedia'
  • 16:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35261 - Add block permissions in rollback on Lusophone Wikipedia'
  • 16:13 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35823 - Wikijunior and cookbook namespaces for the Vietnamese Wikibooks'
  • 16:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35659 - Set logo for sl.wikiversity'
  • 16:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35853 - Set a non-empty default value for wmgArticleFeedbackBlacklistCategories on WMF wikis'
  • 15:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35878 - Enable e-mail notifications for watchlist (EnotifWatchlist) on tawiki'
  • 15:52 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35852 - Add a category to $wgArticleFeedbackBlacklistCategories for Portuguese Wikipedia to remove AFT from disambiguation pages'
  • 15:10 mutante: gallium - after files have been deleted/moved, puppet back to normal operation (and new clone directory in Apache)
  • 13:23 mutante: killed puppets on gallium
  • 12:33 mark: repooled ssl1002
  • 12:27 mutante: powercycling frozen ssl1002
  • 12:22 mark: Manually depooled down ssl1002 in pybal
  • 02:24 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Thu Apr 12 02:24:29 UTC 2012
  • 02:15 logmsgbot: LocalisationUpdate completed (1.19) at Thu Apr 12 02:15:54 UTC 2012

April 11

  • 22:37 maplebed: deployed more log filters to emery: gerrit/r4758
  • 21:35 LeslieCarr: restarted nrpe on db10
  • 21:33 LeslieCarr: db1004 puppet is fubar
  • 21:33 LeslieCarr: restarted puppet on db30
  • 21:33 LeslieCarr: restarted puppet on mw1110
  • 19:41 notpeter: reimaging bellin and blondel
  • 19:28 logmsgbot: reedy synchronized live-1.5/ 'fix resources symlinks'
  • 19:23 logmsgbot: reedy synchronized live-1.5/ 'fix resources symlinks'
  • 16:54 notpeter: enabling notifications for eqiad lucene vips
  • 16:31 mark: Sending Canadian upload traffic to the eqiad varnish upload cluster
  • 15:59 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 4 to eqiad. for realz this time!'
  • 15:45 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 1 and prefix pool to eqiad. for realz this time!'
  • 15:31 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 2 to eqiad. for realz this time!'
  • 15:15 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 3 to eqiad. for realz this time!'
  • 14:40 notpeter: restarting indexer on searchidx2
  • 13:48 logmsgbot: reedy synchronized php-1.20wmf1/extensions/AbuseFilter/special/SpecialAbuseLog.php
  • 13:35 mutante: applied patch-RT-2804.diff to bugzilla per RT:2804 re: XMLRPC content-type verification
  • 12:07 mutante: moved another list: museum-l -> glam (http://lists.wikimedia.org/pipermail/glam/2012-April/000000.html)
  • 11:58 mark: Setup cp1036 with the persistent storage backend
  • 02:26 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Wed Apr 11 02:26:28 UTC 2012
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Wed Apr 11 02:17:55 UTC 2012
  • 00:11 LeslieCarr: nagios down

April 10

  • 23:50 RoanKattouw: Removed srv187-189 from /etc/dsh/group/job-runners , their jobrunner class has been commented out in puppet since October
  • 23:31 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'bug 35869 - Add strategywiki as an import source on testwiki'
  • 22:53 RoanKattouw: Trying a graceful restart of the job runner on mw1 by sending SIGHUP to the jobs-loop.sh process
  • 22:53 logmsgbot: catrope synchronized php-1.19/extensions/WikimediaMaintenance/jobs-loop.sh 'r114834'
  • 22:24 logmsgbot: reedy synchronized php-1.20wmf1/extensions/CentralAuth/ 'g4102'
  • 22:23 logmsgbot: reedy synchronized php-1.20wmf1/extensions/AntiSpoof/ 'g4103'
  • 21:20 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling mobile URL template for mediawiki.org (using "mediawikiwiki" this time)'
  • 21:18 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling mobile URL template for mediawiki.org'
  • 21:08 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mediawikiwiki to 1.20wmf1
  • 21:04 logmsgbot: reedy synchronized php-1.20wmf1/extensions/MobileFrontend/javascripts 'minified JS'
  • 20:55 logmsgbot: reedy synchronized docroot/ 'Fix symlinks'
  • 20:45 logmsgbot: reedy synchronized docroot/
  • 20:35 logmsgbot: reedy synchronized docroot/
  • 20:31 logmsgbot: reedy synchronized live-1.5/
  • 20:24 logmsgbot: reedy synchronized php-1.20wmf1/ 'Resyncing for apaches with no space'
  • 20:23 logmsgbot: reedy synchronized live-1.5 'Fix symlinks'
  • 20:18 Reedy: Deleting php-1.18 from all apaches due to lack of space
  • 20:14 logmsgbot: reedy synchronized php-1.20wmf1/extensions/PrefSwitch/ 'PrefSwitch is needed by SimpleSurvey'
  • 19:35 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuilding localisation cache for test2/1.20wmf1
  • 19:24 logmsgbot: reedy synchronized wmf-config/ExtensionMessages-1.20wmf1.php 'Sync ExtensionMessages'
  • 19:23 logmsgbot: reedy synchronized php-1.20wmf1/extensions/ 'Would you like some extensions to go with that, sir?'
  • 19:21 LeslieCarr: restarting gmond on db1004 after removing its 5gig log
  • 19:07 logmsgbot: reedy synchronized php-1.20wmf1/LocalSettings.php 'Push LocalSettings out'
  • 19:05 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: test2wiki to 1.20wmf1
  • 19:00 logmsgbot: reedy synchronized php-1.20wmf1/ 'Pushing files for 1.20wmf1'
  • 18:03 logmsgbot: aaron synchronized wmf-config/swift.php 'Catch e bogus empty file names from listings'
  • 14:17 robh: search in eqiad is being reinstalled, no need to be alarmed (thats a pun!)
  • 14:13 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wgLanguageConverterCacheType for git deployment later'
  • 11:50 mutante: pxe boot / reinstall cp1029 - cp1036
  • 11:24 mark: Imported varnish 3.0.2-2wm3 into the Wikimedia APT repository
  • 09:30 apergos: restarted slaving on es1003, it will be a bit before it catches up. patience, young nagios
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Tue Apr 10 02:16:58 UTC 2012
  • 01:33 Tim: on sodium: enabling mod_auth on lists.wikimedia.org by running puppet

April 9

  • 23:14 mutante: migrated foundation-l to wikimedia-l (users/passwords/archive urls/settings stay, old mail address & siteinfo redirect)
  • 22:32 logmsgbot: asher synchronized wmf-config/db.php 'returning db12 as enwiki recentchange/watchlist db'
  • 21:39 LeslieCarr: restarted mysql on es1004 and cleared out its disk space
  • 17:49 LeslieCarr: moving es monitoring to nrpe and variables, may cause false pages if i did it wrong :)
  • 17:36 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Bug 35426 - WebFonts on mr.wikisource.org'
  • 14:54 RobH: i killed eqiad search nodes, woooo
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Mon Apr 9 02:17:22 UTC 2012

April 8

  • 08:45 Nemo_bis: Servers have been very slow, almost unresponsive, and network had a drop of ~0.3 Gb/s, at ~08:35-08:40.
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Sun Apr 8 02:16:58 UTC 2012

April 7

  • 17:55 logmsgbot: reedy synchronized wmf-config/codereview.php 'Remove deferred paths'
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Sat Apr 7 02:16:54 UTC 2012

April 6

  • 22:23 LeslieCarr: deploying new squid config to all squids
  • 22:14 LeslieCarr: added neon into tiertwo of squid allowed hosts
  • 22:13 LeslieCarr: deploying new squid config to amssq35
  • 21:55 LeslieCarr: restarted puppet on spence
  • 21:35 LeslieCarr: moved jenkins_1.458_all.deb to /srv/wikimedia/incoming/ on brewster
  • 21:32 LeslieCarr: restarted squid on brewster
  • 18:27 Ryan_Lane: updating OpenStackManager to r114758 on virt0
  • 17:33 mark: Sending Japanese upload traffic to varnish in eqiad
  • 17:15 mark: Power cycled down host lvs5
  • 16:43 mutante: changed master and started slave on es1004
  • 15:55 mutante: used gerrit create-project to create operations/debs/wikistats.git
  • 14:13 mutante: manganese (gerrit) now sends SSL CA certificate on https, (curl -vvv says verify ok), should resolve RT:2777 and BZ:35709
  • 11:51 mutante: es1004 - rsync was finished, deleted all binlogs from old host, mysqld_safe& , but did not "change master.." and "start slave" (see mail)
  • 11:39 notpeter: restarting lsearchd on search3... again...
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Fri Apr 6 02:17:37 UTC 2012
  • 01:21 Ryan_Lane: updating OpenStackManager to r114757 on virt0
  • 00:18 Ryan_Lane: updating OpenStackManager to r114754 on virt0

April 5

  • 23:49 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Change guwikisource logo to point to the unscaled file instead'
  • 21:46 notpeter: halting db15 for it to await decom
  • 21:39 binasher: started enwiki.revision sha1 migration on db12
  • 21:32 notpeter: restarting lsearchd on search18
  • 21:22 logmsgbot: asher synchronized wmf-config/db.php 'pulling db12, moving enwiki watchlist,recentchange,etc to db53'
  • 21:19 logmsgbot: asher synchronized wmf-config/db.php 'returning db53'
  • 21:17 logmsgbot: py synchronized wmf-config/lucene.php 'pushing all search traffic back to pmtpa'
  • 18:34 Ryan_Lane: updating OpenStackManager to r114746 on virt0
  • 18:19 Ryan_Lane: updating OpenStackManager to r114744 on virt0
  • 16:49 RobH: brewster puppet running again, cisco installs wont work again until i finish puppetizing the files later today
  • 15:41 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool4 to eqiad. this is the smaller wikis shard'
  • 15:40 notpeter: pointing search pool4 to eqiad (this is the "smaller languages" shard)
  • 15:14 Rob_H: puppet daemon being halted on brewster, i need to make local test changes to dhcp
  • 14:52 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search prefix pool live in eqiad'
  • 14:51 notpeter: pushing search prefix pool live in eqiad
  • 14:51 mutante: gallium - disabled incompatible GitTool plugin on jenkins and restarted it
  • 14:34 mutante: importing jenkins_1.458_all.deb to wikipedia apt repo and upgrading it on gallium
  • 14:08 apergos: started rsync in screen session as root on es1003 copying snapshot from es1001 to /a/
  • 14:04 andrewbogott: created labs account for cneubauer
  • 14:02 logmsgbot: py synchronized wmf-config/lucene.php 'pointing enwiki search and enwiki.prefix at eqiad'
  • 14:00 notpeter: pointing enwiki and enwiki.prefix at eqiad search cluster
  • 13:48 mutante: gallium - upgraded all pear packages
  • 13:45 mutante: gallium - upgraded phpunit and php_codesniffer via pear (have been installed via pear before, distro outdated)
  • 13:43 mutante: gallium - upgrading pear
  • 13:33 mutante: installing package upgrades on gallium. apache,apt,postgres,php5-*,ruby,...various libs
  • 13:24 logmsgbot: py synchronized wmf-config/lucene.php 'pointing de, fr, ja, es, ru, nl, pl, pt, zh, and sv search at eqiad'
  • 13:21 notpeter: pointing de, fr, ja, es, ru, nl, pl, pt, zh, and sv search at eqiad
  • 12:27 notpeter: search1 and search4 seem to be dead. restarting lsearchd
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Thu Apr 5 02:16:52 UTC 2012
  • 00:33 Ryan_Lane: updating OpenStackManager to r114730 on virt0
  • 00:24 Ryan_Lane: updating OpenStackManager to r114729 on virt0
  • 00:19 Ryan_Lane: updating OpenStackManager to r114728 on virt0
  • 00:12 Ryan_Lane: updating OpenStackManager to r114726 on virt0
  • 00:00 Ryan_Lane: updating OpenStackManager to r114724 on virt0

April 4

  • 22:16 maplebed: deployed (3rd time's the charm!) udp-filter changes to emery for diederik
  • 22:14 logmsgbot: py synchronized wmf-config/lucene.php 'pointing all search back to pmtpa'
  • 22:13 notpeter: flipping all search back to pmtpa (until tomorrow...)
  • 22:00 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback 'r114717'
  • 21:24 cmjohnson1: replacing power cable to psu1 (bottom) es1
  • 21:22 cmjohnson1: replacing power cable to psu1 (top) es1
  • 21:14 logmsgbot: py synchronized wmf-config/lucene.php 'pointing de, fr, and ja search at lvs pool in eqiad for live testing'
  • 21:12 notpeter: moving de, fr, and ja search to eqiad
  • 21:04 cmjohnson1: replacing power cable on labstore2 array psu2 (right side)
  • 21:00 cmjohnson1: replacing power cable on labstore1 array psu1 (left side)
  • 20:57 cmjohnson1: removing power from bottom power supply labstore2
  • 20:54 cmjohnson1: removing power from top power supply on labstore2
  • 19:44 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Enable wmgArticleFeedbackv5AbuseFiltering on enwiki'
  • 19:40 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Disable wmgArticleFeedbackv5AbuseFiltering on enwiki'
  • 19:14 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114716'
  • 19:12 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Enable wmgArticleFeedbackv5AbuseFiltering on enwiki'
  • 19:04 RobH: dns update for zhen mgmt
  • 18:54 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying AFTv5 update
  • 18:52 logmsgbot: py synchronized wmf-config/lucene.php 'pointing ru, nl, pl, pt, zh, and sv search at lvs pool in eqiad for live testing'
  • 18:51 notpeter: moving ru, nl, pl, pt, zh, and sv search to eqiad
  • 18:27 mutante: nuked /a contents on es1004, started rsync from es1001
  • 18:16 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Add code for wmgArticleFeedbackv5AbuseFiltering'
  • 18:16 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Add wmgArticleFeedbackv5AbuseFiltering, enabled on testwiki only'
  • 17:55 RoanKattouw: Running AFTv5 schema changes on enwiki
  • 17:47 RobH: i didnt crash the site, weeee
  • 17:46 RobH: gracefully restarting apaches
  • 17:46 RobH: pushing out redirects change to apaches for wikipedia.org/com.il redirect to he.wikipedia.org
  • 17:41 binasher: started enwiki.revision sha1 migration on db53
  • 17:38 logmsgbot: asher synchronized wmf-config/db.php 'returning db52, pulling db53'
  • 17:32 RobH: update done, all nameservers still online
  • 17:31 RobH: dns update for wikipedia.org/com.il being resolved
  • 17:08 RoanKattouw: Applying AFTv5 schema change on testwiki
  • 15:30 logmsgbot: py synchronized wmf-config/lucene.php 'pointing eswiki search at lvs pool in eqiad for live testing'
  • 15:28 notpeter: pointing eswiki search at eqiad
  • 12:51 mutante: db1007 - add mysql startup via 'update-rc.d mysql defaults'
  • 12:42 apergos: started mysqld on db1007 via /etc/init.d/mysql (this doesn't seem to point to a special fb build, and can't seem to find one on this host, what's up with that?)
  • 12:31 apergos: rebooted db1007, it was dead in the water (also no helpful messages on console, bah)
  • 11:16 mutante: enabled Renameuser extension on wikitech, renamed tchay per RT request, disabled extension again (it was installed but disabled)
  • 02:19 logmsgbot: LocalisationUpdate completed (1.19) at Wed Apr 4 02:19:03 UTC 2012
  • 01:50 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/FileOp.php 'deployed r114697'
  • 01:39 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'

April 3

  • 23:17 LeslieCarr: updating bgp policies on cr1.sdtpa
  • 22:44 LeslieCarr: reinstalling neon
  • 22:04 maplebed: rolled back changes to emery in udp-filter due to the new binary crashing.
  • 21:50 maplebed: ran /etc/init.d/udp2log reload on emery to enact the puppetted changes
  • 21:41 maplebed: deploying new udp-filter and teahouse filters to emery for diederik
  • 20:13 notpeter: restarting lsearchd on search7. was toasted
  • 18:37 logmsgbot: root synchronized wmf-config/mc.php
  • 18:37 RobH: syncing new mc.php, forgot to check for all three of the servers i took down, oops.
  • 18:28 RobH: shutting down mw28, mw49, & mw58 for rack relocation due to power overload in d2-pmtpa, relocation to d1-sdtpa per rt 2692
  • 17:59 K4-713: Synchronized payments cluster to r114642
  • 17:52 logmsgbot: reedy synchronized php-1.19/extensions/MobileFrontend/
  • 17:38 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix commas in guwikisource namespaces'
  • 17:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix commas in guwikisource namespaces'
  • 16:38 RobH: bringing down srv237 for phase balancing
  • 16:37 RobH: srv230 back in rotation
  • 16:26 RobH: shutting down srv230 for power phase move per rt 2759
  • 16:10 RobH: updating brewster to use new dhcp files for cisco, no more local hackin.
  • 15:53 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35482 - Add Patroller & Autopatroller groups on ml.wikisource'
  • 15:51 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35482 - Add Patroller & Autopatroller groups on ml.wikisource'
  • 15:41 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35545 - Grant the abusefilter-log-detail right to patrollers on Commons'
  • 15:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35545 - Grant the abusefilter-log-detail right to patrollers on Commons'
  • 15:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35624 - Subject namespace for the Vietnamese Wikibooks'
  • 15:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35603 - Enable Transwiki import on KN:WP'
  • 15:16 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35581 - Closure of nz.wikimedia.org'
  • 15:15 logmsgbot: reedy synchronized closed.dblist 'Bug 35581 - Closure of nz.wikimedia.org'
  • 13:35 Tim: manually reloaded rsyslogd on all apaches
  • 06:16 Tim: deploying limited/split apache syslog (https://gerrit.wikimedia.org/r/#change,4149)
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Tue Apr 3 02:16:32 UTC 2012
  • 00:37 logmsgbot: aaron synchronized php-1.19/includes/Block.php 'deployed r114672'

April 2

  • 23:54 Tim: restarting all apaches with apache-restart-all-hard
  • 23:51 logmsgbot: tstarling synchronized php-1.19/extensions/ConfirmEdit/FancyCaptcha.class.php
  • 23:37 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 23:36 maplebed: cleared the varnish cache for preilly
  • 23:34 Tim: on all apaches: running logrotate -f and deleting the resulting backup syslog files, to free up disk space
  • 23:32 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r114673'
  • 23:21 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version number'
  • 23:05 logmsgbot: awjrichards synchronizing Wikimedia installation... : Deploying MobileFrontend changes at r114671 per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#2_April.2C_2012
  • 21:43 maplebed: reverted changes to emery's logging due to a broken package in the deploy.
  • 21:30 LeslieCarr: turned down ms7's secondary ethernet port to prevent the flapping (stupid sun boxes)
  • 19:51 maplebed: deploying new udp-filter to emery rt-2501 gerrit/r4120
  • 19:51 notpeter: running authdns-update on dobson
  • 18:30 RobH: brewster puppet daemon stopped, doing local hacks
  • 18:17 RobH: removed old bin files on db1004 and prolly borked it by removing the wrong files
  • 17:54 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php '35436 - Enable Narayam at Hindi Wikipedia'
  • 17:47 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add disable images option for default on zero domain'
  • 17:45 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Bug 35328 - Enable WebFonts for fr.wikisource.org'
  • 17:40 logmsgbot: nikerabbit synchronized php-1.19/languages/Names.php 'I18ndeploy r114656'
  • 17:15 preilly: carrier testing push for DIGI
  • 17:15 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
  • 16:46 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Mon Apr 2 02:16:47 UTC 2012

April 1

  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Apr 1 02:17:22 UTC 2012

March 31

  • 10:22 mutante: srv222,225 were also upgraded but stopping there for now in favor of reinstalls
  • 09:58 mutante: nuked /usr/share/doc on a couple srv's, hey at least 700MB or something, and yes we really should reinstall with a decent partitioning scheme as Mark said
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 31 02:18:10 UTC 2012

March 30

  • 19:37 hashar: configured jenkins on gallium to use smtp.pmtpa.wmnet as outgoing SMTP server
  • 19:28 RobH: puppet daemon restarted on brewster
  • 18:13 RobH: killing puppet daemon on brewster, i need to hack at local configuration for cisco server stuff
  • 12:56 mutante: db1047 - added system startup for /etc/init.d/mysql
  • 12:47 mutante: powercycling db1047
  • 12:28 mutante: deleted old kernel sources on upgraded srvs for that little extra space during peaks, suggesting to nuke /usr/share/doc if there should be more disk space warnings
  • 10:41 mutante: same for srv223
  • 09:18 mutante: srv224,srv219,srv220, upgrade apache, dist-upgrading w/ kernel, disabling ureadahead, rebooting one by one
  • 08:06 mutante: storage3 - gmond unable to find the metric information for any mysql_* .."module has not been loaded", starting mysql, running puppet ...
  • 07:57 mutante: powercycling storage3
  • 07:03 Tim: running bug 35578 cleanup script in screen on fenari
  • 06:41 logmsgbot: tstarling synchronized php-1.19/includes/specials/SpecialBlock.php
  • 06:40 logmsgbot: tstarling synchronized php-1.19/includes/specials/SpecialBlock.php
  • 06:39 logmsgbot: tstarling synchronized php-1.19/includes/specials/SpecialBlock.php
  • 06:15 Tim: killed vi on fenari owned by awjrichards, locking CommonSettings.php for two days
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 30 02:17:56 UTC 2012
  • 01:13 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove more crap'
  • 01:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove some dupe code'
  • 01:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove wmgUsabilityPrefSwitch'
  • 00:59 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove wmgUsabilityPrefSwitch'
  • 00:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove unused wmgUseUsabilityInitiativeAlpha'

March 29

  • 23:49 logmsgbot: aaron synchronized php-1.19/includes/revisiondelete/RevisionDeleteUser.php 'deployed r114619'
  • 21:20 LeslieCarr: rebooting db47
  • 20:28 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Swap wgUseCommaCount to wgArticleCountMethod'
  • 20:07 notpeter: restarting lsearchd on search2 to del the logfile to end all logfiles
  • 20:05 RoanKattouw: Stopping and starting Gerrit on manganese to apply Chad's change of the -1 text in the DB
  • 20:02 notpeter: restarting lsearchd on search7 to del the logfile to end all logfiles
  • 18:11 logmsgbot: catrope synchronized php-1.19/extensions/ClickTracking/ClickTracking.hooks.php
  • 17:59 RobH: search1021 coming back up, done with tests
  • 17:53 RobH: search1021 coming down for ssd fit test
  • 17:07 notpeter: disabling notifications for search lvs nagios checks for 24 hours to test fix
  • 15:42 notpeter: finished cleaning up all pmtpa search hosts. hey look! they all have lots of space now!
  • 15:15 notpeter: restarting lsearchd on search3
  • 15:02 RobH: brewster puppet re-enabled
  • 15:02 RobH: virt1001 pxe boots via dhcp and fails tftp download, i have to hold off on further troubleshooting until i have a network admin
  • 14:47 RobH: did virt1001 wrong, reupdating dns
  • 14:39 RobH: all nameservers still online after update
  • 14:37 RobH: updating dns for virt1001 testing
  • 14:29 RobH: stopping puppet runs on brewster so my hacking at the dhcpd.conf file won't get overwritten until I have it working right
  • 14:01 Jeff_Green: restarted varnish on cp3002 because it was thrashing futilely
  • 13:45 notpeter: rebooting (mostly) down cp3001
  • 13:17 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Add participation namespace to metawiki per request'
  • 13:11 notpeter: trimming logs and such on search1-20
  • 09:59 mutante: srv221, disabling ureadahead, installing package upgrades and new kernel, rebooting
  • 09:40 mutante: kill and start lsearchd on search7
  • 09:36 mutante: restarted defunct lsearchd on search6
  • 09:10 mutante: gallium - added demon,hashar,reedy to group jenkins as it's a problem using puppet when users and groups already exist
  • 06:25 mutante: powercycling sq40
  • 06:21 mutante: installed more package upgrades on sodium
  • 05:58 mutante: installed security upgrades on brewster, cadmium, capella (apache,mysql,ruby,apt..)
  • 05:49 mutante: db42 - mysql did not autostart after boot, added using update-rc.d
  • 05:42 mutante: db42 - reboot worked despite the grub warning about unreliable blocklists
  • 05:37 mutante: rebooting db42 to finish upgrades
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 29 02:17:53 UTC 2012

March 28

  • 23:27 Tim: running apt-get upgrade on mw22,mw66,srv193,srv250,srv253,srv236
  • 23:25 Tim: cleaned up stuck apt-get process on srv236
  • 23:22 Tim: cleaned up stuck apt-get processes on mw22,mw66,srv193,srv250,srv253
  • 21:44 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobile frontend resrouce version'
  • 21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/application.min.js 'r114576'
  • 21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/banner.min.js 'r114576'
  • 21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114576'
  • 21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r114576'
  • 21:42 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/application.js 'r114576'
  • 21:42 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/banner.js 'r114576'
  • 20:43 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 28 20:43:20 UTC 2012
  • 20:29 notpeter: restarted search1020. nothing conspicuous in logs
  • 19:56 RoanKattouw: Running a patched version of l10nupdate that rebuilds the localization cache
  • 18:49 logmsgbot: catrope synchronizing Wikimedia installation... : Bugfixes for ArticleFeedbackv5, ArticleFeedback and ClickTracking
  • 16:47 cmjohnson1: msw1-d1-pmtpa replacement complete
  • 16:34 cmjohnson1: replacing msw-d1-pmtpa per rt2639
  • 15:36 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bump account creation limit per request on -tech'
  • 15:34 Reedy: srv221 is full
  • 15:34 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bump account creation limit per request on -tech'
  • 14:39 RobH: restarted morebots in screen on wikitech, no longer as catrope, as roan has root on that box
  • 14:36 RobH: got virt1001 to pxe, but dhcp doesnt know how to handle, need subnet details.
  • 14:34 notpeter: lucene hosed on search9 and search15. restarting, then will look after cause
  • 13:14 Jeff_Green: restarting puppet/puppetmaster on stafford to experiment with report settings
  • 02:10 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 28 02:10:34 UTC 2012

March 27

  • 23:12 logmsgbot: tstarling synchronized php-1.19/cache/trusted-xff.cdb
  • 20:19 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33789 - Enable botadmin usergroup on ml.wikipedia'
  • 19:20 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix lezwiki namespace'
  • 19:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove ruwiki arbcom talk from namespaceprotection'
  • 19:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35138 - Create Gujarati Wikisource'
  • 18:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35138 - Create Gujarati Wikisource'
  • 18:22 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35138 - Create Gujarati Wikisource'
  • 18:16 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34351 - Create Wikisource in Belarusian'
  • 18:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34351 - Create Wikisource in Belarusian'
  • 18:10 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34351 - Create Wikisource in Belarusian'
  • 18:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35290 - Create Slovenian Wikiversity'
  • 18:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35290 - Create Slovenian Wikiversity'
  • 17:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Config for lezwiki'
  • 16:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Config for lezwiki'
  • 16:56 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Config for lezwiki'
  • 16:48 logmsgbot: reedy ran sync-common-all
  • 16:32 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'prep work for new wikis'
  • 16:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34527 - Create a Arbcom namespace on Russian Wikipedia'
  • 16:06 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34527 - Create a Arbcom namespace on Russian Wikipedia'
  • 15:47 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35161 - Incubator configuration updates'
  • 15:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 32825 - Favicon for siwiki'
  • 14:40 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35516 - Add Skin: namespace to MW.org'
  • 08:15 apergos: test you silly morebot
  • 07:59:56 hashar: archived old server admin logs since the old page was too long for my connection to download :-/
  • 06:59:02 apergos: powercycled emery, it was unresponsive via the mgmt console and not pingable
  • 02:17:52 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 27 02:17:52 UTC 2012
  • 00:56:51 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114507'
  • 00:55:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/LeaveFeedbackTemplate.php 'r114508'
  • 00:42:50 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bmping resource version for MobileFrontend'
  • 00:41:58 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/beta_common.css 'r114509'
  • 00:37:30 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version #'
  • 00:36:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/MobileFrontendTemplate.php 'r114507'
  • 00:36:09 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/LeaveFeedbackTemplate.php 'r114508'
  • 00:35:50 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/beta_common.css 'r114508'
  • 00:08:55 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114506'

March 26

  • 23:18:17 logmsgbot: awjrichards synchronizing Wikimedia installation... : Syncing MobileFrontend to r114504 changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#26_March.2C_2012
  • 22:44:53 RobH: also rolling firmware to ps1-d[1|2|3]-pmtpa
  • 22:28:10 RobH: pushing firmware updates to servertechs in sequence: ps1-[a2|a3|a4|a5|b2|b3|b4|b5|c1|c2|c3|d1|d2|d3]-sdtpa, disregard any errors from rebooting alerts
  • 19:55:09 notpeter: stopping puppet on search6 and search15 for 24 hours to test new log rotation script
  • 19:19:35 RobH: cp1019 memory replaced per rt 2651
  • 19:07:14 apergos: rebooting ms1001 (new kernel)
  • 17:53:34 RobH: cp1019 coming down for memory replacement per rt 2651
  • 17:51:39 RobH: fluorine disk upgrade done, os install pending, details on rt 2350
  • 17:43:48 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/ 'i18ndeploy r114492'
  • 17:36:51 RobH: fluorine coming down for new disks
  • 17:14 notpeter: backing up plwiki.nspart1 index on search7, deleting working copy, and restarting lsearchd. (note: this will probably cause some downtime on some languages while the proc restarts...)
  • 15:18 RobH: db59 has errors, but as it was a fusion io testbed server, it is more than likely tweaked for such, it is not in any rotation
  • 14:54 RobH: db59 shutting down for io card removal per rt 2589
  • 13:37 mutante: while on it, installing a whole bunch of package updates on db42
  • 13:25 mutante: db42 was out of disk , caused by ~5G citations.csv in /tmp, gzipped the file
  • 09:59 mutante: ..and on ms-be-3. running puppet on db59
  • 09:43 mutante: another corrupted .yaml file on ssl2
  • 09:33 mutante: brewster - delete puppet lock file, restart lighttpd, puppet ...
  • 09:05 mutante: brewster was out of disk - deleted lighttpd access.log.1, gzipped access.log
  • 08:24 mutante: on several mw* boxes puppet did not run because .yaml files on the puppetmaster became corrupted. need to delete the $hostname files in /var/lib/puppet/yaml/node on stafford and re-run. puppet bug similar to http://projects.puppetlabs.com/issues/7836
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 26 02:18:03 UTC 2012

March 25

  • 22:26 RobH: row b servertech firmware in eqiad all updated, should clear alarms as they come back online
  • 22:18 RobH: firmware updates on servertechs in row b eqiad, disregard alarms
  • 20:14 RobH: to fellow ops, you can disregard those observium errors, as I caused them
  • 20:13 RobH: firmware updated on all power strips in row a eqiad.
  • 16:22 RobH: ps1-a1-sdtpa firmware update complete
  • 16:15 RobH: updating firmware on ps1-a1-sdtpa
  • 16:14 RobH: ps1-b1-sdtpa firmware updated successfully
  • 16:14 RobH: ps1-a1-eqiad firmware updated successfully
  • 16:09 RobH: updating firmware on ps1-a1-eqiad and ps1-b1-sdtpa
  • 16:07 RobH: updated firmware successfully on ps1-a8-eqiad, if it has observium alarms now then there are bigger issues.
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 25 02:17:21 UTC 2012
  • 00:59 LeslieCarr: admin down asw-a-eqiad xe-1/1/2 and cr2-eqiad xe-5/0/0 due to framing errors causing packet loss and lacp sporadic timeouts. source of the issue

March 24

  • 19:46 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'Following a performance regression reported on wikitech-l, added merciless profiling to ExtMobileFrontend::DOMParse()'
  • 19:46 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'Following a performance regression reported on wikitech-l, added merciless profiling to ExtMobileFrontend::DOMParse()'
  • 17:35 mark: Migration from br1-knams to cr2-knams completed.
  • 17:09 mark: Migrated second knams-esams dark fiber link from br1-knams to cr2-knams
  • 16:36 mark: Corrected MTU setting on cr2-knams's AMS-IX interface
  • 16:20 Reedy: Some European users reporting routing issues
  • 16:01 mark: Cleared OSPF session between csw1-esams and csw2-esams which magically made some internal routes reappear
  • 15:40 mark: Brought up AMS-IX ipv4 BGP sessions
  • 15:30 mark: Brought up AMS-IX ipv6 BGP sessions
  • 15:25 mark: Moved AMS-IX connection to cr2-knams:xe-1/1/0
  • 15:22 mark: Shutdown all AMS-IX BGP sessions
  • 15:06 mark: Disabled BFD on OSPF3 between cr2-knams and csw1-esams
  • 14:49 mark: Moved AS6908 and AS1257 PIs to cr2-knams
  • 14:18 mark: Brought up AS13030 and AS1299 BGP sessions on cr2-knams
  • 13:57 mark: Shutdown AS1299 BGP session on br1-knams
  • 13:14 mark: Established full iBGP mesh with added router cr2-knams. cr2-knams now has full Internet connectivity.
  • 12:48 mark: Moved fiber from br1-knams:e1/2 to cr2-knams:xe-0/0/0
  • 12:44 mark: Disabled br1-knams:e1/2 (DF leg 1 to esams)
  • 12:43 mark: Rack mounted and powered up cr2-knams
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 24 02:17:02 UTC 2012

March 23

  • 23:49 logmsgbot: reedy synchronized php-1.19/extensions/MoodBar/ 'r114466'
  • 23:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34005 - Change uploader flag configuration on Russian Wikipedia'
  • 23:13 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34005 - Change uploader flag configuration on Russian Wikipedia'
  • 23:09 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wmgAutopromoteOnce'
  • 23:08 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wgAutopromoteOnce'
  • 23:07 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wgAutopromoteOnce'
  • 23:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wmgAutopromoteOnce empty arrays'
  • 22:24 RobH: scs-a1-eqiad back online
  • 21:58 RobH: scs-a8-eqiad coming down for re-grounding
  • 19:51 RobH: all power strips in eqiad are now properly grounded
  • 18:12 maplebed: removed ms1 and most of ms2 from the production swift rings. no effect expected.
  • 18:04 logmsgbot: asher synchronized wmf-config/db.php 'returning db32, pulling db52 for migration'
  • 16:44 RobH: cp1019 in middle of firmware update, please dont touch
  • 16:44 RobH: cp1017 memory error seems to have cleared post firmware update, will keep an eye on it for the rest of the day
  • 16:09 RobH: raid rebuilding on magnesium, however swift stuff is kind of black box mystery right now to me, need Ben to review magnesium later for that
  • 15:53 RobH: magnesium coming back online
  • 15:44 RobH: shutting down magnesium for disk swap
  • 15:37 RobH: firmware updating on cp1017, no one touch it please
  • 15:30 RobH: db1020 can go back into whatever rotation Asher wants it in
  • 15:29 RobH: db20 memory error on raid controller resolved with firmware update
  • 06:39 logmsgbot: tstarling synchronized php-1.19/includes/filerepo/file/LocalFile.php 'r114442'
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 23 02:18:35 UTC 2012
  • 01:55 mutante: deleting puppet report files older than 60hours on stafford to free disk space

March 22

  • 23:30 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
  • 23:18 RobH: db1020 firmware still updating, will check on it later tonight. offline until then
  • 22:19 notpeter: all 3 dns servers are responding to digs after reload
  • 22:10 notpeter: pushing a new zone file to add 2 more search-related vips for eqiad
  • 20:52 notpeter: stopping puppet on brewster temporarily
  • 20:25 notpeter: rebuilding search1015 and 1016 for disk shuffles
  • 20:01 RobH: magnesium going down and up again, troubleshooting the disks
  • 19:47 apergos: rebooting ms1002, had stuck rsyncs, and kswapds at 100% cpu, weirdness like "ls /export/upload/wikipedia/am/0/00" hanging.
  • 18:08 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
  • 15:45 RobH: search 1015 and search1016 back up with added disks
  • 15:08 RobH: shutting down search1015 & search1016 for hdd additions
  • 14:45 RobH: db1020 still offline, requires firmware update on raid controller per rt 2621, will perform later today
  • 14:33 logmsgbot: reedy synchronizing Wikimedia installation... :
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 22 02:17:47 UTC 2012
  • 01:14 K4-713: Re-enabled the donations queue consumer in Jenkins
  • 00:28 binasher: started enwiki.revision alter on db32
  • 00:26 binasher: disabled lvm snapshots and puppet on db32 for revision sha1 alter
  • 00:24 logmsgbot: asher synchronized wmf-config/db.php 'pulling db32 for revision alter'

March 21

  • 22:27 ^demon|away: wmf-deployed extensions now r/o in SVN
  • 21:52 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php
  • 21:27 Ryan_Lane: bringing up all instances on virt3
  • 21:08 cmjohnson1: swapped 2 DIMMS in virt3 (b2 and b5)
  • 21:01 Ryan_Lane: shutting down virt3 to replace dimms
  • 20:47 ^demon: /trunk/phase3 is now r/o in SVN
  • 20:21 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Disable prefswitch'
  • 20:10 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Set $wgArticleFeedbackv5OversightEmails on enwiki'
  • 18:59 maplebed: rebooted ms-be3 after it crashed.
  • 18:51 binasher: brought db24 back up after hang, and reslaving, but leaving out of db.php. just replicating until a replacement s2 snapshot host is built
  • 18:51 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying ArticleFeedbackv5 update
  • 18:46 logmsgbot: asher synchronized wmf-config/db.php 'returning db36'
  • 18:35 logmsgbot: asher synchronized wmf-config/db.php 'pulling db24, failing hw'
  • 18:03 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php
  • 18:01 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Temporarily disable ShortUrl on testwiki because we think it might conflict with ArticleFeedbackv5'
  • 17:59 K4-713: updated and synchronized payments cluster to r114382
  • 17:47 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 12:25 notpeter: disabling notifications for search-pool1
  • 08:58 mutante: rebooting ms-be4
  • 08:37 mutante: stopped/started lsearchd on search9
  • 08:05 mutante: ms-be4 down but cant powercycle it yet..Unable to establish LAN session / ipmitool /ipmi_mgmt
  • 07:58 mutante: restarted lsearchd on search3 and 9
  • 05:23 logmsgbot: tstarling synchronized php-1.19/includes/parser/CoreParserFunctions.php
  • 05:23 logmsgbot: tstarling synchronized php-1.19/includes/parser/Parser.php
  • 05:23 logmsgbot: tstarling synchronized php-1.19/includes/parser/StripState.php
  • 05:22 logmsgbot: tstarling synchronized php-1.19/tests/parser/parserTests.txt
  • 03:51 mutante: added "lez" to langlist and running authdns-update, for lez.wikipedia per RT-2665
  • 03:29 mutante: magnesium - shutting down, has existing RT-2669 to replace disk
  • 03:18 mutante: magnesium - "..drive on port B of the Serial ATA controller is operating outside of normal specifications.. Strike F1 key to continue"..
  • 03:16 mutante: powercycling magnesium - down and just "init: tty4 main" on mgmt, frozen
  • 03:10 mutante: running puppet on aluminium
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 21 02:18:10 UTC 2012
  • 01:06 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114342'
  • 00:25 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'fix beta logo'
  • 00:03 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'fix beta logo'

March 20

  • 23:19 Ryan_Lane: fixing the zero redirect
  • 22:46 logmsgbot: reedy synchronized wikipedia.dblist 'test'
  • 22:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/api/ApiQueryExtracts.php 'r114319'
  • 22:09 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping resource version # for MobileFrontend'
  • 21:54 logmsgbot: awjrichards synchronizing Wikimedia installation... : Pushing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#20_March.2C_2012
  • 21:46 binasher: stopped eqiad bits servers from udplogging to emery, packet loss is back to zero
  • 20:59 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
  • 20:17 binasher: killed enwiki.revision sha1 migrator (upgrade-1.19wmf1-2.php). after db36 completes, will run the rest by hand
  • 19:52 Ryan_Lane: pushing change for zero.wikipedia.org to redirect to the english message
  • 19:41 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page'
  • 19:16 cmjohnson1: pulling disk 5 on virt1 for reseating
  • 18:34 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page'
  • 18:02 pgehres: flipped Template:CC-status on wmfwiki since credit cards are still disabled on payments.wikimedia.org
  • 17:52 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35193 - Enable sub page feature in Telugu Wikisource'
  • 17:49 notpeter: restarting lsearchd on search10
  • 17:30 logmsgbot: aaron synchronized php-1.19/includes/filerepo/file/LocalFile.php 'deployed r114285'
  • 17:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Revert that then'
  • 17:01 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Test something for sewikimedia'
  • 16:42 logmsgbot: reedy synchronized wmf-config/abusefilter.php 'Bug 35355 - Reset of permissions for Hindi Wikipedia'
  • 16:40 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove hiwiki botadmin from wgGroupsRemoveFromSelf'
  • 15:52 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35296 - Namespace names changing on Komi Wikipedia'
  • 15:50 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35296 - Namespace names changing on Komi Wikipedia'
  • 15:49 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35296 - Namespace names changing on Komi Wikipedia'
  • 15:30 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35161 - Incubator configuration updates'
  • 15:25 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 31209 - Enable the WikiLove extension for incubator'
  • 14:57 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove more group dupes'
  • 14:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35355 - Reset of permissions for Hindi Wikipedia (hiwiki)'
  • 14:14 logmsgbot: reedy synchronizing Wikimedia installation... : scapping for r114268
  • 14:08 logmsgbot: reedy synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'r114268'
  • 09:12 mutante: new URL pointing to Wikipedia Education Program - http://education.wikimedia.org
  • 08:59 mutante: several srv's said they were unable to contact NTP server
  • 08:57 mutante: apache-graceful-all to deploy changed redirects.conf
  • 08:53 logmsgbot: tfinc synchronized wmf-deployment/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'Fixes file pages showing data charge warnings'
  • 07:42 mutante: running authdns-update after adding education.wm for redirect RT:2634
  • 06:21 logmsgbot: tstarling synchronized php-1.19/includes/User.php
  • 05:35 logmsgbot: asher synchronized wmf-config/db.php 'pulling db36 during db migration'
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 20 02:17:55 UTC 2012
  • 00:25 logmsgbot: awjrichards synchronizing Wikimedia installation... : Reverting MobileFrontend to r113973
  • 00:15 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114221'
  • 00:07 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Enabling zero rated mobile access everywhere'
  • 00:05 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping version number for MobileFrontend resources'

March 19

  • 23:54 logmsgbot: awjrichards synchronizing Wikimedia installation... : Redoing accidentally aborted scap, Pushing changes to MobileFrontend per https://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#19_March.2C_2012
  • 23:51 logmsgbot: awjrichards synchronizing Wikimedia installation... : Pushing changes to MobileFrontend per https://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#19_March.2C_2012
  • 23:35 AaronSchulz: fixed a few files, on commons and other wikis, with empty oi_archive_name values even though the file was on NFS
  • 23:20 Ryan_Lane: restarting all nginx servers
  • 23:20 Ryan_Lane: added a new proxy to the ssl configuration to temporarily proxy access to wikimania videos being transcoded
  • 21:38 binasher: creating "ops" db and related grants on prod db clusters 2-7 to prep rollout of ishmael / pt-digest beyond s1
  • 21:17 binasher: started enwiki.revision sha1 alter on production side
  • 20:57 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Removing debugging code from MobileFormatter'
  • 20:54 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Adding more debugging code to MobileFormatter'
  • 20:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Adding more debugging code to MobileFormatter'
  • 20:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Adding more debugging code to MobileFormatter'
  • 20:31 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Adding debugging code to MobileFormatter'
  • 20:07 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/jquery.articleFeedback/jquery.articleFeedback.js 'r114176'
  • 19:41 Ryan_Lane: bringing virt3 instances back up
  • 19:33 binasher: deploying new frontend squid conf to add support for mf_useformat cookie [rt 2645]
  • 19:18 K4-713: CiviCRM 4.1.1 update script finished executing on prod.
  • 19:12 Ryan_Lane: shutting down virt3 for memory reseating
  • 19:09 K4-713: Started the CiviCRM 4.1.1 update script on prod.
  • 19:08 mark: Rebuilding RAID arrays on brewster
  • 18:58 K4-713: Put production civicrm / drupal instance in offline mode for upgrade
  • 18:54 K4-713: Disabled all production CiviCRM Jenkins jobs, for CiviCRM upgrade.
  • 18:54 cmjohnson1: brewster HDD replacement complete
  • 18:42 mark: Shutting down brewster for HDD replacement
  • 18:26 Jeff_Green: killed kill-slow-queries on db1008 for the duration of the civicrm upgrade
  • 18:19 logmsgbot: nikerabbit synchronized php-1.19/includes/Linker.php 'i18ndeploy r114160'
  • 18:19 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/resources/ext.webfonts.fontlist.js 'i18ndeploy r114160'
  • 18:14 mark: Running smartctl -t long /dev/sdb on brewster
  • 12:58 logmsgbot: hashar synchronized php-1.19/includes/SiteStats.php 'Reenable SiteStatsInit::articles() for bug 35169. SiteStatsInit::doAllAndCommit() still disabled since it breaks the site'
  • 10:28 logmsgbot: tstarling synchronized wmf-config/PoolCounterSettings.php 'increased max queue from 50 to 100 on reports that the limit was reached on the enwiki main page in normal operation'
  • 09:11 mutante: nomcom and langcom wikis look kind of broken, redirecting to pages on incubator with "Error: This page is unprefixed! "
  • 08:49 mutante: making (almost) all private wikis https-only per RT-2565, vi remnant.conf,sync,graceful...
  • 07:30 mutante: running sync-apache after making a change to remnant.conf to make grants.wm https-only
  • 05:09 Ryan_Lane: bringing up most instances on virt3, doing so by project priority
  • 04:42 Ryan_Lane: bringing up all instances on virt4, waiting 30 seconds between instances
  • 04:25 Ryan_Lane: bringing up all instances on virt2, waiting 30 seconds between instances
  • 04:09 Ryan_Lane: bringing up all instances on virt1, waiting 30 seconds between instances
  • 04:00 Ryan_Lane: attempting to bring some instances up
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 19 02:17:17 UTC 2012
  • 01:15 mutante: killed, updated, restarted wikibugs bot per request in RT:2656, should have fixed bugzilla:18831

March 18

  • 23:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35308 - Install mw:Extension:DynamicPageList (Wikimedia) on Portuguese Wikipedia (ptwiki)'
  • 19:20 Ryan_Lane: stopping all labs instances, manually recovering gluster volume
  • 15:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35295 - Missing a in abusefilter-hide-log permission for oversighters'
  • 10:49 Ryan_Lane: rebooting virt4 thanks to defunct libvirt process
  • 03:43 Ryan_Lane: bringing all labs instances up
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 18 02:18:51 UTC 2012
  • 01:09 Ryan_Lane: rebooting all of the virt hosts, gluster is having major issues
  • 00:43 Ryan_Lane: rebooting virt2
  • 00:40 Ryan_Lane: restarting glusterfs on virt2
  • 00:11 Ryan_Lane: rebooting virt3, libvirt is non-responsive
  • 00:00 Ryan_Lane: bringing up instances that were downed on virt3

March 17

  • 23:50 Ryan_Lane: virt3 crashed, powercycling it
  • 23:34 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove old comments'
  • 23:24 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove old comments'
  • 23:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Simplify config somewhat'
  • 23:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Simplify config somewhat'
  • 23:02 logmsgbot: catrope synchronizing Wikimedia installation... : Have to scap for that AFTv5 change to propagate i18n change
  • 22:52 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ 'r114087'
  • 21:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35289 - Add wikisource logo to mobile wikisource gateway'
  • 02:21 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 17 02:21:03 UTC 2012
  • 01:23 AaronSchulz: FindFilesMissingDBRows.php done, list under aaron/output/missingFileDBRows
  • 00:11 AaronSchulz: Running FindFilesMissingDBRows.php on all wikis

March 16

  • 21:21 binasher: running enwiki.revision sha1 schema migrations on eqiad side
  • 20:12 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild moodbar messages
  • 20:03 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Re-enable moodbar on enwiki'
  • 19:53 logmsgbot: reedy synchronized php-1.19/extensions/MoodBar/ 'r114030'
  • 19:15 Reedy: Ran namespaceDupes on stewardwiki
  • 17:11 RobH: hdd in search1017/1018 replaced per rt 2583
  • 16:54 RobH: search1017 and search1018 coming down for hdd swap
  • 16:53 RobH: cp1017 back in service pool
  • 16:43 RobH: cp1019 back in full service
  • 16:22 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/specials/SpecialCentralAuth.php 'r114021'
  • 16:22 RobH: cp1017 memory error, coming down for troubleshooting.
  • 16:18 RobH: cp1019 memory error cleared after reseating, notes on rt 2651
  • 16:09 mark: Migrated all varnish3 packages to newer varnish packages from git
  • 16:08 RobH: cp1019 coming down for memory error troubleshooting
  • 15:58 RobH: cp1040 repaired per rt 2611
  • 15:48 RobH: cp1040 down for memory replacement
  • 15:09 logmsgbot: reedy synchronized stylize.php 'Test for hume'
  • 15:04 logmsgbot: root synchronized ufg.sql 'test sync to see if hume is fixed'
  • 14:55 logmsgbot: reedy synchronized php-1.19/extensions/WikimediaMaintenance/cleanupBug31576.php
  • 14:04 apergos: restarted swift-container-auditor on ms-be3, it had died for some reason
  • 08:07 mutante: i reverted that (star cert for wikitech), no worries i "shred"ded the files
  • 07:51 mutante: replaced self-signed cert on wikitech with the star cert
  • 04:19 mutante: on stafford, deleting spence's puppet report files to free some disk space (they are like the largest report files of all)
  • 03:09 mutante: stafford - - /var/lib/puppet/reports is getting quite large (18G), and we got the first disk space warning, do we want to keep those?
  • 02:45 mutante: killing nrpe on several hosts where it was running as the wrong user again (somehow through the use of dsh)
  • 02:21 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 16 02:21:35 UTC 2012
  • 01:12 mutante: stopping nagios-wm temp. while changing nrpe config (will watch it manually until it's back)
  • 00:54 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Add NS to stewardwiki at request of Philippe'
  • 00:51 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Add NS to stewardwiki at request of Philippe'

March 15

  • 23:17 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'r113974'
  • 23:12 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/DisableTemplate.php 'r113973, fixes bug 35249'
  • 23:10 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/modules/ext.articleFeedbackv5/ext.articleFeedbackv5.js 'r113972'
  • 22:59 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 22:59 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Bump AFTv4 event logging percentage from 25% to 100%'
  • 22:57 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/jquery.articleFeedback/jquery.articleFeedback.js
  • 22:48 mutante: purging Lucene monitoring on indexer from db9, remove duplicate service definitions manually anyways (still tons left), run purge script, reload Nagios..
  • 22:24 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 22:23 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Raise AFTv4 event logging percentage from 5% to 25%'
  • 22:21 mutante: getting rid of Swift HTTP checks on non production machines manually (come on spence _purge_ ;P)
  • 22:07 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 22:04 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Raise AFTv4 event logging percentage from 1% to 5%'
  • 21:44 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 21:28 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'r113961'
  • 21:25 pgehres: K4-713 synchronized payments cluster to r113956
  • 21:25 pgehres: disabled credit cards on donate.wikimedia.org
  • 21:21 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'fix fatal'
  • 21:20 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Bump AFTv4 event logging percentage from 0.27% to 1%'
  • 21:19 Ryan_Lane: rebalancing instances gluster volume
  • 21:18 RoanKattouw: That was r113959
  • 21:18 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/ext.articleFeedback/ext.articleFeedback.js
  • 21:11 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/ext.articleFeedback/ext.articleFeedback.js 'r113958'
  • 21:09 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ 'r113957'
  • 20:46 mark: bits.pmtpa cluster back online
  • 20:44 RobH: dns update for silver and zhen servers
  • 20:37 logmsgbot: reedy synchronized php-1.19/extensions/WikimediaMaintenance/cleanupBug31576.php
  • 19:54 RobH: sq67-sq70 have been reinstalled, but not signed in puppet, not sure if they are ready for that or if there are other items mark needs to change first
  • 19:11 RobH: working on sq67-sq70 reinstalls, disregard alerts
  • 19:00 RobH: db1022 resetup and redeployed per rt 2537 and assigned back to asher
  • 18:51 logmsgbot: reedy synchronizing Wikimedia installation... : Running scap to deal with message changes earlier
  • 18:19 RobH: db1022 coming down for reinstall and resetup of raid per rt 2537
  • 17:55 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/ 'r113940'
  • 17:54 logmsgbot: reedy synchronized php-1.19/extensions/CheckUser/ 'r113940'
  • 17:53 logmsgbot: reedy synchronized php-1.19/extensions/wikihiero/modules/ext.wikihiero.css 'r113940'
  • 17:52 logmsgbot: reedy synchronized php-1.19/extensions/NewUserMessage/NewUserMessage.class.php 'r113940'
  • 17:41 logmsgbot: reedy synchronized php-1.19/includes/RecentChange.php 'r113938'
  • 17:38 logmsgbot: reedy synchronized php-1.19/resources/mediawiki/mediawiki.util.js 'r113936'
  • 17:37 logmsgbot: reedy synchronized php-1.19/includes/specials/SpecialUndelete.php 'r113936'
  • 17:32 logmsgbot: reedy synchronized php-1.19/languages/messages/ 'r113935'
  • 17:31 logmsgbot: reedy synchronized php-1.19/resources/ 'r113935'
  • 17:31 logmsgbot: reedy synchronized php-1.19/includes/ 'r113935'
  • 17:16 logmsgbot: reedy synchronized php-1.19/includes/SkinTemplate.php 'r113932'
  • 16:13 logmsgbot: reedy synchronized php-1.19/extensions/WikimediaMaintenance/cleanupBug31576.php 'r113929'
  • 15:15 mark: Created git repo operations/debs/varnish in gerrit
  • 14:06 apergos: disabled moodbar temporarily on enwiki, see bug 35245
  • 14:02 logmsgbot: ariel synchronized wmf-config/InitialiseSettings.php 'emergency disable of feedback dashboard (right config var this time?)'
  • 13:51 logmsgbot: ariel synchronized wmf-config/InitialiseSettings.php 'emergency disable of feedback dashboard'
  • 13:11 apergos: on screen as root on dataset1001, copying to gluster volume; if this causes problems feel free to shoot it. ( cp -a 20120211 /mnt/glusterpublicdata/public/enwiki/ )
  • 09:08 mutante: ran puppet on mw1020
  • 08:12 mutante: installing apache,apt,cron,mysql-client upgrades on spence
  • 07:51 mutante: messed with /var/lib/dpkg/status on hume to fix broken packages/remove "marked for purging" on libmysql-php5 without removing a ton of other packages, rather hackish but seems fine anyways, like not broken anymore on simulated dist-upgrade etc
  • 07:01 mutante: upgrading apache and apt on hume
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 15 02:17:35 UTC 2012
  • 01:26 Ryan_Lane: labsconsole was missing libapache2-mod-php5. puppet must have tried to upgrade a package unsuccessfully
  • 01:22 mutante: planet back up (installed libapache2-mod-php5 which installed apache2-mpm-prefork and removed apache2-mpm-worker)
  • 01:19 mutante: planet down - apache on singer, syntax error in site config "Invalid command 'php_admin_flag'"
  • 01:03 mutante: fixing nrpe "unable to read output" raid check on srv197,207,243,244,253.. (nrpe running as wrong user)

March 14

  • 23:16 maplebed: installed the swiftcleaner to run daily from iron. see root's crontab for more info.
  • 20:41 binasher: disabled log_queries_not_using_indexes on all core dbs
  • 20:33 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header for image support debugging'
  • 20:30 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header for image support debugging'
  • 19:29 maplebed: rebooting ms-be1 to enable hyperthreading (and make it the same as all the other ms-be hosts)
  • 19:06 preilly: pushing x-images header for vary support
  • 19:06 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header for image support debugging'
  • 19:05 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'zero needs to add x-images to vary header'
  • 18:58 maplebed: ms-be5 is back in rotation
  • 18:31 preilly: push zero change for carrier testing
  • 18:31 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing'
  • 16:19 RobH: updating dns for new domain wikimediacommons.pt (nameservers not yet pointed at us)
  • 16:04 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'add vcs for extdist updates'
  • 13:03 RobH: cp1029-cp1035 all installed and ready for varnish deployment, puppet has been run
  • 08:24 mutante: running "apt-get -f install" on snapshot3 to fix dpkg, which installed mysql-client- and client-core-5.1
  • 08:02 mutante: stop/start memcached on srv254,srv255,srv257
  • 07:51 mutante: restarting memcached on marmontel
  • 07:51 mutante: fixing owa[1-3] Swift HTTP commands manually
  • 03:44 mutante: ekrem - user agent "AppleDictionaryService" requests cause temp. WAP outage ..it seems
  • 03:38 mutante: free some disk space on spence - deleted user.log.1 on spence, compressing messages.1, apt-get clean,...
  • 02:52 RobH: cp1032-cp1035 reinstall issue wiped mbr causing issues, will reinstall in my AM
  • 02:49 RobH: revoked, cp1032 is for some reason in grub error, and it's too late at night for me to work on it, will troubleshoot tomorrow
  • 02:48 RobH: realized i forgot to log hours ago that cp1029-cp1036 are installed with puppet run, ready for varnish deployment tomorrow
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 14 02:17:13 UTC 2012

March 13

  • 23:51 mutante: upgrading bugzilla to 4.0.5
  • 23:42 logmsgbot: reedy synchronized php-1.19/resources/jquery/jquery.textSelection.js 'r113786'
  • 23:14 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing'
  • 22:47 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113779'
  • 22:44 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/beta_common.css 'r113774'
  • 22:44 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/common.css 'r113774'
  • 22:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113771'
  • 22:42 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/api/ApiQueryExcerpts.php 'r113774'
  • 22:27 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Removing mobile URL template for tewtwiki'
  • 21:44 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
  • 21:44 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
  • 21:31 logmsgbot: asher synchronized wmf-config/db.php 'replacing db18 with new s7 slave db56'
  • 21:19 binasher: started slaving db56 from db37
  • 20:30 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
  • 19:27 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing'
  • 19:17 RobH: iron updated to use ipmi_mgmt script
  • 19:08 preilly: pushing changes for zero to mswiki
  • 19:08 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
  • 19:08 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
  • 19:05 binasher: streaming hotbackup of db1041 to db56 (new s7 slave replacing db18)
  • 18:10 maplebed: failover successful, restarted pybal on lvs4, failback successful.
  • 18:09 binasher: power cycling db1020, which also froze this morning
  • 18:08 maplebed: stopping pybal on lvs4 - should fail over to lvs3
  • 17:47 maplebed: pybal restarted on lvs3
  • 17:47 binasher: power cycling db1040, crashed again
  • 17:30 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 35183 - p include extensions/Renameuser/Renameuser.php instead of extensions/Renameuser/SpecialRenameuser.php'
  • 17:12 mark: Sending all normally-pmtpa upload traffic to upload-lb.eqiad
  • 17:05 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
  • 16:59 preilly: add disable images support to mswiki under zero domain
  • 16:59 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'add disable images option for mswiki on zero domain'
  • 16:58 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add disable images option for mswiki on zero domain'
  • 16:46 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add ZeroRatedMobileAccess extension to mswiki remove from mywiki'
  • 16:44 mark: Sending traffic from Japan, India, Mexico to upload-lb.eqiad
  • 16:37 LeslieCarr: reinstalling neon
  • 16:23 apergos: stole some free space from the phys volume on ms1002 to give us more time for the rsync to keep going til after the move to swift etc
  • 15:28 mark: Sending traffic from the USA to upload-lb.eqiad
  • 15:27 mark: Rebooting lvs1005 with upgraded kernel/packages
  • 15:12 LeslieCarr: manually deleted cp1025 info from nagios config file - nagios restored for now
  • 14:51 mark: Sending traffic from Canada to upload-lb.eqiad
  • 14:32 mark: Sending traffic from Brazil to upload-lb.eqiad
  • 13:58 mark: Sending traffic from Argentina to upload-lb.eqiad
  • 12:58 mark: Seeding the eqiad upload caches from live upload requests
  • 11:59 mark: Setup squid logging to oxygen, with oxygen relaying to multicast 233.58.59.1
  • 11:02 mark: Rebooting lvs1002 with kernel updates
  • 10:17 mark: Rebooting manutius with newer 2.6.36 kernel to attempt avoiding i/o kernel bug with torrus
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 13 02:18:03 UTC 2012

March 12

  • 22:55 K4-713: synchronized payments cluster to r113679, and tweaked the anti-fraud rules
  • 21:51 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r113671'
  • 21:51 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'r113671'
  • 21:44 Reedy: Running foreachwiki extensions/WikimediaMaintenance/cleanupBug31576.php in screen as me on hume
  • 21:39 RobH: search1014 repaired per rt 2483
  • 20:26 RobH: cp1040 coming down for hardware stuffs
  • 18:19 Nikerabbit: Assuming scap has finished
  • 17:48 logmsgbot: nikerabbit synchronizing Wikimedia installation... : Deploying updated Translate
  • 17:46 notpeter: restarting indexer on searchidx2
  • 17:24 logmsgbot: nikerabbit synchronized php-1.19/includes/Title.php 'r113635'
  • 17:22 logmsgbot: nikerabbit synchronized php-1.19/languages/ 'r113635'
  • 17:14 logmsgbot: nikerabbit synchronized php-1.19/extensions/Narayam/ 'Updating Narayam'
  • 17:13 mark: PXE booting cp1025-cp1028
  • 17:11 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/ 'Updating WebFonts'
  • 15:16 mark: Rebooted manutius, stuck in a similar state as streber always did
  • 06:10 mutante: turning off debug mode in nagios-nrpe, again had to kill it, restart fails
  • 05:53 mutante: dunno, copper was stuck (no mgmt output after reboot) but powercycling it and back
  • 05:43 mutante: rebooting copper to make sure grub update didnt break it and asked for restart anyways
  • 05:37 mutante: copper - installing (security) updates (apt,grub,openssl,ruby,libc6..)
  • 04:19 mutante: wanted to restart nagios-nrpe-server on spence with debug=1 to investigate permission issue. arr! "Address already in use" "cant write to pidfile", killed the one started on Feb18, and reordered allowed_hosts, spence talks to itself again now :p
  • 03:40 mutante: same (and nscd) on fenari
  • 03:35 mutante: upgrading libc6 and related packages on spence
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 12 02:17:28 UTC 2012

March 11

  • 08:14 apergos: restarted lighttpd on dataset2
  • 07:49 apergos: removed current htcp log file, restarted purger, it seems to be logging normally now
  • 07:35 apergos: current ls shows 17416851456 2012-03-11 07:34 HTCPpurger.log while current du -sh shows 175M for /var/log. Sparse file that gets rotated badly? lots of leading nulls (many gb worth), why?
  • 07:33 apergos: on ms1004 the HTCPpurger.log file after rotation was 17 gb, filling the disk. Removed it.
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 11 02:17:35 UTC 2012

March 10

  • 22:09 Reedy: Make that wikimania2012, not wikimediawiki
  • 22:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Disable anon page creation for wikimediawiki'
  • 19:28 binasher: set sync_binlog = 1 on all current masters and eqiad dbs
  • 19:22 binasher: reslaved db1033
  • 07:03 mutante: ran puppet on db1022, another one that works fine manually but somehow did not by itself
  • 05:11 mutante: doing more (cp*, db*, msbe-* ,mw*) by hand / for loop
  • 05:01 mutante: starting nagios-nrpe-server on all via dsh (fail to restart on config change issue)
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 10 02:16:57 UTC 2012
  • 01:07 maplebed: started swiftcleaner on owa1 looking for (and purging) bad objects
  • 01:06 maplebed: rebalanced the swift rings to finish decreasing traffic sent to ms1 and ms2
  • 00:18 Ryan_Lane: powercycling ssl1003
  • 00:18 Ryan_Lane: powercycling ssl1001

March 9

  • 20:34 notpeter: stopping search indexer on searchidx2 for fresh rsync to searchidx1001
  • 19:58 preilly: pushed change to remove description from landing page
  • 19:57 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
  • 18:59 Ryan_Lane: sending test.m.wikipedia.org to the same place as test.wikipedia.org via squid
  • 18:58 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Fixing wgMobileUrlTemplate settings for domains that do not have .m. domains configured'
  • 18:48 logmsgbot: reedy synchronized php-1.19/extensions/WikiLove/modules/ext.wikiLove/ext.wikiLove.css 'r113497'
  • 18:40 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Changing the way in which wgMobileUrlTemplate is configurable by InitialiseSettings.php'
  • 18:39 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling wgMobileUrlTemplate for testwiki - hopefully for real this time'
  • 18:34 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Making wgMobileUrlTemplate configurable by InitialiseSettings.php'
  • 18:34 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling wgMobileUrlTemplate for testwiki'
  • 17:40 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113489'
  • 17:32 maplebed: set swift storage device weight on ms2 to 0 and pushed out rings
  • 15:52 apergos: cleared up a little bit of space on root partition of snapshot2, but that's about it. I hope we never have 3 versions of mw in test at the same time, the tmp caches will kill us
  • 15:52 mark: Turned off vcc_err_unref on all varnish servers, so varnish doesn't complain when ACLs/probes/backends are unused
  • 15:44 Jeff_Green: hume apt upgrades, puppetd --test, switch to mysql 5.1.53-fb3753-wm1
  • 06:38 Ryan_Lane: reloading autofs on all labs instances
  • 06:13 Tim: running svn cleanup on extdist trunk
  • 04:18 Tim: switched php and wmf-deployment symlinks over to php-1.19 instead of php-1.18
  • 04:18 Tim: restarted morebots
  • 00:57 pp-pdf2: updated pyfribidi to 0.11.0 fixing https://github.com/pediapress/pyfribidi/issues/2
  • 00:57 pp-pdf3: updated pyfribidi to 0.11.0 fixing https://github.com/pediapress/pyfribidi/issues/2
  • 00:57 pp-pdf1: updated pyfribidi to 0.11.0 fixing https://github.com/pediapress/pyfribidi/issues/2
  • 00:38 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add ZeroRatedMobileAccess extension to mywiki'
  • 00:32 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/beta_opensearch.js 'fixes to code push'
  • 00:32 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/beta_opensearch.min.js 'fixes to code push'
  • 00:27 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/templates/ApplicationTemplate.php 'fixes to code push'
  • 00:27 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/opensearch.js 'fixes to code push'
  • 00:27 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/opensearch.min.js 'fixes to code push'
  • 00:01 RobH: oxygen install done, booting successfully after multiple tests, now running puppet for initial config
  • 00:01 K4-713: updated the paypal IPN listener on aluminium to r1450

March 8

  • 23:57 logmsgbot: awjrichards synchronized php-1.19/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113428'
  • 23:56 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 'per-wiki memory limit configuration, with extra memory for zh* for converter tables'
  • 23:55 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'per-wiki memory limit configuration, with extra memory for zh* for converter tables'
  • 23:42 mutante: rebooting ms-be5
  • 23:37 logmsgbot: awjrichards synchronizing Wikimedia installation... : Updating MobileFrontend per https://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments
  • 23:24 binasher: streaming hotbackup of db1017 to db1033 - no snapshots of enwiki in eqiad til db1033 is back
  • 23:19 Tim: started changing the php symlink to 1.19 instead of 1.18, but then changed my mind and changed it back.
  • 23:16 logmsgbot: tstarling synchronizing Wikimedia installation... :
  • 23:07 logmsgbot: tstarling synchronized php-1.19/extensions/ExtensionDistributor/svn-invoker.conf
  • 23:01 logmsgbot: asher synchronized wmf-config/db.php 'returning db24 to service'
  • 22:58 maplebed: powercycled ms-be3 - it crashed 2.5 hours ago.
  • 22:52 logmsgbot: asher synchronized wmf-config/db.php 'pulling db18'
  • 22:40 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend 'deployed r113413, r113414'
  • 22:39 LeslieCarr: poked hole to allow labs machines to reach gluster machines in tampa
  • 22:13 logmsgbot: catrope synchronized php-1.19/includes/MagicWord.php 'r113411'
  • 22:13 logmsgbot: catrope synchronized php-1.19/includes/Cdb.php 'r113411'
  • 22:13 logmsgbot: catrope synchronized php-1.19/includes/WebRequest.php 'r113411'
  • 22:11 RobH: updating dns for oxygen
  • 22:03 RobH: oxygen coming down for reinstall
  • 20:42 cmjohnson1: power to msw-c1-sdtpa restored
  • 20:40 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
  • 20:40 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.php 'changes for zero'
  • 20:39 cmjohnson1: removing and relocating power to msw-c1-sdtpa
  • 19:38 logmsgbot: catrope synchronizing Wikimedia installation... : ArticleFeedbackv5 updates
  • 19:34 RoanKattouw: Running scap for ArticleFeedbackv5 updates
  • 19:30 RoanKattouw: Running AFTv5 schema changes on enwiki
  • 19:29 logmsgbot: catrope synchronized wmf-config/CommonSettings.php '$wgArticleFeedbackv5OversightEmails'
  • 19:29 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php '$wgArticleFeedbackv5OversightEmails'
  • 19:26 RoanKattouw: Applying AFTv5 schema changes to en_labswikimedia
  • 19:09 preilly: push zero rated changes
  • 19:09 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
  • 19:09 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
  • 19:04 RoanKattouw: Clearing message blobs
  • 18:53 RoanKattouw: Running rebuildLocalisationCache.php
  • 18:49 binasher: power cycling cp1044
  • 18:46 binasher: purging entire mobile varnish cache - the main mobile template included robots no-follow
  • 18:43 preilly: needed to fix a google issue with robots
  • 18:43 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/ApplicationTemplate.php 'remove ROBOTS metatag'
  • 18:40 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/ApplicationTemplate.php 'remove ROBOTS metatag'
  • 18:40 binasher: deploying new squid frontend.conf to fix epic fail - all googlebot traffic was being redirected to mobile. now just if it's mobilegooglebot.
  • 18:29 RoanKattouw: Applying AFTv5 schema changes on testwiki
  • 18:27 RoanKattouw: Pushing new AFTv5 code to testwiki, do not sync to the live site just yet
  • 17:46 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'ptwikipedia to ptwiki'
  • 17:14 cmjohnson1: shutting down db18 for memory testing
  • 16:57 RobH: search1014 still down per rt2483
  • 16:47 maplebed: took ms-be5 out of rotation in the swift cluster - it's crashed 3 times now.
  • 16:36 logmsgbot: reedy synchronized php-1.19/extensions/ExtensionDistributor/ExtensionDistributor_body.php 'r113368'
  • 16:31 logmsgbot: reedy synchronized php-1.19/extensions/ExtensionDistributor/ExtensionDistributor_body.php 'Revert live hack because it works, will come in properly'
  • 16:30 logmsgbot: reedy synchronized php-1.19/extensions/ExtensionDistributor/ExtensionDistributor_body.php 'Test for bug 27246'
  • 16:16 RobH: search1008 repaired
  • 15:52 RobH: mw1103 finally repaired and ready for os and such
  • 14:48 pp-pdf1: installed python faulthandler 2.1
  • 14:47 pp-pdf3: installed python faulthandler 2.1
  • 14:47 pp-pdf2: installed python faulthandler 2.1
  • 14:24 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35012 - Namespace aliases for wikipedia and wikipedia-talk namespaces on Sanskrit wiki'
  • 09:17 mutante: running puppet on mw1010 - finished quickly without problems - uh, wonder why Nagios reported puppet freshness then
  • 08:22 mutante: cp1019 - Hitting F1 to continue reboot ( "Alert! System fatal error during previous boot")
  • 08:21 mutante: cp1019 went down, then rebooted by itself (i think) after showing "idrac-8W82BP1 Severity: Non Recoverable, SEL:CPU Machine Chk: Processor sensor, transition to non-recoverable was asserted"
  • 07:54 mutante: cadmium fixed by adding groups::wikidev
  • 07:41 mutante: puppet on cadmium broken due to dependency Group[500] for User[catrope]
  • 07:20 mutante: ms1004 ran out of disk - caused by 17G HTCPpurger.log.1, trying to gzip it now
  • 06:52 logmsgbot: tstarling synchronized multiversion/MWMultiVersion.php
  • 06:51 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files:
  • 03:04 Guest32353: powercycled ms-be5; it has been unresponsive for 2 hours.
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 8 02:18:02 UTC 2012
  • 01:32 AaronSchulz: fixBug34995.php done
  • 01:26 AaronSchulz: running fixBug34995 on all wikis
  • 00:17 Ryan_Lane: adding zero cnames
  • 00:16 Ryan_Lane: installing newer wikimedia-task-dns-auth on all dns servers
  • 00:15 Ryan_Lane: added wikimedia-task-dns-auth_0.18 to the repo, to add support for zero

March 7

  • 23:05 logmsgbot: aaron synchronized php-1.19/includes/filerepo/file/LocalFile.php 'deployed r113319'
  • 22:39 maplebed: set swift weight for ms1 to 0, initiating the process to move data off the host in preparation for decommissioning it.
  • 21:17 Jeff_Green: running apt upgrades and puppetd --test on srv194, srv197, srv203, srv212, srv213, srv230, srv244, srv245, srv252, srv282 and manually restarting nrpe because they're reporting funky in nagios
  • 20:20 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 20:17 Jeff_Green: yet another redirects.conf change, per RT#2498 redirect wikimedia.com-->wikimedia.org
  • 20:05 binasher: reverted no-pagecache rsync on search nodes - without corresponding index warmup in lsearchd, it just pushes back the pain a bit and does more harm than good
  • 20:04 binasher: deployed support for zero.wikipedia.org and carrier tagging to mobile varnish servers
  • 19:38 logmsgbot: catrope synchronized php-1.19/includes/resourceloader/ 'r113278'
  • 19:27 Jeff_Green: manual apt-upgrade, puppetd --refresh, and repeat on srv265 because it was running on outdated apache config
  • 18:44 RobH: correction sq39
  • 18:36 RobH: pulled sq39 from text pybal config, pulled sq46 from upload pybal config
  • 18:36 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 18:36 logmsgbot: catrope synchronized php-1.19/extensions/CustomUserSignup/modules/AccountCreationUserBucket.js 'touch'
  • 18:12 RobH: shutting down sq38 and sq46 per rt 2581 for testing
  • 16:02 cmjohnson1: replacing hdd for disk 10 on db22
  • 16:00 cmjohnson1: pulling disk 10 from db22
  • 13:28 mark: Removed torrus from streber
  • 13:00 pp-pdf2: updated mwlib to 0.13.6
  • 13:00 pp-pdf3: updated mwlib to 0.13.6
  • 13:00 pp-pdf1: updated mwlib to 0.13.6
  • 11:29 logmsgbot: hashar synchronizing Wikimedia installation... : trigger a rebuild of l10n cache
  • 04:53 mutante: added ms-be5 drives to swift cluster
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 7 02:18:01 UTC 2012
  • 02:11 logmsgbot: catrope synchronized php-1.19/includes/api/ApiBase.php 'r113212'
  • 01:58 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/FileBackend.php 'bumped max file size to 4GiB'
  • 00:27 maplebed: put ms-be4 into rotation as a new production swift backend storage node
  • 00:21 maplebed: put ms-be3 into rotation as a new production swift backend storage node
  • 00:05 maplebed: put ms-be2 into rotation as a new production swift backend storage node

March 6

  • 23:54 logmsgbot: catrope synchronized php-1.19/extensions/CustomUserSignup/ 'Belated sync of r113056'
  • 23:52 binasher: deploying new frontend squid config to include googlebot in mobile redirects
  • 23:36 logmsgbot: reedy synchronized php-1.19/extensions/SyntaxHighlight_GeSHi/SyntaxHighlight_GeSHi.class.php 'r113200 reverting r113198'
  • 23:25 Tim: patched 5xx-filter.c live on locke and reloaded udp2log to stop the segfaults
  • 23:20 logmsgbot: reedy synchronized php-1.19/extensions/SyntaxHighlight_GeSHi/SyntaxHighlight_GeSHi.class.php 'r113198'
  • 21:46 logmsgbot: catrope synchronized php-1.19/extensions/CentralAuth/specials/SpecialCentralAuth.php 'r113183'
  • 21:41 notpeter: restarting puppet on brewster
  • 21:03 Jeff_Green: pushing another change to redirects.conf and doing a graceful apache restart
  • 20:32 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild message cache stuffs for r113129
  • 20:31 Jeff_Green: disabled Global Connect nagios test (check_gcsip) on payments cluster because GC is down and nagios is spammy
  • 20:25 notpeter: reimaging search1001-1020 with new partman recipe :/
  • 20:22 notpeter: temp stopping puppet on brewster
  • 20:21 logmsgbot: reedy synchronized php-1.19/resources/mediawiki.action/mediawiki.action.edit.js 'r113175'
  • 20:20 logmsgbot: reedy synchronized php-1.19/maintenance/populateRevisionSha1.php 'r113175'
  • 20:19 logmsgbot: reedy synchronized php-1.19/includes/specials/SpecialContributions.php 'r113175'
  • 20:18 logmsgbot: reedy synchronized php-1.19/includes/specials/SpecialUserlogin.php 'r113176'
  • 20:00 pp-pdf1: installed log-wikimedia-operations (which can be used for automated logging to #wikimedia-operations)
  • 19:53 Ryan_Lane: restarting labs mysql to allow for more connections
  • 19:26 Ryan_Lane: installing nova-api on virt0
  • 19:09 Ryan_Lane: upping FLAGS.sql_max_pool_size for nova-api
  • 18:47 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
  • 18:46 Ryan_Lane: rebooting all instances
  • 18:34 Ryan_Lane: restarting nova-network on virt2
  • 18:19 Ryan_Lane: rebooting virt1
  • 18:15 Ryan_Lane: rebooting virt2
  • 18:11 Ryan_Lane: rebooting virt3
  • 18:07 Ryan_Lane: rebooting virt4
  • 17:57 Ryan_Lane: taking the opportunity to apply security updates to virt0-4
  • 16:25 logmsgbot: catrope synchronized docroot/foundation/FrameResize.html 'Put Jobvite frame resize file in foundationwiki docroot per Erik'
  • 11:40 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: switching sr* to 1.19
  • 11:15 logmsgbot: hashar synchronized php-1.19/languages/messages/MessagesSa.php 'r1113039 for bug 34938 : title is sometime empty on Sanskrit wikis'
  • 11:13 logmsgbot: tstarling synchronized php-1.19/includes/OutputPage.php 'r113128'
  • 10:41 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: switching zh* from 1.18 to 1.19
  • 08:36 mutante: on hooper: puppet broken due to dependency Package[libapache2-mod-php5] for Service[apache2]
  • 03:33 mutante: rebooting bast1001 for kernel upgrade
  • 03:32 mutante: upgrading apache2 packages, base-files, kernel, several libs on bast1001
  • 03:27 mutante: installing a couple upgrades on fenari (apache2-utils, update-manager-core, cvs, ruby, libxml*, libopenssl-ruby*...)
  • 02:37 logmsgbot: LocalisationUpdate completed (1.18) at Tue Mar 6 02:37:06 UTC 2012
  • 02:36 logmsgbot: tstarling synchronizing Wikimedia installation... : updating to r113119
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 6 02:18:13 UTC 2012
  • 01:27 Jeff_Green: manually updated packages and restarted apache on srv198, srv229, srv262, srv268, mw40 because their apache redirect configs failed to update after sync-apache and restart
  • 01:07 Jeff_Green: another adjustment to redirects.conf and apache-graceful-all for RT#2488

March 5

  • 22:24 Jeff_Green: modified redirects.conf per RT #2488
  • 21:21 Reedy: Ran foreachwiki cleanupUploadStash.php
  • 20:36 maplebed: enabled swift for 100% of thumbnails in production
  • 18:18 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/ 'i18ndeploy r113058'
  • 18:11 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'WebFonts: bugwiki bug 34550; sawikisource bug 34159; amwiktionary amwikiquote bug 34700'
  • 18:01 mark: Raised MTU between cr1-sdtpa - (csw1-sdtpa) - cr2-pmtpa to 9192
  • 17:35 Jeff_Green: removed 3GB db30:/tmp/gmond.log and force-restarted gmond b/c the init script failed to restart it
  • 17:16 Jeff_Green: adjusted LVS partitions on hume, moved /usr/local/apache to a new 5GB mount
  • 15:18 mark: Fixed DNS resolving on the core routers by allowing DNS replies in the loopback filter
  • 14:44 logmsgbot: reedy synchronized php-1.19/includes/Title.php 'r113036'
  • 14:43 logmsgbot: reedy synchronized php-1.19/includes/AjaxResponse.php 'r113036'
  • 14:35 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/ 'r113035'
  • 14:34 logmsgbot: reedy synchronized php-1.19/extensions/CategoryTree/ 'r113035'
  • 13:50 mark: Set increased OSPF/OSPFv3 metric 30 on both directions of the link cr1-eqiad:xe-5/2/1 <--> cr1-sdtpa:xe-0/0/1, to combat higher than normal jitter and packet loss on the link
  • 12:53 mark: Upgraded observium to latest version
  • 09:41 mutante: restarting memcached on marmontel
  • 09:40 mutante: restarting squid backend on knsq25
  • 06:52 Ryan_Lane: all of the instances are accessing the file descriptors of files inside of the _base directory, and fuse has an issue with this. gluster can't recreate the base directory because of the processes holding open the old one.
  • 06:50 Ryan_Lane: I've corrupted the _base directory on the instance's glusterfs share. I'm recovering the files from file descriptors using lsof. Not totally sure how I'm going to get the _base directory back, yet.
  • 02:33 logmsgbot: LocalisationUpdate completed (1.18) at Mon Mar 5 02:33:04 UTC 2012
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 5 02:16:39 UTC 2012

March 4

  • 21:48 logmsgbot: reedy synchronized wmf-config/ 'Bug 32726 - Set =true for Commons'
  • 21:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'fix .'
  • 21:41 logmsgbot: reedy synchronized wmf-config/ 'Bug 32726 - Set =true for Commons'
  • 21:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34897 - Enable Special:Import on Catalan wikisource'
  • 20:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34567 - New logo for Arabic Wiktionary'
  • 20:31 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34715 - Please modify the import sources for the Spanish Wikiversity'
  • 20:29 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34694 - Install the Quiz extension on de.wikibooks'
  • 20:25 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wmgMoodBarCutoffTime'
  • 20:25 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Create wmgMoodBarCutoffTime'
  • 20:14 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Variablise moodbarconfig infoUrl'
  • 20:12 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Variablise moodbarconfig infoUrl'
  • 20:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34618 - Install MoodBar on fr.wikisource'
  • 20:01 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34766 - Logo of Sanskrit Wikisource'
  • 19:53 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34867 - Switch Sango wiktionary logo'
  • 19:48 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34931 - Add namespaces aliases on as.wikipedia.org'
  • 19:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34690 - Changing the name in the title bar to Assamese'
  • 02:35 logmsgbot: LocalisationUpdate completed (1.18) at Sun Mar 4 02:35:16 UTC 2012
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 4 02:17:34 UTC 2012

March 3

  • 18:48 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34728 - Categories added to user pages by Babel in pt.wiktionary'
  • 13:04 logmsgbot: aaron synchronized php-1.19/includes/Revision.php 'deployed r112949'
  • 02:35 logmsgbot: LocalisationUpdate completed (1.18) at Sat Mar 3 02:35:08 UTC 2012
  • 02:18 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 3 02:18:04 UTC 2012

March 2

  • 21:41 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'disabled logging hack'
  • 20:47 Jeff_Green: added redirect/301 from http://static.wikimedia.org --> http://dumps.wikimedia.org now that archival static html dumps are located there
  • 19:53 mark: Decommissioned csw5-pmtpa from AS14907 service. rest in pieces ;)
  • 19:10 mark: Did a hot cut to remove csw5-pmtpa out of the path of cr1-sdtpa -> csw1-sdtpa -> csw5-pmtpa -> cr2-pmtpa
  • 17:46 cmjohnson1: powering down msw1-pmtpa for relocation to d1-pmtpa
  • 17:40 cmjohnson1: disconnecting management fiber from msw1-pmtpa
  • 16:59 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/CentralAuth.php 'r112904'
  • 16:55 RobH: ms-be4 boot order fixed, fixing ms-be5 & ms-be2
  • 16:49 RobH: fixed boot order on ms-be3, fixing ms-be4
  • 16:33 RobH: poking at bios on ms-be3
  • 16:05 RobH: wikitech outage resolved
  • 15:20 RobH: shutdown frdev offsite vm per email to engineering last week
  • 15:18 RobH: backing up wikitech in hopes of upgrading some of its software
  • 08:36 apergos: on ms1004, low on space, HTCPpurger.log.1 had about 16 gb of nulls before any real content, I tailed off the real stuff and tossed the original. The current log file has the same problem, why?
  • 02:34 logmsgbot: LocalisationUpdate completed (1.18) at Fri Mar 2 02:34:34 UTC 2012
  • 02:17 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 2 02:17:51 UTC 2012
  • 01:36 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/lockmanager/LockManager.php 'deployed r112867'
  • 00:41 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree 'deployed r112862'

March 1

  • 23:33 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'log agent'
  • 23:29 logmsgbot: reedy synchronizing Wikimedia installation... : Push message updates from r112848
  • 23:22 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'logging fix'
  • 23:22 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php
  • 23:20 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php
  • 23:17 logmsgbot: reedy synchronized php-1.19/includes/filerepo/backend/FSFileBackend.php 'r112850'
  • 23:16 logmsgbot: reedy synchronized php-1.19/includes/Article.php 'r112850'
  • 23:11 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php
  • 23:06 logmsgbot: reedy synchronized php-1.19/extensions/MoodBar/ApiFeedbackDashboardResponse.php 'r112848'
  • 23:05 logmsgbot: reedy synchronized php-1.19/extensions/Collection/Collection.body.php 'r112848'
  • 22:12 logmsgbot: aaron synchronized php-1.19/includes/specials/SpecialContributions.php 'deployed r112844'
  • 22:06 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend 'deployed r112841'
  • 21:04 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'enabled FileBackend debug log'
  • 19:57 cmjohnson1: replaced disk 3 labstore1 chassis
  • 19:54 cmjohnson1: removing disk 3 from labstore1 chassis
  • 19:47 Ryan_Lane: restarted memcached on virt0
  • 19:15 logmsgbot: reedy synchronized php-1.19/cache/interwiki.cdb 'Updating interwiki cache'
  • 17:39 Jeff_Green: Removed >5GB /tmp/gmond.log on db25, db32, db33, db37
  • 17:36 logmsgbot: hashar synchronized php-1.19/includes/EditPage.php 'r112819 - Bug 34849 diff during editing an old version compares to the old version instead of the current one'
  • 17:36 Jeff_Green: Removed >5GB /tmp/gmond.log on db13
  • 17:35 Jeff_Green: Removed >5GB /tmp/gmond.log on db11
  • 17:25 Jeff_Green: Removed 5.3GB /tmp/gmond.log on db1018
  • 17:24 Jeff_Green: Removed 5.3GB /tmp/gmond.log on db1017
  • 17:13 Jeff_Green: Removed 4.8GB /tmp/gmond.log on db1008. Tried to resist urge to make snarky comment about ganglia but failed.
  • 14:54 RobH: strontium server rebooting to set HT to enabled
  • 14:26 mark: Moving bits traffic back from pmtpa to eqiad
  • 14:24 mark: Cleared dnsmasq cache on virt2
  • 14:16 mark: csw5-pmtpa: Mar 1 14:01:42:A:Power Supply 2 , 2nd from left, bad
  • 14:14 mark: mr1-pmtpa rebooted/lost power for some reason
  • 14:07 mark: pmtpa/sdtpa management network went down
  • 13:54 mark: Pooled new eqiad bits servers strontium and palladium
  • 12:45 logmsgbot: hashar synchronized php-1.19/includes/specials/SpecialWatchlist.php 'r111882 for Bug 34835 - watchlist shows times in UTC'
  • 10:53 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: reverting sr* wikis back to 1.18 per Siebrand's recommendation due to bug 34832
  • 06:26 logmsgbot: tstarling synchronized php-1.19/extensions/SpamBlacklist/SpamBlacklist.php 'r112781'
  • 05:46 maplebed: started swift deletion run on owa1, 2, and 3.
  • 02:33 logmsgbot: LocalisationUpdate completed (1.18) at Thu Mar 1 02:33:53 UTC 2012
  • 02:16 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 1 02:16:52 UTC 2012
  • 02:15 Ryan_Lane: vlan tagged virt5's eth0 and eth1 ports on csw1-sdtpa
  • 02:12 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'debug logging'
  • 02:02 logmsgbot: reedy synchronized php-1.19/resources/mediawiki.action/mediawiki.action.history.diff.css 'r112750'
  • 01:59 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: all zh wikis back to 1.18
  • 01:50 logmsgbot: aaron synchronized php-1.19/extensions/WikiLove 'deployed r112758'
  • 01:37 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Last 265 wikipedias over to 1.19wmf1
  • 01:28 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: s7 to 1.19wmf1
  • 01:23 logmsgbot: reedy synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'r112754'
  • 01:09 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: s2 to 1.19wmf1
  • 00:58 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Meanwhile, on wikipedia.... Hello ruwiki!
  • 00:48 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: jawiki to 1.19wmf1
  • 00:31 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: frwiki to 1.19wmf1
  • 00:21 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: dewiki to 1.19wmf1
  • 00:05 logmsgbot: tstarling synchronized php-1.19/extensions/Collection/Collection.body.php 'r112745'

