PyBal

From Wikitech
(Difference between revisions)
Jump to: navigation, search
(added bgp asn)
 
(3 intermediate revisions by 3 users not shown)
Line 1: Line 1:
'''PyBal''' is a LVS monitoring script quite similar to [[lvsmon]]. It's written in Python using the [http://www.twistedmatrix.com Twisted framework].
+
'''PyBal''' is a [[LVS]] monitoring script quite similar to [[lvsmon]]. It's written in Python using the [http://www.twistedmatrix.com Twisted framework].
 +
 
 +
'''For more information about Wikimedia's LVS setup in general, see [[LVS]].'''
  
 
At this moment, just a few features distinguish it from lvsmon:
 
At this moment, just a few features distinguish it from lvsmon:
Line 8: Line 10:
 
...but I intend to polish it more, and extend it with useful things.
 
...but I intend to polish it more, and extend it with useful things.
  
The script is in MediaWiki <s>CVS</s> SVN repo '''mediawiki''', subdirectory [http://svn.wikimedia.org/viewvc/mediawiki/trunk/pybal/ pybal].
+
The script <s>is</s> was in MediaWiki <s>CVS</s> SVN repo "mediawiki", subdirectory <s>[http://svn.wikimedia.org/viewvc/mediawiki/trunk/pybal/ pybal]</s>.
  
 
-- Mark
 
-- Mark
 
pybal uses AS64600 for it's BGP stuff
 
  
 
== Setup ==
 
== Setup ==
Line 20: Line 20:
 
'''NOTE that recently pybal seems to not always stop''' when receiving a TERM signal (i.e. via this script).  So you need to check the process start time and see if it *really* restarted, if not kill -9 manually, then start via script.
 
'''NOTE that recently pybal seems to not always stop''' when receiving a TERM signal (i.e. via this script).  So you need to check the process start time and see if it *really* restarted, if not kill -9 manually, then start via script.
  
Configuration is in <tt>/etc/pybal/</tt>. <tt>pybal.conf</tt> defines the LVS service parameters, <tt>text-squids</tt> defines the list of text squid servers, upload-squids contains the list of upload squids, and >tt>apaches</tt> contains the list of all apaches. Attributes for servers:
+
Configuration is in <tt>/etc/pybal/</tt>. <tt>pybal.conf</tt> defines the LVS service parameters
 +
 
 +
The list of pooled hosts resides in <tt>/home/w/conf/pybal</tt> (on fenari), with one file per LVS service. Attributes:
  
 
*weight: a larger number means that more requests get sent to this server in comparison with others
 
*weight: a larger number means that more requests get sent to this server in comparison with others
Line 49: Line 51:
  
 
== Howto ==
 
== Howto ==
 +
See [[LVS]].
  
=== Depool a server ===
+
[[Category:Software]]
Either:
+
* Set <tt>'enabled': False</tt>, in which case PyBal keeps monitoring it, but keeps it unpooled, or
+
* Comment the entire line in the config file, in which case PyBal doesn't even know about it
+
 
+
PyBal rereads its realserver config files every minute, so there's no need to restart it.
+

Latest revision as of 09:09, 11 September 2012

PyBal is a LVS monitoring script quite similar to lvsmon. It's written in Python using the Twisted framework.

For more information about Wikimedia's LVS setup in general, see LVS.

At this moment, just a few features distinguish it from lvsmon:

  • It's using asynchronous communication, and thus runs all checks in parallel instead of sequentially
  • It has an extra monitoring method called IdleConnection, which keeps an idle connection open to all squids, and therefore notices immediately when the Squid processes are shut down / crashing
  • It can fetch server lists over HTTP as well as from the local filesystems

...but I intend to polish it more, and extend it with useful things.

The script is was in MediaWiki CVS SVN repo "mediawiki", subdirectory pybal.

-- Mark

[edit] Setup

PyBal is currently installed on our LVS hosts, in directory /usr/sbin. Start or stop it by

 /etc/init.d/pybal start|stop

NOTE that recently pybal seems to not always stop when receiving a TERM signal (i.e. via this script). So you need to check the process start time and see if it *really* restarted, if not kill -9 manually, then start via script.

Configuration is in /etc/pybal/. pybal.conf defines the LVS service parameters

The list of pooled hosts resides in /home/w/conf/pybal (on fenari), with one file per LVS service. Attributes:

  • weight: a larger number means that more requests get sent to this server in comparison with others
  • enabled: either True or False, depnding on whether you want requests to be sent to this server

The format should be fairly self explanatory; the files more or less use Python assignment / dictionary syntax.

PyBal supports multiple LVS services through a single instance and configuration file pybal.conf, e.g.:

[text]
protocol = tcp
ip = 145.97.39.155
port = 80
scheduler = wlc
config = file:///etc/pybal/text-squids

[images]
protocol = tcp
ip = 145.97.39.156
port = 80
scheduler = wlc
config = file:///etc/pybal/upload-squids

Beware, the code as checked out from Subversion has DryRun = True set in ipvs.py, meaning that it will not modify any actual IPVS state but only show the commands for debugging. This should be changed to a commandline option, but for now edit that file to DryRun = False.

The configuration files are (manually) backed up in /home/wikipedia/conf/pybal.

[edit] Howto

See LVS.

Personal tools
Namespaces

Variants
Actions
Navigation
Ops documentation
Wiki
Toolbox