DOKK / manpages / debian 10 / circus / circus.1.en

Circus

NAME

circus - Circus Documentation [image]

Circus is a Python program which can be used to monitor and control processes and sockets.

Circus can be driven via a command-line interface, a web interface or programmatically through its python API.

To install it and try its features check out the examples, or read the rest of this page for a quick introduction.

RUNNING A CIRCUS DAEMON

Circus provides a command-line script call circusd that can be used to manage processes organized in one or more watchers.

Circus' command-line tool is configurable using an ini-style configuration file.

Here's a very minimal example:

[watcher:program]
cmd = python myprogram.py
numprocesses = 5
[watcher:anotherprogram]
cmd = another_program
numprocesses = 2

The file is then passed to circusd:

$ circusd example.ini

Besides processes, Circus can also bind sockets. Since every process managed by Circus is a child of the main Circus daemon, that means any program that's controlled by Circus can use those sockets.

Running a socket is as simple as adding a socket section in the config file:

[socket:mysocket]
host = localhost
port = 8080

To learn more about sockets, see sockets.

To understand why it's a killer feature, read whycircussockets.

CONTROLLING CIRCUS

Circus provides two command-line tools to manage your running daemon:

circusctl, a management console you can use to perform actions such as adding or removing workers
circus-top, a top-like console you can use to display the memory and cpu usage of your running Circus.

To learn more about these, see cli

Circus also offers a web dashboard that can connect to a running Circus daemon and let you monitor and interact with it.

To learn more about this feature, see circushttpd

What now ?

If you are a developer and want to leverage Circus in your own project, write plugins or hooks, go to fordevs.

If you are an ops and want to manage your processes using Circus, go to forops.

Installing Circus

Circus is a Python package which is published on PyPI - the Python Package Index.

The simplest way to install it is to use pip, a tool for installing and managing Python packages:

$ pip install circus

Or download the archive on PyPI, extract and install it manually with:

$ python setup.py install

If you want to try out Circus, see the examples.

If you are using debian or any debian based distribution, you also can use the ppa to install circus, it's at https://launchpad.net/~roman-imankulov/+archive/circus

zc.buildout

We provide a zc.buildout configuration, you can use it by simply running the bootstrap script, then calling buildout:

$ python bootstrap.py
$ bin/buildout

More on Requirements

Circus works with:

Python 2.6, 2.7, 3.2 or 3.3

zeromq >= 2.1.10

The version of zeromq supported is ultimately determined by what version of pyzmq is installed by pip during circus installation.
Their current release supports 2.x (limited), 3.x, and 4.x ZeroMQ versions.
Note: If you are using PyPy instead of CPython, make sure to read their installation docs as ZeroMQ version support is not the same on PyPy.

When you install circus, the latest versions of the Python dependencies will be pulled out for you.

You can also install them manually using the pip-requirements.txt file we provide:

$ pip install -r pip-requirements.txt

If you want to run the Web console you will need to install circus-web:

$ pip install circus-web

Step-by-step tutorial

The examples directory in the Circus repository contains many examples to get you started, but here's a full tutorial that gives you an overview of the features.

We're going to supervise a WSGI application.

Installation

Circus is tested on Mac OS X and Linux with the latest Python 2.6, 2.7, 3.2 and 3.3. To run a full Circus, you will also need libzmq, libevent & virtualenv.

On Debian-based systems:

$ sudo apt-get install libzmq-dev libevent-dev python-dev python-virtualenv

Create a virtualenv and install circus, circus-web and chaussette in it

$ virtualenv /tmp/circus
$ cd /tmp/circus
$ bin/pip install circus
$ bin/pip install circus-web
$ bin/pip install chaussette

Once this is done, you'll find a plethora of commands in the local bin dir.

Usage

Chaussette comes with a default Hello world app, try to run it:

$ bin/chaussette

You should be able to visit http://localhost:8080 and see hello world.

Stop Chaussette and add a circus.ini file in the directory containing:

[circus]
statsd = 1
httpd = 1
[watcher:webapp]
cmd = bin/chaussette --fd $(circus.sockets.web)
numprocesses = 3
use_sockets = True
[socket:web]
host = 127.0.0.1
port = 9999

This config file tells Circus to bind a socket on port 9999 and run 3 chaussettes workers against it. It also activates the Circus web dashboard and the statistics module.

Save it & run it using circusd:

$ bin/circusd --daemon circus.ini

Now visit http://127.0.0.1:9999, you should see the hello world app. The difference now is that the socket is managed by Circus and there are several web workers that are accepting connections against it.

NOTE:

The load balancing is operated by the operating system so you're getting the same speed as any other pre-fork web server like Apache or NGinx. Circus does not interfer with the data that goes through.

You can also visit http://localhost:8080/ and enjoy the Circus web dashboard.

Interaction

Let's use the circusctl shell while the system is running:

$ bin/circusctl
circusctl 0.7.1
circusd-stats: active
circushttpd: active
webapp: active
(circusctl)

You get into an interactive shell. Type help to get all commands:

(circusctl) help
Documented commands (type help <topic>):
========================================
add     get            list         numprocesses  quit     rm      start   stop
decr    globaloptions  listen       numwatchers   reload   set     stats
dstats  incr           listsockets  options       restart  signal  status
Undocumented commands:
======================
EOF  help

Let's try basic things. Let's list the web workers processes and add a new one:

(circusctl) list webapp
13712,13713,13714
(circusctl) incr webapp
4
(circusctl) list webapp
13712,13713,13714,13973

Congrats, you've interacted with your Circus! Get off the shell with Ctrl+D and now run circus-top:

$ bin/circus-top

This is a top-like command to watch all your processes' memory and CPU usage in real time.

Hit Ctrl+C and now let's quit Circus completely via circus-ctl:

$ bin/circusctl quit
ok

Next steps

You can plug your own WSGI application instead of Chaussette's hello world simply by pointing the application callable.

Chaussette also comes with many backends like Gevent or Meinheld.

Read https://chaussette.readthedocs.org/ for all options.

Why should I use Circus instead of X ?

1.

Circus simplifies your web stack process management

Circus knows how to manage processes and sockets, so you don't have to delegate web workers management to a WGSI server.

See whycircussockets

2.

Circus provides pub/sub and poll notifications via ZeroMQ

Circus has a pub/sub channel you can subscribe to. This channel receives all events happening in Circus. For example, you can be notified when a process is flapping, or build a client that triggers a warning when some processes are eating all the CPU or RAM.

These events are sent via a ZeroMQ channel, which makes it different from the stdin stream Supervisord uses:

Circus sends events in a fire-and-forget fashion, so there's no need to manually loop through all listeners and maintain their states.
Subscribers can be located on a remote host.

Circus also provides ways to get status updates via one-time polls on a req/rep channel. This means you can get your information without having to subscribe to a stream. The cli command provided by Circus uses this channel.

See examples.

3.: Circus is (Python) developer friendly

While Circus can be driven entirely by a config file and the circusctl / circusd commands, it is easy to reuse all or part of the system to build your own custom process watcher in Python.

Every layer of the system is isolated, so you can reuse independently:

the process wrapper (Process)
the processes manager (Watcher)
the global manager that runs several processes managers (Arbiter)
and so on…

4.: Circus scales

One of the use cases of Circus is to manage thousands of processes without adding overhead -- we're dedicated to focusing on this.

Coming from Supervisor

Supervisor is a very popular solution in the Python world and we're often asked how Circus compares with it.

If you are coming from Supervisor, this page tries to give an overview of how the tools differ.

Differences overview

Supervisor & Circus have the same goals - they both manage processes and provide a command-line script — respectively supervisord and circusd — that reads a configuration file, forks new processes and keep them alive.

Circus has an extra feature: the ability to bind sockets and let the processes it manages use them. This "pre-fork" model is used by many web servers out there, like Apache or Unicorn. Having this option in Circus can simplify a web app stack: all processes and sockets are managed by a single tool.

Both projects provide a way to control a running daemon via another script. respectively supervisorctl and circusctl. They also both have events and a way to subscribe to them. The main difference is the underlying technology: Supervisor uses XML-RPC for interacting with the daemon, while Circus uses ZeroMQ.

Circus & Supervisor both have a web interface to display what's going on. Circus' is more advanced because you can follow in real time what's going on and interact with the daemon. It uses web sockets and is developed in a separate project (circus-web.)

There are many other subtle differences in the core design, we might list here one day… In the meantime, you can learn more about circus internals in design.

Configuration

Both systems use an ini-like file as a configuration.

Supervisor documentation
Circus documentation

Here's a small example of running an application with Supervisor. In this case, the application will be started and restarted in case it crashes

[program:example]
command=npm start
directory=/home/www/my-server/
user=www-data
autostart=true
autorestart=true
redirect_stderr=True

In Circus, the same configuration is done by:

[watcher:example]
cmd=npm start
working_dir=/home/www/my-server/
user=www-data
stderr_stream.class=StdoutStream

Notice that the stderr redirection is slightly different in Circus. The tool does not have a tail feature like in Supervisor, but will let you hook any piece of code to deal with the incoming stream. You can create your own stream hook (as a Class) and do whatever you want with the incoming stream. Circus provides some built-in stream classes like StdoutStream, FileStream, WatchedFileStream, or TimedRotatingFileStream.

Circus for Ops

WARNING:

By default, Circus doesn't secure its messages when sending information through ZeroMQ. Before running Circus in a production environment, make sure to read the Security page.

The first step to manage a Circus daemon is to write its configuration file. See configuration. If you are deploying a web stack, have a look at sockets.

Circus can be deployed using Python 2.6, 2.7, 3.2 or 3.3 - most deployments out there are done in 2.7. To learn how to deploy Circus, check out deployment.

To manage a Circus daemon, you should get familiar with the list of commands you can use in circusctl. Notice that you can have the same help online when you run circusctl as a shell.

We also provide circus-top, see cli and a nice web dashboard. see circushttpd.

Last, to get the most out of Circus, make sure to check out how to use plugins and hooks. See plugins and hooks.

Ops documentation index

Configuration

Circus can be configured using an ini-style configuration file.

Example:

[circus]
check_delay = 5
endpoint = tcp://127.0.0.1:5555
pubsub_endpoint = tcp://127.0.0.1:5556
include = \*.more.config.ini
umask = 002
[watcher:myprogram]
cmd = python
args = -u myprogram.py $(circus.wid) $(CIRCUS.ENV.VAR)
warmup_delay = 0
numprocesses = 5
# hook
hooks.before_start = my.hooks.control_redis
# will push in test.log the stream every 300 ms
stdout_stream.class = FileStream
stdout_stream.filename = test.log
# optionally rotate the log file when it reaches 1 gb
# and save 5 copied of rotated files
stdout_stream.max_bytes = 1073741824
stdout_stream.backup_count = 5
[env:myprogram]
PATH = $PATH:/bin
CAKE = lie
[plugin:statsd]
use = circus.plugins.statsd.StatsdEmitter
host = localhost
port = 8125
sample_rate = 1.0
application_name = example
[socket:web]
host = localhost
port = 8080

circus - single section

endpoint: The ZMQ socket used to manage Circus via circusctl. (default: tcp://127.0.0.1:5555)
endpoint_owner: If set to a system username and the endpoint is an ipc socket like ipc://var/run/circusd.sock, then ownership of the socket file will be changed to that user at startup. For more details, see security. (default: None)
pubsub_endpoint: The ZMQ PUB/SUB socket receiving publications of events. (default: tcp://127.0.0.1:5556)
papa_endpoint: If using papa, you can specify the endpoint, such as ipc://var/run/circusd.sock. (default: tcp://127.0.0.1:20202)
statsd: If set to True, Circus runs the circusd-stats daemon. (default: False)
stats_endpoint: The ZMQ PUB/SUB socket receiving publications of stats. (default: tcp://127.0.0.1:5557)
statsd_close_outputs: If True sends the circusd-stats stdout/stderr to /dev/null. (default: False)
check_delay: The polling interval in seconds for the ZMQ socket. (default: 5)
include: List of config files to include. You can use wildcards (*) to include particular schemes for your files. The paths are absolute or relative to the config file. (default: None)
include_dir: List of config directories. All files matching *.ini under each directory will be included. The paths are absolute or relative to the config file. (default: None)
stream_backend: Defines the type of backend to use for the streaming. Possible values are thread or gevent. (default: thread)
warmup_delay: The interval in seconds between two watchers start. Must be an int. (default: 0)
httpd: If set to True, Circus runs the circushttpd daemon. (default: False)
httpd_host: The host ran by the circushttpd daemon. (default: localhost)
httpd_port: The port ran by the circushttpd daemon. (default: 8080)
httpd_close_outputs: If True, sends the circushttpd stdout/stderr to /dev/null. (default: False)
debug: If set to True, all Circus stout/stderr daemons are redirected to circusd stdout/stderr (default: False)
debug_gc: If set to True, circusd outputs additional log info from the garbage collector. This can be useful in tracking down memory leaks. (default: False)
pidfile: The file that must be used to keep the daemon pid.
umask: Value for umask. If not set, circusd will not attempt to modify umask.
loglevel: The loglevel that we want to see (default: INFO)
logoutput: The logoutput file where we want to log (default: - to log on stdout). You can log to a remote syslog by using the following syntax: syslog://host:port?facility where host is your syslog server, port is optional and facility is the syslog facility to use. If you wish to log to a local syslog you can use syslog:///path/to/syslog/socket?facility instead.
loggerconfig: A path to an INI, JSON or YAML file to configure standard Python logging for the Arbiter. The special value "default" uses the builtin logging configuration based on the optional loglevel and logoutput options.
Example YAML Configuration File

version: 1
disable_existing_loggers: false
formatters:


  simple:


    format: '%(asctime)s - %(name)s - [%(levelname)s] %(message)s'
handlers:


  logfile:


    class: logging.FileHandler


    filename: logoutput.txt


    level: DEBUG


    formatter: simple
loggers:


  circus:


    level: DEBUG


    handlers: [logfile]


    propagate: no
root:


  level: DEBUG


  handlers: [logfile]

watcher:NAME - as many sections as you want

NAME: The name of the watcher. This name is used in circusctl
cmd: The executable program to run.
args: Command-line arguments to pass to the program. You can use the python format syntax here to build the parameters. Environment variables are available, as well as the worker id and the environment variables that you passed, if any, with the "env" parameter. See Formatting the commands and arguments with dynamic variables for more information on this.
shell: If True, the processes are run in the shell (default: False)
shell_args: Command-line arguments to pass to the shell command when shell is True. Works only for *nix system (default: None)
working_dir: The working dir for the processes (default: None)
uid: The user id or name the command should run with. (The current uid is the default).
gid: The group id or name the command should run with. (The current gid is the default).
copy_env: If set to true, the local environment variables will be copied and passed to the workers when spawning them. (Default: False)
copy_path: If set to true, sys.path is passed in the subprocess environ using PYTHONPATH. copy_env has to be true. (Default: False)
warmup_delay: The delay (in seconds) between running processes.
autostart: If set to false, the watcher will not be started automatically when the arbiter starts. The watcher can be started explicitly (example: circusctrl start myprogram). (Default: True)
numprocesses: The number of processes to run for this watcher.
rlimit_LIMIT: Set resource limit LIMIT for the watched processes. The config name should match the RLIMIT_* constants (not case sensitive) listed in the Python resource module reference. For example, the config line 'rlimit_nofile = 500' sets the maximum number of open files to 500. To set a limit value to RLIM_INFINITY, do not set a value, like this config line: 'rlimit_nofile = '.
stderr_stream.class: A fully qualified Python class name that will be instanciated, and will receive the stderr stream of all processes in its __call__() method.
Circus provides some stream classes you can use without prefix:

FileStream: writes in a file and can do automatic log rotation
WatchedFileStream: writes in a file and relies on external log rotation
TimedRotatingFileStream: writes in a file and can do rotate at certain timed intervals.
QueueStream: write in a memory Queue
StdoutStream: writes in the stdout
FancyStdoutStream: writes colored output with time prefixes in the stdout

stderr_stream.*: All options starting with stderr_stream. other than class will be passed the constructor when creating an instance of the class defined in stderr_stream.class.
stdout_stream.class: A fully qualified Python class name that will be instanciated, and will receive the stdout stream of all processes in its __call__() method.
Circus provides some stream classes you can use without prefix:

FileStream: writes in a file and can do automatic log rotation
WatchedFileStream: writes in a file and relies on external log rotation
TimedRotatingFileStream: writes in a file and can do rotate at certain timed intervals.
QueueStream: write in a memory Queue
StdoutStream: writes in the stdout
FancyStdoutStream: writes colored output with time prefixes in the stdout

stdout_stream.*: All options starting with stdout_stream. other than class will be passed the constructor when creating an instance of the class defined in stdout_stream.class.
close_child_stdout: If set to True, the stdout stream of each process will be sent to /dev/null after the fork. Defaults to False.
close_child_stderr: If set to True, the stderr stream of each process will be sent to /dev/null after the fork. Defaults to False.
send_hup: If True, a process reload will be done by sending the SIGHUP signal. Defaults to False.
stop_signal: The signal to send when stopping the process. Can be specified as a number or a signal name. Signal names are case-insensitive and can include 'SIG' or not. So valid examples include quit, INT, SIGTERM and 3. Defaults to SIGTERM.
stop_children: When sending the stop_signal, send it to the children as well. Defaults to False.
max_retry: The number of times we attempt to start a process, before we abandon and stop the whole watcher. Defaults to 5. Set to -1 to disable max_retry and retry indefinitely.

graceful_timeout

The number of seconds to wait for a process to terminate gracefully before killing it.

When stopping a process, we first send it a stop_signal. A worker may catch this signal to perform clean up operations before exiting. If the worker is still active after graceful_timeout seconds, we send it a SIGKILL signal. It is not possible to catch SIGKILL signals so the worker will stop.

Defaults to 30s.

priority

Integer that defines a priority for the watcher. When the Arbiter do some operations on all watchers, it will sort them with this field, from the bigger number to the smallest. Defaults to 0.

singleton

If set to True, this watcher will have at the most one process. Defaults to False.

use_sockets

If set to True, this watcher will be able to access defined sockets via their file descriptors. If False, all parent fds are closed when the child process is forked. Defaults to False.

max_age

If set then the process will be restarted sometime after max_age seconds. This is useful when processes deal with pool of connectors: restarting processes improves the load balancing. Defaults to being disabled.

max_age_variance

If max_age is set then the process will live between max_age and max_age + random(0, max_age_variance) seconds. This avoids restarting all processes for a watcher at once. Defaults to 30 seconds.

on_demand

If set to True, the processes will be started only after the first connection to one of the configured sockets (see below). If a restart is needed, it will be only triggered at the next socket event.

hooks.*

Available hooks: before_start, after_start, before_spawn, after_spawn, before_stop, after_stop, before_signal, after_signal, extended_stats

Define callback functions that hook into the watcher startup/shutdown process.

If the hook returns False and if the hook is one of before_start, before_spawn, after_start or after_spawn, the startup will be aborted.

If the hook is before_signal and returns False, then the corresponding signal will not be sent (except SIGKILL which is always sent)

Notice that a hook that fails during the stopping process will not abort it.

The callback definition can be followed by a boolean flag separated by a comma. When the flag is set to true, any error occuring in the hook will be ignored. If set to false (the default), the hook will return False.

socket:NAME - as many sections as you want

host: The host of the socket. Defaults to 'localhost'
port: The port. Defaults to 8080.
family: The socket family. Can be 'AF_UNIX', 'AF_INET' or 'AF_INET6'. Defaults to 'AF_INET'.
type: The socket type. Can be 'SOCK_STREAM', 'SOCK_DGRAM', 'SOCK_RAW', 'SOCK_RDM' or 'SOCK_SEQPACKET'. Defaults to 'SOCK_STREAM'.
interface: When provided a network interface name like 'eth0', binds the socket to that particular device so that only packets received from that particular interface are processed by the socket. This can be used for example to limit which device to bind when binding on IN_ADDR_ANY (0.0.0.0) or IN_ADDR_BROADCAST (255.255.255.255). Note that this only works for some socket types, particularly AF_INET sockets.
path: When provided a path to a file that will be used as a unix socket file. If a path is provided, family is forced to AF_UNIX and host and port are ignored.
umask: When provided, sets the umask that will be used to create an AF_UNIX socket. For example, umask=000 will produce a socket with permission 777.
replace: When creating Unix sockets ('AF_UNIX'), an existing file may indicate a problem so the default is to fail. Specify True to simply remove the old file if you are sure that the socket is managed only by Circus.
so_reuseport: If set to True and SO_REUSEPORT is available on target platform, circus will create and bind new SO_REUSEPORT socket(s) for every worker it starts which is a user of this socket(s).
use_papa: Set to true to use the papa.

Once a socket is created, the ${circus.sockets.NAME} string can be used in the command (cmd or args) of a watcher. Circus will replace it by the FD value. The watcher must also have use_sockets set to True otherwise the socket will have been closed and you will get errors when the watcher tries to use it.

Example:

[watcher:webworker]
cmd = chaussette --fd $(circus.sockets.webapp) chaussette.util.bench_app
use_sockets = True
[socket:webapp]
host = 127.0.0.1
port = 8888

plugin:NAME - as many sections as you want

use: The fully qualified name that points to the plugin class.
anything else: Every other key found in the section is passed to the plugin constructor in the config mapping.
You can use all the watcher options, since a plugin is started like a watcher.

Circus comes with a few pre-shipped plugins but you can also extend them easily by developing your own.

env or env[:WATCHERS] - as many sections as you want

anything: The name of an environment variable to assign value to. bash style environment substitutions are supported. for example, append /bin to PATH 'PATH = $PATH:/bin'

Section responsible for delivering environment variable to run processes.

Example:

[watcher:worker1]
cmd = ping 127.0.0.1
[watcher:worker2]
cmd = ping 127.0.0.1
[env]
CAKE = lie

The variable CAKE will propagated to all watchers defined in config file.

WATCHERS can be a comma separated list of watcher sections to apply this environment to. if multiple env sections match a watcher, they will be combine in the order they appear in the configuration file. later entries will take precedence.

Example:

[watcher:worker1]
cmd = ping 127.0.0.1
[watcher:worker2]
cmd = ping 127.0.0.1
[env:worker1,worker2]
PATH = /bin
[env:worker1]
PATH = $PATH
[env:worker2]
CAKE = lie

worker1 will be run with PATH = $PATH (expanded from the environment circusd was run in) worker2 will be run with PATH = /bin and CAKE = lie

It's possible to use wildcards as well.

Example:

[watcher:worker1]
cmd = ping 127.0.0.1
[watcher:worker2]
cmd = ping 127.0.0.1
[env:worker*]
PATH = /bin

Both worker1 and worker2 will be run with PATH = /bin

Using environment variables

When writing your configuration file, you can use environment variables defined in the env section or in os.environ itself.

You just have to use the circus.env. prefix.

Example:

[watcher:worker1]
cmd = $(circus.env.shell)
[watcher:worker2]
baz = $(circus.env.user)
bar = $(circus.env.yeah)
sup = $(circus.env.oh)
[socket:socket1]
port = $(circus.env.port)
[plugin:plugin1]
use = some.path
parameter1 = $(circus.env.plugin_param)
[env]
yeah = boo
[env:worker2]
oh = ok

If a variable is defined in several places, the most specialized value has precedence: a variable defined in env:XXX will override a variable defined in env, which will override a variable defined in os.environ.

environment substitutions can be used in any section of the configuration in any section variable.

Formatting the commands and arguments with dynamic variables

As you may have seen, it is possible to pass some information that are computed dynamically when running the processes. Among other things, you can get the worker id (WID) and all the options that are passed to the Process. Additionally, it is possible to access the options passed to the Watcher which instanciated the process.

NOTE:

The worker id is different from the process id. It's a unique value, starting at 1, which is only unique for the watcher.

For instance, if you want to access some variables that are contained in the environment, you would need to do it with a setting like this:

cmd = "make-me-a-coffee --sugar $(CIRCUS.ENV.SUGAR_AMOUNT)"

This works with both cmd and args.

Important:

All variables are prefixed with circus.
The replacement is case insensitive.

Stream configuration

Simple stream class like QueueStream and StdoutStream don't have specific attributes but some other stream class may have some:

FileStream

filename: The file path where log will be written.
time_format: The strftime format that will be used to prefix each time with a timestamp. By default they will be not prefixed.
i.e: %Y-%m-%d %H:%M:%S
max_bytes: The max size of the log file before a new file is started. If not provided, the file is not rolled over.
backup_count: The number of log files that will be kept By default backup_count is null.

NOTE:

Rollover occurs whenever the current log file is nearly max_bytes in length. If backup_count is >= 1, the system will successively create new files with the same pathname as the base file, but with extensions ".1", ".2" etc. appended to it. For example, with a backup_count of 5 and a base file name of "app.log", you would get "app.log", "app.log.1", "app.log.2", ... through to "app.log.5". The file being written to is always "app.log" - when it gets filled up, it is closed and renamed to "app.log.1", and if files "app.log.1", "app.log.2" etc. exist, then they are renamed to "app.log.2", "app.log.3" etc. respectively.

Example:

[watcher:myprogram]
cmd = python -m myapp.server
stdout_stream.class = FileStream
stdout_stream.filename = test.log
stdout_stream.time_format = %Y-%m-%d %H:%M:%S
stdout_stream.max_bytes = 1073741824
stdout_stream.backup_count = 5

WatchedFileStream

filename: The file path where log will be written.
time_format: The strftime format that will be used to prefix each time with a timestamp. By default they will be not prefixed.
i.e: %Y-%m-%d %H:%M:%S

NOTE:

WatchedFileStream relies on an external log rotation tool to ensure that log files don't become too big. The output file will be monitored and if it is ever deleted or moved by the external log rotation tool, then the output file handle will be automatically reloaded.

Example:

[watcher:myprogram]
cmd = python -m myapp.server
stdout_stream.class = WatchedFileStream
stdout_stream.filename = test.log
stdout_stream.time_format = %Y-%m-%d %H:%M:%S

TimedRotatingFileStream

filename

The file path where log will be written.

backup_count

The number of log files that will be kept By default backup_count is null.

time_format

The strftime format that will be used to prefix each time with a timestamp. By default they will be not prefixed.

i.e: %Y-%m-%d %H:%M:%S

rotate_when

The type of interval. The list of possible values is below. Note that they are not case sensitive.

Value	Type of interval
'S'	Seconds
'M'	Minutes
'H'	Hours
'D'	Days
'W0'-'W6'	Weekday (0=Monday)
'midnight'	Roll over at midnight

rotate_interval

The rollover interval.

NOTE:

TimedRotatingFileStream rotates logfiles at certain timed intervals. Rollover interval is determined by a combination of rotate_when and rotate_interval.

Example:

[watcher:myprogram]
cmd = python -m myapp.server
stdout_stream.class = TimedRotatingFileStream
stdout_stream.filename = test.log
stdout_stream.time_format = %Y-%m-%d %H:%M:%S
stdout_stream.utc = True
stdout_stream.rotate_when = H
stdout_stream.rotate_interval = 1

FancyStdoutStream

color

The name of an ascii color:

red
green
yellow
blue
magenta
cyan
white

time_format: The strftime format that each line will be prefixed with.
Default to: %Y-%m-%d %H:%M:%S

Example:

[watcher:myprogram]
cmd = python -m myapp.server
stdout_stream.class = FancyStdoutStream
stdout_stream.color = green
stdout_stream.time_format = %Y/%m/%d | %H:%M:%S

Commands

At the epicenter of circus lives the command systems. circusctl is just a zeromq client, and if needed you can drive programmaticaly the Circus system by writing your own zmq client.

All messages are JSON mappings.

For each command below, we provide a usage example with circusctl but also the input / output zmq messages.

circus-ctl commands

add: commands/add
decr: commands/decr
dstats: commands/dstats
get: commands/get
globaloptions: commands/globaloptions
incr: commands/incr
ipython: commands/ipython
list: commands/list
listen: commands/listen
listsockets: commands/listsockets
numprocesses: commands/numprocesses
numwatchers: commands/numwatchers
options: commands/options
quit: commands/quit
reload: commands/reload
reloadconfig: commands/reloadconfig
restart: commands/restart
rm: commands/rm
set: commands/set
signal: commands/signal
start: commands/start
stats: commands/stats
status: commands/status
stop: commands/stop

Add a watcher

This command add a watcher dynamically to a arbiter.

ZMQ Message

{


    "command": "add",


    "properties": {


        "cmd": "/path/to/commandline --option"


        "name": "nameofwatcher"


        "args": [],


        "options": {},


        "start": false


    }
}

A message contains 2 properties:

cmd: Full command line to execute in a process
args: array, arguments passed to the command (optional)
name: name of watcher
options: options of a watcher
start: start the watcher after the creation

The response return a status "ok".

Command line

$ circusctl add [--start] <name> <cmd>

Options

<name>: name of the watcher to create
<cmd>: full command line to execute in a process
--start: start the watcher immediately

Decrement the number of processes in a watcher

This comment decrement the number of processes in a watcher by <nbprocess>, 1 being the default.

ZMQ Message

{


    "command": "decr",


    "propeties": {


        "name": "<watchername>"


        "nb": <nbprocess>


        "waiting": False


    }
}

The response return the number of processes in the 'numprocesses` property:

{ "status": "ok", "numprocesses": <n>, "time", "timestamp" }

Command line

$ circusctl decr <name> [<nb>] [--waiting]

Options

<name>: name of the watcher
<nb>: the number of processes to remove.

Get circusd stats

You can get at any time some statistics about circusd with the dstat command.

ZMQ Message

To get the circusd stats, simply run:

{


    "command": "dstats"
}

The response returns a mapping the property "infos" containing some process informations:

{


  "info": {


    "children": [],


    "cmdline": "python",


    "cpu": 0.1,


    "ctime": "0:00.41",


    "mem": 0.1,


    "mem_info1": "3M",


    "mem_info2": "2G",


    "nice": 0,


    "pid": 47864,


    "username": "root"


  },


  "status": "ok",


  "time": 1332265655.897085
}

Command Line

$ circusctl dstats

Get the value of specific watcher options

This command can be used to query the current value of one or more watcher options.

ZMQ Message

{


    "command": "get",


    "properties": {


        "keys": ["key1, "key2"]


        "name": "nameofwatcher"


    }
}

A request message contains two properties:

keys: list, The option keys for which you want to get the values
name: name of watcher

The response object has a property options which is a dictionary of option names and values.

eg:

{


    "status": "ok",


    "options": {


        "graceful_timeout": 300,


        "send_hup": True,


    },


    time': 1332202594.754644
}

Command line

$ circusctl get <name> <key1> <key2>

Get the arbiter options

This command return the arbiter options

ZMQ Message

{


    "command": "globaloptions",


    "properties": {


        "key1": "val1",


        ..


    }
}

A message contains 2 properties:

•: keys: list, The option keys for which you want to get the values

The response return an object with a property "options" containing the list of key/value returned by circus.

eg:

{


    "status": "ok",


    "options": {


        "check_delay": 1,


        ...


    },


    time': 1332202594.754644
}

Command line

$ circusctl globaloptions

Options

Options Keys are:

endpoint: the controller ZMQ endpoint
pubsub_endpoint: the pubsub endpoint
check_delay: the delay between two controller points
multicast_endpoint: the multicast endpoint for circusd cluster auto-discovery

Increment the number of processes in a watcher

This comment increment the number of processes in a watcher by <nbprocess>, 1 being the default

ZMQ Message

{


    "command": "incr",


    "properties": {


        "name": "<watchername>",


        "nb": <nbprocess>,


        "waiting": False


    }
}

The response return the number of processes in the 'numprocesses` property:

{ "status": "ok", "numprocesses": <n>, "time", "timestamp" }

Command line

$ circusctl incr <name> [<nb>] [--waiting]

Options

<name>: name of the watcher.
<nb>: the number of processes to add.

Create shell into circusd process

This command is only useful if you have the ipython package installed.

Command Line

$ circusctl ipython

Get list of watchers or processes in a watcher

ZMQ Message

To get the list of all the watchers:

{


    "command": "list",
}

To get the list of active processes in a watcher:

{


    "command": "list",


    "properties": {


        "name": "nameofwatcher",


    }
}

The response return the list asked. the mapping returned can either be 'watchers' or 'pids' depending the request.

Command line

$ circusctl list [<name>]

Subscribe to a watcher event

ZMQ

At any moment you can subscribe to a circus event. Circus provides a PUB/SUB feed on which any clients can subscribe. The subscriber endpoint URI is set in the circus.ini configuration file.

Events are pubsub topics:

watcher.<watchername>.reap: when a process is reaped
watcher.<watchername>.spawn: when a process is spawned
watcher.<watchername>.kill: when a process is killed
watcher.<watchername>.updated: when watcher configuration is updated
watcher.<watchername>.stop: when a watcher is stopped
watcher.<watchername>.start: when a watcher is started

All events messages are in a json struct.

Command line

The client has been updated to provide a simple way to listen on the events:

circusctl listen [<topic>, ...]

Example of result:

$ circusctl listen tcp://127.0.0.1:5556
watcher.refuge.spawn: {u'process_id': 6, u'process_pid': 72976,


                       u'time': 1331681080.985104}
watcher.refuge.spawn: {u'process_id': 7, u'process_pid': 72995,


                       u'time': 1331681086.208542}
watcher.refuge.spawn: {u'process_id': 8, u'process_pid': 73014,


                       u'time': 1331681091.427005}

Get the list of sockets

ZMQ Message

To get the list of sockets:

{


    "command": "listsockets",
}

The response return a list of json mappings with keys for fd, name, host and port.

Command line

$ circusctl listsockets

Get the number of processes

Get the number of processes in a watcher or in a arbiter

ZMQ Message

{


    "command": "numprocesses",


    "propeties": {


        "name": "<watchername>"


    }
}

The response return the number of processes in the 'numprocesses` property:

{ "status": "ok", "numprocesses": <n>, "time", "timestamp" }

If the property name isn't specified, the sum of all processes managed is returned.

Command line

$ circusctl numprocesses [<name>]

Options

•: <name>: name of the watcher

Get the number of watchers

Get the number of watchers in a arbiter

ZMQ Message

{


    "command": "numwatchers",
}

The response return the number of watchers in the 'numwatchers` property:

{ "status": "ok", "numwatchers": <n>, "time", "timestamp" }

Command line

$ circusctl numwatchers

Get the value of all options for a watcher

This command returns all option values for a given watcher.

ZMQ Message

{


    "command": "options",


    "properties": {


        "name": "nameofwatcher",


    }
}

A message contains 1 property:

•: name: name of watcher

The response object has a property options which is a dictionary of option names and values.

eg:

{


    "status": "ok",


    "options": {


        "graceful_timeout": 300,


        "send_hup": True,


        ...


    },


    time': 1332202594.754644
}

Command line

$ circusctl options <name>

Options

•: <name>: name of the watcher

Options Keys are:

numprocesses: integer, number of processes
warmup_delay: integer or number, delay to wait between process spawning in seconds
working_dir: string, directory where the process will be executed
uid: string or integer, user ID used to launch the process
gid: string or integer, group ID used to launch the process
send_hup: boolean, if TRU the signal HUP will be used on reload
shell: boolean, will run the command in the shell environment if true
cmd: string, The command line used to launch the process
env: object, define the environnement in which the process will be launch
retry_in: integer or number, time in seconds we wait before we retry to launch the process if the maximum number of attempts has been reach.
max_retry: integer, The maximum of retries loops
graceful_timeout: integer or number, time we wait before we definitely kill a process.
priority: used to sort watchers in the arbiter
singleton: if True, a singleton watcher.
max_age: time a process can live before being restarted
max_age_variance: variable additional time to live, avoids stampeding herd.

Quit the arbiter immediately

When the arbiter receive this command, the arbiter exit.

ZMQ Message

{


    "command": "quit",


    "waiting": False
}

The response return the status "ok".

If waiting is False (default), the call will return immediately after calling stop_signal on each process.

If waiting is True, the call will return only when the stop process is completely ended. Because of the graceful_timeout option, it can take some time.

Command line

$ circusctl quit [--waiting]

Reload the arbiter or a watcher

This command reloads all the process in a watcher or all watchers. This will happen in one of 3 ways:

If graceful is false, a simple restart occurs.
If send_hup is true for the watcher, a HUP signal is sent to each process.

Otherwise:

If sequential is false, the arbiter will attempt to spawn numprocesses new processes. If the new processes are spawned successfully, the result is that all of the old processes are stopped, since by default the oldest processes are stopped when the actual number of processes for a watcher is greater than numprocesses.
If sequential is true, the arbiter will restart each process in a sequential way (with a warmup_delay pause between each step)

ZMQ Message

{


    "command": "reload",


    "properties": {


        "name": '<name>",


        "graceful": true,


        "sequential": false,


        "waiting": False


    }
}

The response return the status "ok". If the property graceful is set to true the processes will be exited gracefully.

If the property name is present, then the reload will be applied to the watcher.

Command line

$ circusctl reload [<name>] [--terminate] [--waiting]


                            [--sequential]

Options

<name>: name of the watcher
--terminate; quit the node immediately

Reload the configuration file

This command reloads the configuration file, so changes in the configuration file will be reflected in the configuration of circus.

ZMQ Message

{


    "command": "reloadconfig",


    "waiting": False
}

The response return the status "ok". If the property graceful is set to true the processes will be exited gracefully.

Command line

$ circusctl reloadconfig [--waiting]

Restart the arbiter or a watcher

This command restart all the process in a watcher or all watchers. This funtion simply stop a watcher then restart it.

ZMQ Message

{


    "command": "restart",


    "properties": {


        "name": "<name>",


        "waiting": False,


        "match": "[simple|glob|regex]"


    }
}

The response return the status "ok".

If the property name is present, then the reload will be applied to the watcher.

If waiting is False (default), the call will return immediately after calling stop_signal on each process.

If waiting is True, the call will return only when the restart process is completely ended. Because of the graceful_timeout option, it can take some time.

The match parameter can have the value simple for string compare, glob for wildcard matching (default) or regex for regex matching.

Command line

$ circusctl restart [name] [--waiting] [--match=simple|glob|regex]

Options

<name>: name or pattern of the watcher(s)
<match>: watcher match method

Remove a watcher

This command removes a watcher dynamically from the arbiter. The watchers are gracefully stopped by default.

ZMQ Message

{


    "command": "rm",


    "properties": {


        "name": "<nameofwatcher>",


        "nostop": False,


        "waiting": False


    }
}

The response return a status "ok".

If nostop is True (default: False), the processes for the watcher will not be stopped - instead the watcher will just be forgotten by circus and the watcher processes will be responsible for stopping themselves. If nostop is not specified or is False, then the watcher processes will be stopped gracefully.

If waiting is False (default), the call will return immediately after starting to remove and stop the corresponding watcher.

If waiting is True, the call will return only when the remove and stop process is completely ended. Because of the graceful_timeout option, it can take some time.

Command line

$ circusctl rm <name> [--waiting] [--nostop]

Options

<name>: name of the watcher to remove
nostop: do not stop the watcher processes, just remove the watcher

Set a watcher option

ZMQ Message

{


    "command": "set",


    "properties": {


        "name": "nameofwatcher",


        "options": {


            "key1": "val1",


            ..


        }


        "waiting": False


    }
}

The response return the status "ok". See the command Options for a list of key to set.

Command line

$ circusctl set <name> <key1> <value1> <key2> <value2> --waiting

Send a signal

This command allows you to send a signal to all processes in a watcher, a specific process in a watcher or its children.

ZMQ Message

To send a signal to all the processes for a watcher:

{


    "command": "signal",


    "property": {


        "name": <name>,


        "signum": <signum>
}

To send a signal to a process:

{


    "command": "signal",


    "property": {


        "name": <name>,


        "pid": <processid>,


        "signum": <signum>
}

An optional property "children" can be used to send the signal to all the children rather than the process itself:

{


    "command": "signal",


    "property": {


        "name": <name>,


        "pid": <processid>,


        "signum": <signum>,


        "children": True
}

To send a signal to a process child:

{


    "command": "signal",


    "property": {


        "name": <name>,


        "pid": <processid>,


        "signum": <signum>,


        "child_pid": <childpid>,
}

It is also possible to send a signal to all the children of the watcher:

{


    "command": "signal",


    "property": {


        "name": <name>,


        "signum": <signum>,


        "children": True
}

Lastly, you can send a signal to the process and its children, with the recursive option:

{


    "command": "signal",


    "property": {


        "name": <name>,


        "signum": <signum>,


        "recursive": True
}

Command line

$ circusctl signal <name> [<pid>] [--children]


        [--recursive] <signum>

Options:

<name>: the name of the watcher
<pid>: integer, the process id.
<signum>: the signal number (or name) to send.
<childpid>: the pid of a child, if any
<children>: boolean, send the signal to all the children
<recursive>: boolean, send the signal to the process and its children

Start the arbiter or a watcher

This command starts all the processes in a watcher or all watchers.

ZMQ Message

{


    "command": "start",


    "properties": {


        "name": '<name>",


        "waiting": False,


        "match": "[simple|glob|regex]"


    }
}

The response return the status "ok".

If the property name is present, the watcher will be started.

If waiting is False (default), the call will return immediately after calling start on each process.

If waiting is True, the call will return only when the start process is completely ended. Because of the graceful_timeout option, it can take some time.

The match parameter can have the value simple for string compare, glob for wildcard matching (default) or regex for regex matching.

Command line

$ circusctl restart [name] [--waiting] [--match=simple|glob|regex]

Options

<name>: name or pattern of the watcher(s)
<match>: watcher match method

Get process infos

You can get at any time some statistics about your processes with the stat command.

ZMQ Message

To get stats for all watchers:

{


    "command": "stats"
}

To get stats for a watcher:

{


    "command": "stats",


    "properties": {


        "name": <name>


    }
}

To get stats for a process:

{


    "command": "stats",


    "properties": {


        "name": <name>,


        "process": <processid>


    }
}

Stats can be extended with the extended_stats hook but extended stats need to be requested:

{


    "command": "stats",


    "properties": {


        "name": <name>,


        "process": <processid>,


        "extended": True


    }
}

The response retun an object per process with the property "info" containing some process informations:

{


  "info": {


    "children": [],


    "cmdline": "python",


    "cpu": 0.1,


    "ctime": "0:00.41",


    "mem": 0.1,


    "mem_info1": "3M",


    "mem_info2": "2G",


    "nice": 0,


    "pid": 47864,


    "username": "root"


  },


  "process": 5,


  "status": "ok",


  "time": 1332265655.897085
}

Command Line

$ circusctl stats [--extended] [<watchername>] [<processid>]

Get the status of a watcher or all watchers

This command start get the status of a watcher or all watchers.

ZMQ Message

{


    "command": "status",


    "properties": {


        "name": '<name>",


    }
}

The response return the status "active" or "stopped" or the status / watchers.

Command line

$ circusctl status [<name>]

Options

•: <name>: name of the watcher

Example

$ circusctl status dummy
active
$ circusctl status
dummy: active
dummy2: active
refuge: active

Stop watchers

This command stops a given watcher or all watchers.

ZMQ Message

{


    "command": "stop",


    "properties": {


        "name": "<name>",


        "waiting": False,


        "match": "[simple|glob|regex]"


    }
}

The response returns the status "ok".

If the name property is present, then the stop will be applied to the watcher corresponding to that name. Otherwise, all watchers will get stopped.

If waiting is False (default), the call will return immediatly after calling stop_signal on each process.

If waiting is True, the call will return only when the stop process is completly ended. Because of the graceful_timeout option, it can take some time.

The match parameter can have the value simple for string compare, glob for wildcard matching (default) or regex for regex matching.

Command line

$ circusctl stop [name] [--waiting] [--match=simple|glob|regex]

Options

<name>: name or pattern of the watcher(s)
<match>: watcher match method

CLI tools

circus-top

circus-top is a top-like console you can run to watch live your running Circus system. It will display the CPU, Memory usage and socket hits if you have some.

Example of output:

-----------------------------------------------------------------------
circusd-stats


 PID                 CPU (%)             MEMORY (%)
14252                 0.8                 0.4


                      0.8 (avg)           0.4 (sum)
dummy


 PID                 CPU (%)             MEMORY (%)
14257                 78.6                0.1
14256                 76.6                0.1
14258                 74.3                0.1
14260                 71.4                0.1
14259                 70.7                0.1


                      74.32 (avg)         0.5 (sum)
----------------------------------------------------------------------

circus-top is a read-only console. If you want to interact with the system, use circusctl.

circusctl

circusctl can be used to run any command listed in commands . For example, you can get a list of all the watchers, you can do

$ circusctl list

Besides supporting a handful of options you can also specify the endpoint circusctl should use using the CIRCUSCTL_ENDPOINT environment variable.

The Web Console

Circus comes with a Web Console that can be used to manage the system.

The Web Console lets you:

Connect to any running Circus system
Watch the processes CPU and Memory usage in real-time
Add or kill processes
Add new watchers

NOTE:

The real-time CPU & Memory usage feature uses the stats socket. If you want to activate it, make sure the Circus system you'll connect to has the stats enpoint enabled in its configuration:

[circus]
statsd = True

By default, this option is not activated.

The web console is its own package, you need to install:

$ pip install circus-web

To enable the console, add a few options in the Circus ini file:

[circus]
httpd = True
httpd_host = localhost
httpd_port = 8080

httpd_host and httpd_port are optional, and default to localhost and 8080.

If you want to run the web app on its own, just run the circushttpd script:

$ circushttpd
Bottle server starting up...
Listening on http://localhost:8080/
Hit Ctrl-C to quit.

By default the script will run the Web Console on port 8080, but the --port option can be used to change it.

Using the console

Once the script is running, you can open a browser and visit http://localhost:8080. You should get this screen: [image]

The Web Console is ready to be connected to a Circus system, given its endpoint. By default the endpoint is tcp://127.0.0.1:5555.

Once you hit Connect, the web application will connect to the Circus system.

With the Web Console logged in, you should get a list of watchers, and a real-time status of the two Circus processes (circusd and circusd-stats).

You can click on the status of each watcher to toggle it from Active (green) to Inactive (red). This change is effective immediatly and let you start & stop watchers.

If you click on the watcher name, you will get a web page for that particular watcher, with its processes:

On this screen, you can add or remove processes, and kill existing ones.

Last but not least, you can add a brand new watcher by clicking on the Add Watcher link in the left menu: .SS Running behind Nginx

Nginx can act as a proxy and security layer in front of circus-web.

NOTE:

To receive real-time status updates and graphs in circus-web, you must provide a Nginx proxy solution that has websocket support

Nginx >= 1.3.13

As of Nginx>=1.3.13 websocket support is built-in, so there is no need to combine Nginx with Varnish or HAProxy. An example Nginx config with websocket support:

upstream circusweb_server {


  server 127.0.0.1:8080;
}
server {


 listen   80;


 server_name  _;


 location / {


   proxy_pass http://circusweb_server;


   proxy_http_version 1.1;


   proxy_set_header Upgrade $http_upgrade;


   proxy_set_header Connection "upgrade";


   proxy_set_header Host $host;


   proxy_set_header X-Real-IP $remote_addr;


   proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;


   proxy_set_header X-Forwarded-Proto http;


   proxy_redirect off;


  }


 location ~/media/\*(.png|.jpg|.css|.js|.ico)$ {


   alias /path_to_site-packages/circusweb/media/;


  }
}

Nginx < 1.3.13

Nginx versions < 1.3.13 do not have websocket support built-in.

To provide websocket support for circus-web when using Nginx < 1.3.13, you can combine Nginx with Varnish or HAProxy. That is, Nginx in front of circus-web, with Varnish or HAProxy in front of Nginx.

The example below shows the combined Nginix and Varnish configuration required to proxy circus-web and provide websocket support.

Nginx configuration:

upstream circusweb_server {


  server 127.0.0.1:8080;
}
server {


 listen   8001;


 server_name  _;


 location / {


    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;


    proxy_set_header Host $http_host;


    proxy_redirect off;


    proxy_pass http://circusweb_server;


  }


 location ~/media/\*(.png|.jpg|.css|.js|.ico)$ {


   alias /path_to_site-packages/circusweb/media/;


  }
}

If you want more Nginx configuration options, see http://wiki.nginx.org/HttpProxyModule.

Varnish configuration:

backend default {


    .host = "127.0.0.1";


    .port = "8001";
}
backend socket {


    .host = "127.0.0.1";


    .port = "8080";


    .connect_timeout = 1s;


    .first_byte_timeout = 2s;


    .between_bytes_timeout = 60s;
}
sub vcl_pipe {


     if (req.http.upgrade) {


         set bereq.http.upgrade = req.http.upgrade;


     }
}
sub vcl_recv {


    if (req.http.Upgrade ~ "(?i)websocket") {


        set req.backend = socket;


      return (pipe);


    }
}

In the Varnish configuration example above two backends are defined. One serving the web console and one serving the socket connections. Web console requests are bound to port 8001. The Nginx 'server' directive should be configured to listen on port 8001.

Websocket connections are upgraded and piped directly to the circushttpd process listening on port 8080 by Varnish. i.e. bypassing the Nginx proxy.

Ubuntu

Since the version 13.10 (Saucy), Ubuntu includes Nginx with websocket support in its own repositories. For older versions, you can install Nginx>=1.3.13 from the official Nginx stable PPA, as so:

sudo apt-get install python-software-properties
sudo add-apt-repository ppa:nginx/stable
sudo apt-get update
sudo apt-get install nginx
nginx -v

Password-protect circushttpd

As explained in the Security page, running circushttpd is pretty unsafe. We don't provide any security in Circus itself, but you can protect your console at the NGinx level, by using http://wiki.nginx.org/HttpAuthBasicModule

Example:

location / {


    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;


    proxy_set_header Host $http_host;


    proxy_set_header X-Forwarded-Host: $http_host;


    proxy_set_header X-Forwarded-Proto: $scheme;


    proxy_redirect off;


    proxy_pass http://127.0.0.1:8080;


    auth_basic            "Restricted";


    auth_basic_user_file  /path/to/htpasswd;
}

The htpasswd file contains users and their passwords, and a password prompt will pop when you access the console.

You can use Apache's htpasswd script to edit it, or the Python script they provide at: http://trac.edgewall.org/browser/trunk/contrib/htpasswd.py

However, there's no native support for the combined use of HTTP Authentication and WebSockets (the server will throw HTTP 401 error codes). A workaround is to disable such authentication for the socket.io server.

Example (needs to be added before the previous rule):

location /socket.io {


    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;


    proxy_set_header Host $http_host;


    proxy_set_header X-Forwarded-Host: $http_host;


    proxy_set_header X-Forwarded-Proto: $scheme;


    proxy_redirect off;


    proxy_pass http://127.0.0.1:8080;
}

Of course that's just one way to protect your web console, you could use many other techniques.

Extending the web console

We picked bottle to build the webconsole, mainly because it's a really tiny framework that doesn't do much. By having a look at the code of the web console, you'll eventually find out that it's really simple to understand.

Here is how it's split:

The circushttpd.py file contains the "views" definitions and some code to handle the socket connection (via socketio).
the controller.py contains a single class which is in charge of doing the communication with the circus controller. It allows to have a nicer high level API when defining the web server.

If you want to add a feature in the web console you can reuse the code that's existing. A few tools are at your disposal to ease the process:

There is a render_template function, which takes the named arguments you pass to it and pass them to the template renderer and return the resulting HTML. It also passes some additional variables, such as the session, the circus version and the client if defined.
If you want to run commands and doa redirection depending the result of it, you can use the run_command function, which takes a callable as a first argument, a message in case of success and a redirection url.

The StatsNamespace class is responsible for managing the websocket communication on the server side. Its documentation should help you to understand what it does.

Working with sockets

Circus can bind network sockets and manage them as it does for processes.

The main idea is that a child process that's created by Circus to run one of the watcher's command can inherit from all the opened file descriptors.

That's how Apache or Unicorn works, and many other tools out there.

Goal

The goal of having sockets managed by Circus is to be able to manage network applications in Circus exactly like other applications.

For example, if you use Circus with Chaussette -- a WGSI server, you can get a very fast web server running and manage "Web Workers" in Circus as you would do for any other process.

Splitting the socket managment from the network application itself offers a lot of opportunities to scale and manage your stack.

Design

The gist of the feature is done by binding the socket and start listening to it in circusd:

import socket
sock = socket.socket(FAMILY, TYPE)
sock.bind((HOST, PORT))
sock.listen(BACKLOG)
fd = sock.fileno()

Circus then keeps track of all the opened fds, and let the processes it runs as children have access to them if they want.

If you create a small Python network script that you intend to run in Circus, it could look like this:

import socket
import sys
fd = int(sys.argv[1])   # getting the FD from circus
sock = socket.fromfd(fd, FAMILY, TYPE)
# dealing with one request at a time
while True:


    conn, addr = sock.accept()


    request = conn.recv(1024)


    .. do something ..


    conn.sendall(response)


    conn.close()

Then Circus could run like this:

[circus]
check_delay = 5
endpoint = tcp://127.0.0.1:5555
pubsub_endpoint = tcp://127.0.0.1:5556
stats_endpoint = tcp://127.0.0.1:5557
[watcher:dummy]
cmd = mycoolscript $(circus.sockets.foo)
use_sockets = True
warmup_delay = 0
numprocesses = 5
[socket:foo]
host = 127.0.0.1
port = 8888

$(circus.sockets.foo) will be replaced by the FD value once the socket is created and bound on the 8888 port.

NOTE:

Starting at Circus 0.8 there's an alternate syntax to avoid some conflicts with some config parsers. You can write:

((circus.sockets.foo))

Real-world example

Chaussette is the perfect Circus companion if you want to run your WSGI application.

Once it's installed, running 5 meinheld workers can be done by creating a socket and calling the chaussette command in a worker, like this:

[circus]
endpoint = tcp://127.0.0.1:5555
pubsub_endpoint = tcp://127.0.0.1:5556
stats_endpoint = tcp://127.0.0.1:5557
[watcher:web]
cmd = chaussette --fd $(circus.sockets.web) --backend meinheld mycool.app
use_sockets = True
numprocesses = 5
[socket:web]
host = 0.0.0.0
port = 8000

We did not publish benchmarks yet, but a Web cluster managed by Circus with a Gevent or Meinheld backend is as fast as any pre-fork WSGI server out there.

Using built-in plugins

Circus comes with a few built-in plugins. This section presents these plugins and their configuration options.

Statsd

use: set to 'circus.plugins.statsd.StatsdEmitter'
application_name: the name used to identify the bucket prefix to emit the stats to (it will be prefixed with circus. and suffixed with .watcher)
host: the host to post the statds data to
port: the port the statsd daemon listens on
sample_rate: if you prefer a different sample rate than 1, you can set it here

FullStats

An extension on the Statsd plugin that is also publishing the process stats. As such it has the same configuration options as Statsd and the following.

use: set to circus.plugins.statsd.FullStats
loop_rate: the frequency the plugin should ask for the stats in seconds. Default: 60.

RedisObserver

This services observers a redis process for you, publishes the information to statsd and offers to restart the watcher when it doesn't react in a given timeout. This plugin requires redis-py to run.

It has the same configuration as statsd and adds the following:

use: set to circus.plugins.redis_observer.RedisObserver
loop_rate: the frequency the plugin should ask for the stats in seconds. Default: 60.
redis_url: the database to check for as a redis url. Default: "redis://localhost:6379/0"
timeout: the timeout in seconds the request can take before it is considered down. Defaults to 5.
restart_on_timeout: the name of the process to restart when the request timed out. No restart triggered when not given. Default: None.

HttpObserver

This services observers a http process for you by pinging a certain website regularly. Similar to the redis observer it offers to restart the watcher on an error. It requires tornado to run.

It has the same configuration as statsd and adds the following:

use: set to circus.plugins.http_observer.HttpObserver
loop_rate: the frequency the plugin should ask for the stats in seconds. Default: 60.
check_url: the url to check for. Default: http://localhost/
timeout: the timeout in seconds the request can take before it is considered down. Defaults to 10.
restart_on_error: the name of the process to restart when the request timed out or returned any other kind of error. No restart triggered when not given. Default: None.

ResourceWatcher

This services watches the resources of the given process and triggers a restart when they exceed certain limitations too often in a row.

It has the same configuration as statsd and adds the following:

use: set to circus.plugins.resource_watcher.ResourceWatcher
loop_rate: the frequency the plugin should ask for the stats in seconds. Default: 60.
watcher: the watcher this resource watcher should be looking after. (previously called service but service is now deprecated)
max_cpu: The maximum cpu one process is allowed to consume (in %). Default: 90
min_cpu: The minimum cpu one process should consume (in %). Default: None (no minimum) You can set the min_cpu to 0 (zero), in this case if one process consume exactly 0% cpu, it will trigger an exceeded limit.
max_mem: The amount of memory one process of this watcher is allowed to consume. Default: 90. If no unit is specified, the value is in %. Example: 50 If a unit is specified, the value is in bytes. Supported units are B, K, M, G, T, P, E, Z, Y. Example: 250M
min_mem: The minimum memory one process of this watcher should consume. Default: None (no minimum). If no unit is specified, the value is in %. Example: 50 If a unit is specified, the value is in bytes. Supported units are B, K, M, G, T, P, E, Z, Y. Example: 250M
health_threshold: The health is the average of cpu and memory (in %) the watchers processes are allowed to consume (in %). Default: 75
max_count: How often these limits (each one is counted separately) are allowed to be exceeded before a restart will be triggered. Default: 3

Example:

[circus]
; ...
[watcher:program]
cmd = sleep 120
[plugin:myplugin]
use = circus.plugins.resource_watcher.ResourceWatcher
watcher = program
min_cpu = 10
max_cpu = 70
min_mem = 0
max_mem = 20

Watchdog

Plugin that binds an udp socket and wait for watchdog messages. For "watchdoged" processes, the watchdog will kill them if they don't send a heartbeat in a certain period of time materialized by loop_rate * max_count. (circus will automatically restart the missing processes in the watcher)

Each monitored process should send udp message at least at the loop_rate. The udp message format is a line of text, decoded using msg_regex parameter. The heartbeat message MUST at least contain the pid of the process sending the message.

The list of monitored watchers are determined by the parameter watchers_regex in the configuration.

Configuration parameters:

use: set to circus.plugins.watchdog.WatchDog
loop_rate: watchdog loop rate in seconds. At each loop, WatchDog will looks for "dead" processes.
watchers_regex: regex for matching watcher names that should be monitored by the watchdog (default: .* all watchers are monitored)
msg_regex: regex for decoding the received heartbeat message in udp (default: ^(?P<pid>.*);(?P<timestamp>.*)$) the default format is a simple text message: pid;timestamp
max_count: max number of passed loop without receiving any heartbeat before restarting process (default: 3)
ip: ip the watchdog will bind on (default: 127.0.0.1)
port: port the watchdog will bind on (default: 1664)

Flapping

When a worker restarts too often, we say that it is flapping. This plugin keeps track of worker restarts and stops the corresponding watcher in case it is flapping. This plugin may be used to automatically stop workers that get constantly restarted because they're not working properly.

use: set to circus.plugins.flapping.Flapping
attempts: the number of times a process can restart, within window seconds, before we consider it flapping (default: 2)
window: the time window in seconds to test for flapping. If the process restarts more than attempts times within this time window, we consider it a flapping process. (default: 1)
retry_in: time in seconds to wait until we try to start again a process that has been flapping. (default: 7)
max_retry: the number of times we attempt to start a process that has been flapping, before we abandon and stop the whole watcher. (default: 5) Set to -1 to disable max_retry and retry indefinitely.
active: define if the plugin is active or not (default: True). If the global flag is set to False, the plugin is not started.

Options can be overriden in the watcher section using a flapping. prefix. For instance, here is how you would configure a specific max_retry value for nginx:

[watcher:nginx]
cmd = /path/to/nginx
flapping.max_retry = 2
[watcher:myscript]
cmd = ./my_script.py
; ... other watchers
[plugin:flapping]
use = circus.plugins.flapping.Flapping
max_retry = 5

CommandReloader

This plugin will restart watchers when their command file is modified. It works by checking the modification time and the path of the file pointed by the cmd option every loop_rate seconds. This may be useful while developing worker processes or even for hot code upgrade in production.

use: set to circus.plugins.command_reloader.CommandReloader
loop_rate: the frequency the plugin should check for modification in seconds. Default: 1.

Deployment

Although the Circus daemon can be managed with the circusd command, it's easier to have it start on boot. If your system supports Upstart, you can create this Upstart script in /etc/init/circus.conf.

start on filesystem and net-device-up IFACE=lo
stop on runlevel [016]
respawn
exec /usr/local/bin/circusd /etc/circus/circusd.ini

This assumes that circusd.ini is located at /etc/circus/circusd.ini. After rebooting, you can control circusd with the service command:

# service circus start/stop/restart

If your system supports systemd, you can create this systemd unit file under /etc/systemd/system/circus.service.

[Unit]
Description=Circus process manager
After=syslog.target network.target nss-lookup.target
[Service]
Type=simple
ExecReload=/usr/bin/circusctl reload
ExecStart=/usr/bin/circusd /etc/circus/circus.ini
Restart=always
RestartSec=5
[Install]
WantedBy=default.target

A reboot isn't required if you run the daemon-reload command below:

# systemctl --system daemon-reload

Then circus can be managed via:

# systemctl start/stop/status/reload circus

Recipes

This section will contain recipes to deploy Circus. Until then you can look at Pete's Puppet recipe or at Remy's Chef recipe

Papa Process Kernel

One problem common to process managers is that you cannot restart the process manager without restarting all of the processes it manages. This makes it difficult to deploy a new version of Circus or new versions of any of the libraries on which it depends.

If you are on a Unix-type system, Circus can use the Papa process kernel. When used, Papa will create a long-lived daemon that will serve as the host for any processes and sockets you create with it. If circus is shutdown, Papa will maintain everything it is hosting.

Setup

Start by installing the papa and setproctitle modules:

pip install papa
pip install setproctitle

The setproctitle module is optional. It will be used if present to rename the Papa daemon for top and ps to something like "papa daemon from circusd". If you do not install the setproctitle module, that title will be the command line of the process that launched it. Very confusing.

Once Papa is installed, add use_papa=true to your critical processes and sockets. Generally you want to house all of the processes of your stack in Papa, and none of the Circus support processes such as the flapping and stats plugins.

[circus]
loglevel = info
[watcher:nginx]
cmd = /usr/local/nginx/sbin/nginx -p /Users/scottmax/Source/service-framework/Common/conf/nginx -c /Users/scottmax/Source/service-framework/Common/conf/nginx/nginx.conf
warmup_delay = 3
graceful_timeout = 10
max_retry = 5
singleton = true
send_hup = true
stop_signal = QUIT
stdout_stream.class = FileStream
stdout_stream.filename = /var/logs/web-server.log
stdout_stream.max_bytes = 10000000
stdout_stream.backup_count = 10
stderr_stream.class = FileStream
stderr_stream.filename = /var/logs/web-server-error.log
stderr_stream.max_bytes = 1000000
stderr_stream.backup_count = 10
active = true
use_papa = true
[watcher:logger]
cmd = /my_service/env/bin/python logger.py run
working_dir = /my_service
graceful_timeout = 10
singleton = true
stop_signal = INT
stdout_stream.class = FileStream
stdout_stream.filename = /var/logs/logger.log
stdout_stream.max_bytes = 10000000
stdout_stream.backup_count = 10
stderr_stream.class = FileStream
stderr_stream.filename = /var/logs/logger.log
stderr_stream.max_bytes = 1000000
stderr_stream.backup_count = 10
priority = 50
use_papa = true
[watcher:web_app]
cmd = /my_service/env/bin/uwsgi --ini uwsgi-live.ini --socket fd://$(circus.sockets.web) --stats 127.0.0.1:809$(circus.wid)
working_dir = /my_service/web_app
graceful_timeout=10
stop_signal = QUIT
use_sockets = True
stdout_stream.class = FileStream
stdout_stream.filename = /var/logs/web_app.log
stdout_stream.max_bytes = 10000000
stdout_stream.backup_count = 10
stderr_stream.class = FileStream
stderr_stream.filename = /var/logs/web_app.log
stderr_stream.max_bytes = 1000000
stderr_stream.backup_count = 10
hooks.after_spawn = examples.uwsgi_lossless_reload.children_started
hooks.before_signal = examples.uwsgi_lossless_reload.clean_stop
hooks.extended_stats = examples.uwsgi_lossless_reload.extended_stats
priority = 40
use_papa = true
[socket:web]
path = /my_service/sock/uwsgi
use_papa = true
[plugin:flapping]
use = circus.plugins.flapping.Flapping
window = 10
priority = 1000

NOTE:

If the Papa processes use any sockets, those sockets must also use papa.

Design Goal

Papa is designed to be very minimalist in features and requirements. It does:

Start and stop sockets
Provide a key/value store
Start processes and return stdout, stderr and the exit code

It does not:

Restart processes
Provide a way to stop processes
Provide any information about processes other than whether or not they are still running

Papa requires no third-party libraries so it can run on just the standard Python library. It can make use of the setproctitle package but that is only used for making the title prettier for ps and top and is not essential.

The functionality has been kept to a minimum so that you should never need to restart the Papa daemon. As much of the functionality has been pushed to the client library as possible. That way you should be able to deploy a new copy of Papa for new client features without needing to restart the Papa daemon. Papa is meant to be a pillar of stability in a changing sea of 3rd party libraries.

Operation

Most things remain unchanged whether you use Papa or not. You can still start and stop processes. You can still get status and stats for processes. The main thing that changes is that when you do circusctl quit, all of the Papa processes are left running. When you start circusd back up, those processes are recovered.

NOTE:

When processes are recovered, before_start and before_spawn hooks are skipped.

Logging

While Circus is shut down, Papa will store up to 2M of output per process. Then it will start dumping the oldest data. When you restart Circus, this cached output will be quickly retrieved and sent to the output streams. Papa requires that receipt of output be acknowledged, so you should not lose any output during a shutdown.

Not only that, but Papa saves the timestamp of the output. Circus has been enhanced to take advantage of timestamp data if present. So if you are writing the output to log files or somewhere, your timestamps should all be correct.

Problems

If you use the incr or decr command to change the process count for a watcher, this will be reset to the level specified in the INI file when circusd is restarted.

Also, I have experienced problems with the combination of copy_env and virtualenv. You may note that the INI sample above circumvents this issue with explicit paths.

Telnet Interface

Papa has a basic command-line interface that you can access through telnet:

telnet localhost 20202
help

Circus for developers

Using Circus as a library

Circus provides high-level classes and functions that will let you manage processes in your own applications.

For example, if you want to run four processes forever, you could write:

from circus import get_arbiter
myprogram = {"cmd": "python myprogram.py", "numprocesses": 4}
arbiter = get_arbiter([myprogram])
try:


    arbiter.start()
finally:


    arbiter.stop()

This snippet will run four instances of myprogram and watch them for you, restarting them if they die unexpectedly.

To learn more about this, see library

Extending Circus

It's easy to extend Circus to create a more complex system, by listening to all the circusd events via its pub/sub channel, and driving it via commands.

That's how the flapping feature works for instance: it listens to all the processes dying, measures how often it happens, and stops the incriminated watchers after too many restarts attempts.

Circus comes with a plugin system to help you write such extensions, and a few built-in plugins you can reuse. See plugins.

You can also have a more subtile startup and shutdown behavior by using the hooks system that will let you run arbitrary code before and after some processes are started or stopped. See hooks.

Last but not least, you can also add new commands. See addingcmds.

Developers Documentation Index

Circus Library

The Circus package is composed of a high-level get_arbiter() function and many classes. In most cases, using the high-level function should be enough, as it creates everything that is needed for Circus to run.

You can subclass Circus' classes if you need more granularity than what is offered by the configuration.

The get_arbiter function

get_arbiter() is just a convenience on top of the various circus classes. It creates a arbiter (class Arbiter) instance with the provided options, which in turn runs a single Watcher with a single Process.

circus.get_arbiter()

Example:

from circus import get_arbiter
arbiter = get_arbiter([{"cmd": "myprogram", "numprocesses": 3}])
try:


    arbiter.start()
finally:


    arbiter.stop()

Classes

Circus provides a series of classes you can use to implement your own process manager:

Process: wraps a running process and provides a few helpers on top of it.
Watcher: run several instances of Process against the same command. Manage the death and life of processes.
Arbiter: manages several Watcher.

class circus.process.Process(name, wid, cmd, args=None, working_dir=None, shell=False, uid=None, gid=None, env=None, rlimits=None, executable=None, use_fds=False, watcher=None, spawn=True, pipe_stdout=True, pipe_stderr=True, close_child_stdout=False, close_child_stderr=False): Wraps a process.
Options:

wid: the process unique identifier. This value will be used to replace the $WID string in the command line if present.
cmd: the command to run. May contain any of the variables available that are being passed to this class. They will be replaced using the python format syntax.
args: the arguments for the command to run. Can be a list or a string. If args is a string, it's splitted using shlex.split(). Defaults to None.
executable: When executable is given, the first item in the args sequence obtained from cmd is still treated by most programs as the command name, which can then be different from the actual executable name. It becomes the display name for the executing program in utilities such as ps.
working_dir: the working directory to run the command in. If not provided, will default to the current working directory.
shell: if True, will run the command in the shell environment. False by default. warning: this is a security hazard.
uid: if given, is the user id or name the command should run with. The current uid is the default.
gid: if given, is the group id or name the command should run with. The current gid is the default.
env: a mapping containing the environment variables the command will run with. Optional.
rlimits: a mapping containing rlimit names and values that will be set before the command runs.
use_fds: if True, will not close the fds in the subprocess. Must be be set to True on Windows if stdout or stderr are redirected. default: False.
pipe_stdout: if True, will open a PIPE on stdout. default: True.
pipe_stderr: if True, will open a PIPE on stderr. default: True.
close_child_stdout: If True, redirects the child process' stdout to /dev/null after the fork. default: False.
close_child_stderr: If True, redirects the child process' stdout to /dev/null after the fork. default: False.

age(): Return the age of the process in seconds.

children(): Return a list of children pids.

info(): Return process info.
The info returned is a mapping with these keys:

mem_info1: Resident Set Size Memory in bytes (RSS)
mem_info2: Virtual Memory Size in bytes (VMS).
cpu: % of cpu usage.
mem: % of memory usage.
ctime: process CPU (user + system) time in seconds.
pid: process id.
username: user name that owns the process.
nice: process niceness (between -20 and 20)
cmdline: the command line the process was run with.

is_child(pid): Return True is the given pid is a child of that process.

pid: Return the pid

send_signal(*args, **kw): Sends a signal sig to the process.

send_signal_child(*args, **kw): Send signal signum to child pid.

send_signal_children(*args, **kw): Send signal signum to all children.

status: Return the process status as a constant

RUNNING
DEAD_OR_ZOMBIE
UNEXISTING
OTHER

stderr: Return the stdout stream

stdout: Return the stdout stream

stop(*args, **kw)

Stop the process and close stdout/stderr

If the corresponding process is still here (normally it's already killed by the watcher), a SIGTERM is sent, then a SIGKILL after 1 second.

The shutdown process (SIGTERM then SIGKILL) is normally taken by the watcher. So if the process is still there here, it's a kind of bad behavior because the graceful timeout won't be respected here.

Example:

>>> from circus.process import Process
>>> process = Process('Top', 'top', shell=True)
>>> process.age()
3.0107998847961426
>>> process.info()
'Top: 6812  N/A tarek Zombie N/A N/A N/A N/A N/A'
>>> process.status
1
>>> process.stop()
>>> process.status
2
>>> process.info()
'No such process (stopped?)'

class circus.watcher.Watcher(name, cmd, args=None, numprocesses=1, warmup_delay=0.0, working_dir=None, shell=False, shell_args=None, uid=None, max_retry=5, gid=None, send_hup=False, stop_signal=15, stop_children=False, env=None, graceful_timeout=30.0, prereload_fn=None, rlimits=None, executable=None, stdout_stream=None, stderr_stream=None, priority=0, loop=None, singleton=False, use_sockets=False, copy_env=False, copy_path=False, max_age=0, max_age_variance=30, hooks=None, respawn=True, autostart=True, on_demand=False, virtualenv=None, close_child_stdout=False, close_child_stderr=False, virtualenv_py_ver=None, use_papa=False, **options): Class managing a list of processes for a given command.
Options:

name: name given to the watcher. Used to uniquely identify it.
cmd: the command to run. May contain $WID, which will be replaced by wid.
args: the arguments for the command to run. Can be a list or a string. If args is a string, it's splitted using shlex.split(). Defaults to None.
numprocesses: Number of processes to run.
working_dir: the working directory to run the command in. If not provided, will default to the current working directory.
shell: if True, will run the command in the shell environment. False by default. warning: this is a security hazard.
uid: if given, is the user id or name the command should run with. The current uid is the default.
gid: if given, is the group id or name the command should run with. The current gid is the default.
send_hup: if True, a process reload will be done by sending the SIGHUP signal. Defaults to False.
stop_signal: the signal to send when stopping the process. Defaults to SIGTERM.
stop_children: send the stop_signal to the children too. Defaults to False.
env: a mapping containing the environment variables the command will run with. Optional.
rlimits: a mapping containing rlimit names and values that will be set before the command runs.
stdout_stream: a mapping that defines the stream for the process stdout. Defaults to None.
Optional. When provided, stdout_stream is a mapping containing up to three keys:

class: the stream class. Defaults to circus.stream.FileStream
filename: the filename, if using a FileStream
max_bytes: maximum file size, after which a new output file is opened. defaults to 0 which means no maximum size (only applicable with FileStream).
backup_count: how many backups to retain when rotating files according to the max_bytes parameter. defaults to 0 which means no backups are made (only applicable with FileStream)

This mapping will be used to create a stream callable of the specified class. Each entry received by the callable is a mapping containing:

pid - the process pid
name - the stream name (stderr or stdout)
data - the data

This is not supported on Windows.

•: stderr_stream: a mapping that defines the stream for the process stderr. Defaults to None.
Optional. When provided, stderr_stream is a mapping containing up to three keys: - class: the stream class. Defaults to circus.stream.FileStream - filename: the filename, if using a FileStream - max_bytes: maximum file size, after which a new output file is

opened. defaults to 0 which means no maximum size (only applicable with FileStream)

•: backup_count: how many backups to retain when rotating files according to the max_bytes parameter. defaults to 0 which means no backups are made (only applicable with FileStream).

This mapping will be used to create a stream callable of the specified class.

Each entry received by the callable is a mapping containing:

pid - the process pid
name - the stream name (stderr or stdout)
data - the data

This is not supported on Windows.

priority -- integer that defines a priority for the watcher. When the Arbiter do some operations on all watchers, it will sort them with this field, from the bigger number to the smallest. (default: 0)
singleton -- If True, this watcher has a single process. (default:False)
use_sockets -- If True, the processes will inherit the file descriptors, thus can reuse the sockets opened by circusd. (default: False)
on_demand -- If True, the processes will be started only at the first connection to the socket (default: False)
copy_env -- If True, the environment in which circus is running run will be reproduced for the workers. This defaults to True on Windows as you cannot run any executable without the SYSTEMROOT variable. (default: False)
copy_path -- If True, circusd sys.path is sent to the process through PYTHONPATH. You must activate copy_env for copy_path to work. (default: False)
max_age: If set after around max_age seconds, the process is replaced with a new one. (default: 0, Disabled)
max_age_variance: The maximum number of seconds that can be added to max_age. This extra value is to avoid restarting all processes at the same time. A process will live between max_age and max_age + max_age_variance seconds.
hooks: callback functions for hooking into the watcher startup and shutdown process. hooks is a dict where each key is the hook name and each value is a 2-tuple with the name of the callable or the callabled itself and a boolean flag indicating if an exception occuring in the hook should not be ignored. Possible values for the hook name: before_start, after_start, before_spawn, after_spawn, before_stop, after_stop., before_signal, after_signal or extended_stats.
options -- extra options for the worker. All options found in the configuration file for instance, are passed in this mapping -- this can be used by plugins for watcher-specific options.
respawn -- If set to False, the processes handled by a watcher will not be respawned automatically. (default: True)
virtualenv -- The root directory of a virtualenv. If provided, the watcher will load the environment for its execution. (default: None)
close_child_stdout: If True, closes the stdout after the fork. default: False.
close_child_stderr: If True, closes the stderr after the fork. default: False.
use_papa: If True, use the papa process kernel for this process. default: False.

kill_process(*args, **kwargs): Kill process (stop_signal, graceful_timeout then SIGKILL)

kill_processes(*args, **kwargs): Kill all processes (stop_signal, graceful_timeout then SIGKILL)

manage_processes(*args, **kwargs): Manage processes.

notify_event(topic, msg): Publish a message on the event publisher channel

reap_and_manage_processes(*args, **kwargs): Reap & manage processes.

reap_processes(*args, **kw): Reap all the processes for this watcher.

send_signal_child(*args, **kw): Send signal to a child.

spawn_process(recovery_wid=None): Spawn process.
Return True if ok, False if the watcher must be stopped

spawn_processes(*args, **kwargs): Spawn processes.

class circus.arbiter.Arbiter(watchers, endpoint, pubsub_endpoint, check_delay=1.0, prereload_fn=None, context=None, loop=None, statsd=False, stats_endpoint=None, statsd_close_outputs=False, multicast_endpoint=None, plugins=None, sockets=None, warmup_delay=0, httpd=False, httpd_host='localhost', httpd_port=8080, httpd_close_outputs=False, debug=False, debug_gc=False, ssh_server=None, proc_name='circusd', pidfile=None, loglevel=None, logoutput=None, loggerconfig=None, fqdn_prefix=None, umask=None, endpoint_owner=None, papa_endpoint=None): Class used to control a list of watchers.
Options:

watchers -- a list of Watcher objects
endpoint -- the controller ZMQ endpoint
pubsub_endpoint -- the pubsub endpoint
statsd -- If True, a circusd-stats process is run (default: False)
stats_endpoint -- the stats endpoint.
statsd_close_outputs -- if True sends the circusd-stats stdout/stderr to /dev/null (default: False)
multicast_endpoint -- the multicast endpoint for circusd cluster auto-discovery (default: udp://237.219.251.97:12027) Multicast addr should be between 224.0.0.0 to 239.255.255.255 and the same for the all cluster.
check_delay -- the delay between two controller points (default: 1 s)
prereload_fn -- callable that will be executed on each reload (default: None)
context -- if provided, the zmq context to reuse. (default: None)

loop: if provided, a zmq.eventloop.ioloop.IOLoop instance: to reuse. (default: None)

•: plugins -- a list of plugins. Each item is a mapping with:

use -- Fully qualified name that points to the plugin class
every other value is passed to the plugin in the config option

sockets -- a mapping of sockets. Each key is the socket name, and each value a CircusSocket class. (default: None)
warmup_delay -- a delay in seconds between two watchers startup. (default: 0)
httpd -- If True, a circushttpd process is run (default: False)
httpd_host -- the circushttpd host (default: localhost)
httpd_port -- the circushttpd port (default: 8080)
httpd_close_outputs -- if True, sends circushttpd stdout/stderr to /dev/null. (default: False)
debug -- if True, adds a lot of debug info in the stdout (default: False)
debug_gc -- if True, does gc.set_debug(gc.DEBUG_LEAK) (default: False) to circusd to analyze problems (default: False)
proc_name -- the arbiter process name

fqdn_prefix -- a prefix for the unique identifier of the circus: instance on the cluster.

endpoint_owner -- unix user to chown the endpoint to if using ipc.
papa_endpoint -- the papa process kernel endpoint

add_watcher(*args, **kwargs): Adds a watcher.
Options:

name: name of the watcher to add
cmd: command to run.
all other options defined in the Watcher constructor.

get_watcher(name): Return the watcher name.

numprocesses(): Return the number of processes running across all watchers.

numwatchers(): Return the number of watchers.

reload(*args, **kwargs): Reloads everything.
Run the prereload_fn() callable if any, then gracefuly reload all watchers.

start(*args, **kwargs)

Starts all the watchers.

If the ioloop has been provided during __init__() call, starts all watchers as a standard coroutine

If the ioloop hasn't been provided during __init__() call (default), starts all watchers and the eventloop (and blocks here). In this mode the method MUST NOT yield anything because it's called as a standard method.

Parameters: cb -- Callback called after all the watchers have been started, when the loop hasn't been provided.

Writing plugins

Circus comes with a plugin system which lets you interact with circusd.

NOTE:

We might add circusd-stats support to plugins later on.

A Plugin is composed of two parts:

a ZMQ subscriber to all events published by circusd
a ZMQ client to send commands to circusd

Each plugin is run as a separate process under a custom watcher.

A few examples of some plugins you could create with this system:

a notification system that sends e-mail alerts when a watcher is flapping
a logger
a tool that adds or removes processes depending on the load
etc.

Circus itself comes with a few built-in plugins.

The CircusPlugin class

Circus provides a base class to help you implement plugins: circus.plugins.CircusPlugin

class circus.plugins.CircusPlugin(endpoint, pubsub_endpoint, check_delay, ssh_server=None, **config): Base class to write plugins.
Options:

context -- the ZMQ context to use
endpoint -- the circusd ZMQ endpoint
pubsub_endpoint -- the circusd ZMQ pub/sub endpoint
check_delay -- the configured check delay
config -- free config mapping

call(command, **props): Sends the command to circusd
Options:

command -- the command to call
props -- keyword arguments to add to the call

Returns the JSON mapping sent back by circusd

cast(command, **props): Fire-and-forget a command to circusd
Options:

command -- the command to call
props -- keyword arguments to add to the call

handle_init(): Called right before a plugin is started - in the thread context.

handle_recv(data): Receives every event published by circusd
Options:

•: data -- a tuple containing the topic and the message.

handle_stop(): Called right before the plugin is stopped by Circus.

When initialized by Circus, this class creates its own event loop that receives all circusd events and pass them to handle_recv(). The data received is a tuple containing the topic and the data itself.

handle_recv() must be implemented by the plugin.

The call() and cast() methods can be used to interact with circusd if you are building a Plugin that actively interacts with the daemon.

handle_init() and handle_stop() are just convenience methods you can use to initialize and clean up your code. handle_init() is called within the thread that just started. handle_stop() is called in the main thread just before the thread is stopped and joined.

Writing a plugin

Let's write a plugin that logs in a file every event happening in circusd. It takes one argument which is the filename.

The plugin may look like this:

from circus.plugins import CircusPlugin
class Logger(CircusPlugin):


    name = 'logger'


    def __init__(self, *args, **config):


        super(Logger, self).__init__(*args, **config)


        self.filename = config.get('filename')


        self.file = None


    def handle_init(self):


        self.file = open(self.filename, 'a+', buffering=1)


    def handle_stop(self):


        self.file.close()


    def handle_recv(self, data):


        watcher_name, action, msg = self.split_data(data)


        msg_dict = self.load_message(msg)


        self.file.write('%s %s::%r\n' % (action, watcher_name, msg_dict))

That's it ! This class can be saved in any package/module, as long as it can be seen by Python.

For example, Logger may be found in a plugins module within a myproject package.

Async requests

In case you want to make any asynchronous operations (like a Tornado call or using periodicCall) make sure you are using the right loop. The loop you always want to be using is self.loop as it gets set up by the base class. The default loop often isn't the same and therefore code might not get executed as expected.

Trying a plugin

You can run a plugin through the command line with the circus-plugin command, by specifying the plugin fully qualified name:

$ circus-plugin --endpoint tcp://127.0.0.1:5555 --pubsub tcp://127.0.0.1:5556 --config filename:circus-events.log myproject.plugins.Logger
[INFO] Loading the plugin...
[INFO] Endpoint: 'tcp://127.0.0.1:5555'
[INFO] Pub/sub: 'tcp://127.0.0.1:5556'
[INFO] Starting

Another way to run a plugin is to let Circus handle its initialization. This is done by adding a [plugin:NAME] section in the configuration file, where NAME is a unique name for your plugin:

[plugin:logger]
use = myproject.plugins.Logger
filename = /var/myproject/circus.log

use is mandatory and points to the fully qualified name of the plugin.

When Circus starts, it creates a watcher with one process that runs the pointed class, and pass any other variable contained in the section to the plugin constructor via the config mapping.

You can also programmatically add plugins when you create a circus.arbiter.Arbiter class or use circus.get_arbiter(), see library.

Performances

Since every plugin is loaded in its own process, it should not impact the overall performances of the system as long as the work done by the plugin is not doing too many calls to the circusd process.

Hooks

Circus provides hooks that can be used to trigger actions upon watcher events. Available hooks are:

before_start: called before the watcher is started. If the hook returns False the startup is aborted.
after_start: called after the watcher is started. If the hook returns False the watcher is immediately stopped and the startup is aborted.
before_spawn: called before the watcher spawns a new process. If the hook returns False the watcher is immediately stopped and the startup is aborted.
after_spawn: called after the watcher spawns a new process. If the hook returns False the watcher is immediately stopped and the startup is aborted.
before_stop: called before the watcher is stopped. The hook result is ignored.
after_stop: called after the watcher is stopped. The hook result is ignored.
before_signal: called before a signal is sent to a watcher's process. If the hook returns False the signal is not sent (except SIGKILL which is always sent)
after_signal: called after a signal is sent to a watcher's process.
extended_stats: called when stats are requested with extended=True. Used for adding process-specific stats to the regular stats output.

Example

A typical use case is to control that all the conditions are met for a process to start. Let's say you have a watcher that runs Redis and a watcher that runs a Python script that works with Redis. With Circus you can order the startup by using the priority option:

[watcher:queue-worker]
cmd = python -u worker.py
priority = 1
[watcher:redis]
cmd = redis-server
priority = 2

With this setup, Circus will start Redis first and then it will start the queue worker. But Circus does not really control that Redis is up and running. It just starts the process it was asked to start. What we miss here is a way to control that Redis is started and fully functional. A function that controls this could be:

import redis
import time
def check_redis(*args, **kw):


    time.sleep(.5)  # give it a chance to start


    r = redis.StrictRedis(host='localhost', port=6379, db=0)


    r.set('foo', 'bar')


    return r.get('foo') == 'bar'

This function can be plugged into Circus as an before_start hook:

[watcher:queue-worker]
cmd = python -u worker.py
hooks.before_start = mycoolapp.myplugins.check_redis
priority = 1
[watcher:redis]
cmd = redis-server
priority = 2

Once Circus has started the redis watcher, it will start the queue-worker watcher, since it follows the priority ordering. Just before starting the second watcher, it will run the check_redis function, and in case it returns False will abort the watcher starting process.

Hook signature

A hook must follow this signature:

def hook(watcher, arbiter, hook_name, **kwargs):


    ...


    # If you don't return True, the hook can change


    # the behavior of circus (depending on the hook)


    return True

Where watcher is the Watcher class instance, arbiter the Arbiter one, hook_name the hook name and kwargs some additional optional parameters (depending on the hook type).

The after_spawn hook adds the pid parameters:

def after_spawn(watcher, arbiter, hook_name, pid, **kwargs):


    ...


    # If you don't return True, circus will kill the process


    return True

Where pid is the PID of the corresponding process.

Likewise, before_signal and after_signal hooks add pid and signum:

def before_signal_hook(watcher, arbiter, hook_name, pid, signum, **kwargs):


    ...


    # If you don't return True, circus won't send the signum signal


    # (SIGKILL is always sent)


    return True

Where pid is the PID of the corresponding process and signum is the corresponding signal.

You can ignore those but being able to use the watcher and/or arbiter data and methods can be useful in some hooks.

Note that hooks are called with named arguments. So use the hook signature without changing argument names.

The extended_stats hook has its own additional parameters in kwargs:

def extended_stats_hook(watcher, arbiter, hook_name, pid, stats, **kwargs):


    ...

Where pid is the PID of the corresponding process and stats the regular stats to be returned. Add your own stats into stats. An example is in examples/uwsgi_lossless_reload.py.

As a last example, here is a super hook which can deal with all kind of signals:

def super_hook(watcher, arbiter, hook_name, **kwargs):


    pid = None


    signum = None


    if hook_name in ('before_signal', 'after_signal'):


        pid = kwargs['pid']


        signum = kwargs['signum']


    ...


    return True

Hook events

Everytime a hook is run, its result is notified as an event in Circus.

There are two events related to hooks:

hook_success: a hook was successfully called. The event keys are name the name if the event, and time: the date of the events.
hook_failure: a hook has failed. The event keys are name the name if the event, time: the date of the events and error: the exception that occurred in the event, if any.

Adding new commands

We tried to make adding new commands as simple as possible.

You need to do three things:

1.: create a your_command.py file under circus/commands/.
2.: Implement a single class in there, with predefined methods
3.: Add the new command in circus/commands/__init__.py.

Let's say we want to add a command which returns the number of watchers currently in use, we would do something like this (extensively commented to allow you to follow more easily):

from circus.commands.base import Command
from circus.exc import ArgumentError, MessageError
class NumWatchers(Command):


    """It is a good practice to describe what the class does here.


    Have a look at other commands to see how we are used to format


    this text. It will be automatically included in the documentation,


    so don't be affraid of being exhaustive, that's what it is made


    for.


    """


    # all the commands inherit from `circus.commands.base.Command`


    # you need to specify a name so we find back the command somehow


    name = "numwatchers"


    # Set waiting to True or False to define your default behavior


    # - If waiting is True, the command is run synchronously, and the client may get


    #   back results.


    # - If waiting is False, the command is run asynchronously on the server and the client immediately


    #   gets back an 'ok' response


    #


    #   By default, commands are set to waiting = False


    waiting = True


    # options


    options = [('', 'optname', default_value, 'description')]


    properties = ['foo', 'bar']


    # properties list the command arguments that are mandatory. If they are


    # not provided, then an error will be thrown


    def execute(self, arbiter, props):


        # the execute method is the core of the command: put here all the


        # logic of the command and return a dict containing the values you


        # want to return, if any


        return {"numwatchers": arbiter.numwatchers()}


    def console_msg(self, msg):


        # msg is what is returned by the execute method.


        # this method is used to format the response for a console (it is


        # used for instance by circusctl to print its messages)


        return "a string that will be displayed"


    def message(self, *args, **opts):


        # message handles console input.


        # this method is used to map console arguments to the command


        # options. (its is used for instance when calling the command via


        # circusctl)


        # NotImplementedError will be thrown if the function is missing


        numArgs = 1


        if not len(args) == numArgs:


            raise ArgumentError('Invalid number of arguments.')


        else:


            opts['optname'] = args[0]


        return self.make_message(**opts)


    def validate(self, props):


        # this method is used to validate that the arguments passed to the


        # command are correct. An ArgumentError should be thrown in case


        # there is an error in the passed arguments (for instance if they


        # do not match together.


        # In case there is a problem wrt their content, a MessageError


        # should be thrown. This method can modify the content of the props


        # dict, it will be passed to execute afterwards.

Use cases examples

This chapter presents a few use cases, to give you an idea on how to use Circus in your environment.

Running a WSGI application

Running a WSGI application with Circus is quite interesting because you can watch & manage your web workers using circus-top, circusctl or the Web interface.

This is made possible by using Circus sockets. See whycircussockets.

Let's take an example with a minimal Pyramid application:

from pyramid.config import Configurator
from pyramid.response import Response
def hello_world(request):


    return Response('Hello %(name)s!' % request.matchdict)
config = Configurator()
config.add_route('hello', '/hello/{name}')
config.add_view(hello_world, route_name='hello')
application = config.make_wsgi_app()

Save this script into an app.py file, then install those projects:

$ pip install Pyramid
$ pip install chaussette

Next, make sure you can run your Pyramid application using the chaussette console script:

$ chaussette app.application
Application is <pyramid.router.Router object at 0x10a4d4bd0>
Serving on localhost:8080
Using <class 'chaussette.backend._waitress.Server'> as a backend

And check that you can reach it by visiting http://localhost:8080/hello/tarek

Now that your application is up and running, let's create a Circus configuration file:

[circus]
check_delay = 5
endpoint = tcp://127.0.0.1:5555
pubsub_endpoint = tcp://127.0.0.1:5556
stats_endpoint = tcp://127.0.0.1:5557
[watcher:webworker]
cmd = chaussette --fd $(circus.sockets.webapp) app.application
use_sockets = True
numprocesses = 3
[socket:webapp]
host = 127.0.0.1
port = 8080

This file tells Circus to bind a socket on port 8080 and run chaussette workers on that socket -- by passing its fd.

Save it to server.ini and try to run it using circusd

$ circusd server.ini
[INFO] Starting master on pid 8971
[INFO] sockets started
[INFO] circusd-stats started
[INFO] webapp started
[INFO] Arbiter now waiting for commands

Make sure you still get the app on http://localhost:8080/hello/tarek.

Congrats ! you have a WSGI application running 3 workers.

You can run the circushttpd or the cli, and enjoy Circus management.

Running a Django application

Running a Django application is done exactly like running a WSGI application. Use the PYTHONPATH to import the directory the project is in, the directory that contains the directory that has settings.py in it (with Django 1.4+ this directory has manage.py in it) :

[socket:dwebapp]
host = 127.0.0.1
port = 8080
[watcher:dwebworker]
cmd = chaussette --fd $(circus.sockets.dwebapp) dproject.wsgi.application
use_sockets = True
numprocesses = 2
[env:dwebworker]
PYTHONPATH = /path/to/parent-of-dproject

If you need to pass the DJANGO_SETTINGS_MODULE for a backend worker for example, you can pass that also though the env configation option:

[watcher:dbackend]
cmd = /path/to/script.py
numprocesses=3
[env:dbackend]
PYTHONPATH = /path/to/parent-of-dproject
DJANGO_SETTINGS_MODULE=dproject.settings

See http://chaussette.readthedocs.org for more about chaussette.

Design decisions

Overall architecture

[image]

Circus is composed of a main process called circusd which takes care of running all the processes. Each process managed by Circus is a child process of circusd.

Processes are organized in groups called watchers. A watcher is basically a command circusd runs on your system, and for each command you can configure how many processes you want to run.

The concept of watcher is useful when you want to manage all the processes running the same command -- like restart them, etc.

circusd binds two ZeroMQ sockets:

REQ/REP -- a socket used to control circusd using json-based commands.
PUB/SUB -- a socket where circusd publishes events, like when a process is started or stopped.

NOTE:

Despite its name, ZeroMQ is not a queue management system. Think of it as an inter-process communication (IPC) library.

Another process called circusd-stats is run by circusd when the option is activated. circusd-stats's job is to publish CPU/Memory usage statistics in a dedicated PUB/SUB channel.

This specialized channel is used by circus-top and circus-httpd to display a live stream of the activity.

circus-top is a console script that mimics top to display all the CPU and Memory usage of the processes managed by Circus.

circus-httpd is the web managment interface that will let you interact with Circus. It displays a live stream using web sockets and the circusd-stats channel, but also let you interact with circusd via its REQ/REP channel.

Last but not least, circusctl is a command-line tool that let you drive circusd via its REQ/REP channel.

You can also have plugins that subscribe to circusd's PUB/SUB channel and let you send commands to the REQ/REP channel like circusctl would.

Security

Circus is built on the top of the ZeroMQ library and comes with no security at all in its protocols. However, you can run a Circus system on a server and set up an SSH tunnel to access it from another machine.

This section explains what Circus does on your system when you run it, and ends up describing how to use an SSH tunnel.

You can also read http://www.zeromq.org/area:faq#toc5

TCP ports

By default, Circus opens the following TCP ports on the local host:

5555 -- the port used to control circus via circusctl
5556 -- the port used for the Publisher/Subscriber channel.
5557 -- the port used for the statistics channel -- if activated.
8080 -- the port used by the Web UI -- if activated.

These ports allow client apps to interact with your Circus system, and depending on how your infrastructure is organized, you may want to protect these ports via firewalls or configure Circus to run using IPC ports.

Here's an example of running Circus using only IPC entry points:

[circus]
check_delay = 5
endpoint = ipc:///var/circus/endpoint
pubsub_endpoint = ipc:///var/circus/pubsub
stats_endpoint = ipc:///var/circus/stats

When Configured using IPC, the commands must be run from the same box, but no one can access them from outside, unlike using TCP. The commands must also be run as a user that has write access to the ipc socket paths. You can modify the owner of the endpoint using the endpoint_owner config option. This allows you to run circusd as the root user, but allow non-root processes to send commands to circusd. Note that when using endpoint_owner, in order to prevent non-root processes from being able to start arbitrary processes that run with greater privileges, the add command will enforce that new Watchers must run as the endpoint_owner user. Watcher definitions in the local config files will not be restricted this way.

Of course, if you activate the Web UI, the 8080 port will still be open.

circushttpd

When you run circushttpd manually, or when you use the httpd option in the ini file like this:

[circus]
check_delay = 5
endpoint = ipc:///var/circus/endpoint
pubsub_endpoint = ipc:///var/circus/pubsub
stats_endpoint = ipc:///var/circus/stats
httpd = 1

The web application will run on port 8080 and will let anyone accessing the web page manage the circusd daemon.

That includes creating new watchers that can run any command on your system !

Do not make it publicly available

If you want to protect the access to the web panel, you can serve it behind Nginx or Apache or any proxy-capable web server, that can take care of the security.

User and Group Permissions

By default, all processes started with Circus will be running with the same user and group as circusd. Depending on the privileges the user has on the system, you may not have access to all the features Circus provides.

For instance, some statistics features on a running processes require extended privileges. Typically, if the CPU usage numbers you get using the stats command are N/A, it means your user can't access the proc files. This will be the case by default under Mac OS X.

You may run circusd as root to fix this, and set the uid and gid values for each watcher to get all the features.

But beware that running circusd as root exposes you to potential privilege escalation bugs. While we're doing our best to avoid any bugs, running as root and facing a bug that performs unwanted actions on your system may be dangerous.

The best way to prevent this is to make sure that the system running Circus is completely isolated (like a VM) or to run the whole system under a controlled user.

SSH tunneling

Clients can connect to a circusd instance by creating an SSH tunnel. To do so, pass the command line option --ssh followed by user@address, where user is the user on the remote server and address is the server's address as seen by the client. The SSH protocol will require credentials to complete the login.

If circusd as seen by the SSH server is not at the default endpoint address localhost:5555 then specify the circusd address using the option --endpoint

Secured setup example

Setting up a secured Circus server can be done by:

Running an SSH Server
Running Apache or Nginx on the 80 port, and doing a reverse-proxy on the 8080 port.
Blocking the 8080 port from outside access.
Running all ZMQ Circusd ports using IPC files instead of TCP ports, and tunneling all calls via SSH.

[image]

Contributing to Circus

Circus has been started at Mozilla but its goal is not to stay only there. We're trying to build a tool that's useful for others, and easily extensible.

We really are open to any contributions, in the form of code, documentation, discussions, feature proposal etc.

You can start a topic in our mailing list : http://tech.groups.yahoo.com/group/circus-dev/

Or add an issue in our bug tracker

Fixing typos and enhancing the documentation

It's totally possible that your eyes are bleeding while reading this half-english half-french documentation, don't hesitate to contribute any rephrasing / enhancement on the form in the documentation. You probably don't even need to understand how Circus works under the hood to do that.

Adding new features

New features are of course very much appreciated. If you have the need and the time to work on new features, adding them to Circus shouldn't be that complicated. We tried very hard to have a clean and understandable API, hope it serves the purpose.

You will need to add documentation and tests alongside with the code of the new feature. Otherwise we'll not be able to accept the patch.

How to submit your changes

We're using git as a DVCS. The best way to propose changes is to create a branch on your side (via git checkout -b branchname) and commit your changes there. Once you have something ready for prime-time, issue a pull request against this branch.

We are following this model to allow to have low coupling between the features you are proposing. For instance, we can accept one pull request while still being in discussion for another one.

Before proposing your changes, double check that they are not breaking anything! You can use the tox command to ensure this, it will run the testsuite under the different supported python versions.

Please use : http://issue2pr.herokuapp.com/ to reference a commit to an existing circus issue, if any.

Avoiding merge commits

Avoiding merge commits allows to have a clean and readable history. To do so, instead of doing "git pull" and letting git handling the merges for you, using git pull --rebase will put your changes after the changes that are commited in the branch, or when working on master.

That is, for us core developers, it's not possible anymore to use the handy github green button on pull requests if developers didn't rebased their work themselves or if we wait too much time between the request and the actual merge. Instead, the flow looks like this:

git remote add name repo-url
git fetch name
git checkout feature-branch
git rebase master
# check that everything is working properly and then merge on master
git checkout master
git merge feature-branch

Discussing

If you find yourself in need of any help while looking at the code of Circus, you can go and find us on irc at #circus-tent on irc.freenode.org (or if you don't have any IRC client, use the webchat)

You can also start a thread in our mailing list - http://tech.groups.yahoo.com/group/circus-dev

Frequently Asked Questions

Here is a list of frequently asked questions about Circus:

How does Circus stack compare to a classical stack?

In a classical WSGI stack, you have a server like Gunicorn that serves on a port or an unix socket and is usually deployed behind a web server like Nginx: [image]

Clients call Nginx, which reverse proxies all the calls to Gunicorn.

If you want to make sure the Gunicorn process stays up and running, you have to use a program like Supervisord or upstart.

Gunicorn in turn watches for its processes ("workers").

In other words you are using two levels of process managment. One that you manage and control (supervisord), and a second one that you have to manage in a different UI, with a different philosophy and less control over what's going on (the wsgi server's one)

This is true for Gunicorn and most multi-processes WSGI servers out there I know about. uWsgi is a bit different as it offers plethoras of options.

But if you want to add a Redis server in your stack, you will end up with managing your stack processes in two different places.

Circus' approach on this is to manage processes and sockets.

A Circus stack can look like this: [image]

So, like Gunicorn, Circus is able to bind a socket that will be proxied by Nginx. Circus don't deal with the requests but simply binds the socket. It's then up to a web worker process to accept connections on the socket and do the work.

It provides equivalent features than Supervisord but will also let you manage all processes at the same level, wether they are web workers or Redis or whatever. Adding a new web worker is done exactly like adding a new Redis process.

Benches

We did a few benches to compare Circus & Chaussette with Gunicorn. To summarize, Circus is not adding any overhead and you can pick up many different backends for your web workers.

See:

http://blog.ziade.org/2012/06/28/wgsi-web-servers-bench
http://blog.ziade.org/2012/07/03/wsgi-web-servers-bench-part-2

How to troubleshoot Circus?

By default, circusd keeps its logging to stdout rather sparse. This lack of output can make things hard to troubleshoot when processes seem to be having trouble starting.

To increase the logging circusd provides, try increasing the log level. To see the available log levels just use the --help flag.

$ circus --log-level debug test.ini

One word of warning. If a process is flapping and the debug log level is turned on, you will see messages for each start attempt. It might be helpful to configure the app that is flapping to use a warmup_delay to slow down the messages to a manageable pace.

[watcher:webapp]
cmd = python -m myapp.wsgi
warmup_delay = 5

By default, stdout and stderr are captured by the circusd process. If you are testing your config and want to see the output in line with the circusd output, you can configure your watcher to use the StdoutStream class.

[watcher:webapp]
cmd = python -m myapp.wsgi
stdout_stream.class = StdoutStream
stderr_stream.class = StdoutStream

If your application is producing a traceback or error when it is trying to start up you should be able to see it in the output.

Changelog history

0.12.1 - 2015-08-05

Fix error when restarting a watcher with an output stream - #913
Minor doc tweaks

0.12 - 2015-06-02

This release brings Python 3.4, Tornado 4 and Windows support, among several exciting features and fixes.

The Windows support is still experimental, and does not handle streams.

Major changes:

Compatibility with Python 3.4 - #768
Experimental Windows support - #788
Compatibility with Tornado 4 - #872
Revamped Debian packaging - #896 - #903
Add support for Papa process kernel - #850
Add globing and regex matching for starting, stopping and restarting watchers - #829 - #902

More changes:

Optimization of the shutdown - #784 - #842
Add possibility to specify virtualenv version for the watchers - #805
Add --nostop option to the rmwatcher command - #777
Add a callback to Arbiter.start - #840
Fix reloading watchers with uppercase letters - #823
Remove leaking socket in stats daemon - #843
Fix multicast on SunOS - #876
Close output streams when stopping a watcher - #885
Fix signal sending to grandchildren with --recursive - #888

0.11.1 - 2014-05-22

•: Fixed a regression that broke Circus on 2.6 - #782

0.11 - 2014-05-21

This release is not introducing a lot of features, and focused on making Circus more robust & stable.

Major changes/fixes:

Make sure we cannot execute two conflictings commands on the arbiter simultanously.
we have 2 new streams class: TimedRotatingFileStream, WatchedFileStream
we have one new hook: after_spawn hook
CircusPlugin is easier to use
fix autostart=False watchers during start (regression)

More changes:

circus messages can be routed to syslog now - #748
endpoint_owner option added so we can define which user owns ipc socket files created by circus.
Started Windows support (just circusctl for now)
fixed a lot of leaks in the tests
Allow case sensitive environment variables
The resource plugin now accepts absolute memory values - #609
Add support to the add command for the 'singleton' option - #767
Allow sending arbitrary signals to child procs via resource watcher - #756
Allow INI/JSON/YAML configuration for logging
Make sure we're compatible with psutil 2.x and 3.x
Added more metrics to the statsd provider - #698
Fixed multicast discovery - #731
Make start, restart and reload more uniform - #673
Correctly initialize all use groups - #635
improved tests stability
many, many more things....

0.10 - 2013-11-04

Major changes:

Now Python 3.2 & 3.3 compatible - #586
Moved the core to a fully async model - #569
Improved documentation - #622

More changes:

Added stop_signal & stop_children - #594
Make sure the watchdog plugin closes the sockets - #588
Switched to ZMQ JSON parser
IN not supported on all platforms - #573
Allow global environment substitutions in any config section - #560
Allow dashes in sections names - #546
Now variables are expanded everywhere in the config - #554
Added the CommandReloader plugin
Added before_signal & after_signal hooks
Allow flapping plugin to retry indefinitely
Don't respawn procs when the watcher is stopping - #529 - #536
Added a unique id for each client message - #517
worker ids are now "slots" -
Fixed the graceful shutdown behavior - #515
Make sure we can add watchers even if the arbiter is not started - #503
Make sure make sure we pop expired process - #510
Make sure the set command can set several hooks
Correctly support ipv6 sockets - #507
Allow custom options for stdout_stream and stderr_stream - #495
Added time_format for FileStream - #493
Added new socket config option to bind to a specific interface by name

0.9.3 - 2013-09-04

Make sure we can add watchers even if the arbiter is not started
Make sure we pop expired process
Make sure the set command can set one or several hooks
Correctly support ipv6 sockets and improvments of CircusSockets
Give path default value to prevent UnboundLocalError
Added a test for multicast_endpoint existence in Controller initialization
Not converting every string of digits to ints anymore
Add tests
No need for special cases when converting stdout_stream options
also accept umask as an argument for consistency
Allow custom options for stdout_stream and stderr_stream.
Add new socket config option to bind to a specific interface by name
Add time_format for FileStream + tests
Update circus.upstart

0.9.2 - 2013-07-17

When a PYTHONPATH is defined in a config file, it's loaded in sys.path so hooks can be located there - #477, #481
Use a single argument for add_callback so it works with PyZMQ < 13.1.x - see #478

0.9 - 2013-07-16

added [env] sections wildcards
added global [env] secrtion
fixed hidden exception when circus-web is not installed - #424
make sure incr/decr commands really us the nb option - #421
Fix watcher virtualenv site-packages not in PYTHONPATH
make sure we dont try to remove more processes than 0 - #429
updated bootstrap.py - #436
fixed multiplatform separator in pythonpath virtualenv watcher
refactored socket close function
Ensure env sections are applied to all watchers - #437
added the reloadconfig command
added circus.green and removed gevent from the core - #441, #452
silenced spurious stdout & warnings in the tests - #438
$(circus.env.*) can be used for all options in the config now
added a before_spawn hook
correct the path of circusd in systemd service file - #450
make sure we can change hooks and set streams via CLI - #455
improved doc
added a spawn_count stat in watcher
added min_cpu and min_mem parameters in ResourceWatcher plugin
added the FQDN information to the arbiter.

0.8.1 - 2013-05-28

circusd-stats was choking on unix sockets - #415
circusd-stats & circushttpd child processes stdout/stderr are now left open by default. Python <= 2.7.5 would choke in the logging module in case the 2/3 fds were closed - #415
Now redirecting to /dev/null in the child process instead of closing. #417

0.8 - 2013-05-24

Integrated log handlers into zmq io loop.
Make redirector restartable and subsequently more robust.
Uses zmq.green.eventloop when gevent is detected
Added support for CIRCUSCTL_ENDPOINT environment variable to circusctl - #396
util: fix bug in to_uid function - #397
Remove handler on ioloop error - #398.
Improved test coverage
Deprecated the 'service' option for the ResourceWatcher plugin - #404
removed psutil.error usage
Added UDP discovery in circusd - #407
Now allowing globs at arbitrary directory levels - #388
Added the 'statd' configuration option - #408
Add pidfile, logoutput and loglevel option to circus configuration file - #379
Added a tutorial in the docs.
make sure we're merging all sections when using include - #414
added pipe_stdout, pipe_stderr, close_child_stderr & close_child_stdout options to the Process class
added close_child_stderr & close_child_stdout options to the watcher

0.7.1 - 2013-05-02

Fixed the respawn option - #382
Make sure we use an int for the timeout - #380
display the unix sockets as well - #381
Make sure it works with the latest pyzmq
introduced a second syntax for the fd notation

0.7 - 2013-04-08

Fix get_arbiter example to use a dict for the watchers argument. #304
Add some troubleshooting documentation #323
Add python buildout support
Removed the gevent and the thread redirectors. now using the ioloop - fixes #346. Relates #340
circus.web is now its own project
removed the pyzmq patching
Allow the watcher to be configured but not started #283
Add an option to load a virtualenv site dir
added on_demand watchers
added doc about nginx+websockets #371
now properly parsing the options list of each command #369
Fixed circusd-stats events handling #372
fixed the overflow issue in circus-top #378
many more things...

0.6 - 2012-12-18

Patching protocols name for sockets - #248
Don't autoscale graphs. #240
circusctl: add per command help, from docstrings #217
Added workers hooks
Added Debian package - #227
Added Redis, HTTP Observer, Full stats & Resource plugins
Now processes can have titles
Added autocompletion
Added process/watcher age in the webui
Added SSH tunnel support
Now using pyzmq.green
Added upstart script & Varnish doc
Added environment variables & sections
Added unix sockets support
Added the respawn option to have single-run watchers
Now using tox in the tests
Allow socket substitution in args
New doc theme
New rotation options for streams: max_bytes/backup_count

0.5.2 - 2012-07-26

now patching the thread module from the stdlib to avoid some Python bugs - #203
better looking circusctl help screen
uses pustil get_nice() when available (nice was deprecated) - #208
added max_age support - #221
only call listen() on SOCK_STREAM or SOCK_SEQPACKET sockets
make sure the controller empties the plugins list in update_watchers() - #220
added --log-level and --log-output to circushttpd
fix the process killing via the web UI - #219
now circus is zc.buildout compatible for scripts.
cleanup the websocket when the client disconnect - #225
fixed the default value for the endpoint - #199
splitted circushttpd in logical modules

0.5.1 - 2012-07-11

Fixed a bunch of typos in the documentation
Added the debug option
Package web-requirements.txt properly
Added a errno error code in the messages - fixes #111

0.5 - 2012-07-06

added socket support
added a listsocket command
sockets have stats too !
fixed a lot of small bugs
removed the wid - now using pid everywhere
faster tests
changed the variables syntax
use pyzmq's ioloop in more places
now using iowait for all select() calls
incr/decr commands now have an nbprocess parameter
Add a reproduce_env option to watchers
Add a new UNEXISTING status to the processes
Added the global httpd option to run circushttpd as a watcher

0.4 - 2012-06-12

Added a plugin system
Added a "singleton" option for watchers
Fixed circus-top screen flickering
Removed threads from circus.stats in favor of zmq periodic callbacks
Enhanced the documentation
Circus client now have a send_message api
The flapping feature is now a plugin
Every command line tool have a --version option
Added a statsd plugin (sends the events from circus to statsd)
The web UI now uses websockets (via socketio) to get the stats
The web UI now uses sessions for "flash messages" in the web ui

0.3.4 - 2012-05-30

Fixed a race condition that prevented the controller to cleanly reap finished processes.
Now check_flapping can be controlled in the configuration. And activated/deactivated per watcher.

0.3.3 - 2012-05-29

•: Fixed the regression on the uid handling

0.3.2 - 2012-05-24

allows optional args property to add_watcher command.
added circushttpd, circus-top and circusd-stats
allowing Arbiter.add_watcher() to set all Watcher option
make sure the redirectors are re-created on restarts

0.3.1 - 2012-04-18

fix: make sure watcher' defaults aren't overrided
added a StdoutStream class.

0.3 - 2012-04-18

added the streaming feature
now displaying coverage in the Sphinx doc
fixed the way the processes are killed (no more SIGQUIT)
the configuration has been factored out
setproctitle support

0.2 - 2012-04-04

Removed the show name. replaced by watcher.
Added support for setting process rlimit.
Added support for include dirs in the config file.
Fixed a couple of leaking file descriptors.
Fixed a core dump in the flapping
Doc improvments
Make sure circusd errors properly when another circusd is running on the same socket.
get_arbiter now accepts several watchers.
Fixed the cmd vs args vs executable in the process init.
Fixed --start on circusctl add

0.1 - 2012-03-20

•: initial release

man pages

circusd man page

Synopsis

circusd [options] [config]

Description

circusd is the main process of the Circus architecture. It takes care of running all the processes. Each process managed by Circus is a child process of circusd.

Arguments

config: configuration file

Options

-h, --help: Show the help message and exit
--log-level LEVEL: Specify the log level. LEVEL can be info, debug, critical, warning or error.
--log-output LOGOUTPUT: The location where the logs will be written. The default behavior is to write to stdout (you can force it by passing '-' to this option). Takes a filename otherwise.
--logger-config LOGGERCONFIG: The location where a standard Python logger configuration INI, JSON or YAML file can be found. This can be used to override the default logging configuration for the arbiter.
--daemon: Start circusd in the background.
--pidfile PIDFILE: The location of the PID file.
--version: Displays Circus version and exits.

add: Add a watcher
decr: Decrement the number of processes in a watcher
dstats: Get circusd stats
get: Get the value of specific watcher options
globaloptions: Get the arbiter options
incr: Increment the number of processes in a watcher
ipython: Create shell into circusd process
list: Get list of watchers or processes in a watcher
listen: Subscribe to a watcher event
listsockets: Get the list of sockets
numprocesses: Get the number of processes
numwatchers: Get the number of watchers
options: Get the value of all options for a watcher
quit: Quit the arbiter immediately
reload: Reload the arbiter or a watcher
reloadconfig: Reload the configuration file
restart: Restart the arbiter or a watcher
rm: Remove a watcher
set: Set a watcher option
signal: Send a signal
start: Start the arbiter or a watcher
stats: Get process infos
status: Get the status of a watcher or all watchers
stop: Stop watchers

Options

--endpoint ENDPOINT: connection endpoint
-h, --help: Show the help message and exit
--json: output to JSON
--prettify: prettify output
--ssh SSH: SSH Server in the format user@host:port
--ssh_keyfile SSH_KEYFILE: path to the keyfile to authorise the user
--timeout TIMEOUT: connection timeout
--version: Displays Circus version and exits.

plugin: Fully qualified name of the plugin class.

Options

--endpoint ENDPOINT: Connection endpoint.
--pubsub PUBSUB: The circusd ZeroMQ pub/sub socket to connect to.
--config CONFIG: The plugin configuration file.
--check-delay CHECK_DELAY: Check delay.
--log-level LEVEL: Specify the log level. LEVEL can be info, debug, critical, warning or error.
--log-output LOGOUTPUT: The location where the logs will be written. The default behavior is to write to stdout (you can force it by passing '-' to this option). Takes a filename otherwise.
--ssh SSH: SSH Server in the format user@host:port.
-h, --help: Show the help message and exit.
--version: Displays Circus version and exits.

--endpoint ENDPOINT: Connection endpoint.
--ssh SSH: SSH Server in the format user@host:port.
--process-timeout PROCESS_TIMEOUT: After this delay of inactivity, a process will be removed.
-h, --help: Show the help message and exit.
--version: Displays Circus version and exits.

--endpoint ENDPOINT: Connection endpoint.
--pubsub PUBSUB: The circusd ZeroMQ pub/sub socket to connect to.
--statspoint STATSPOINT: The ZeroMQ pub/sub socket to send data to.
--log-level LEVEL: Specify the log level. LEVEL can be info, debug, critical, warning or error.
--log-output LOGOUTPUT: The location where the logs will be written. The default behavior is to write to stdout (you can force it by passing '-' to this option). Takes a filename otherwise.
--ssh SSH: SSH Server in the format user@host:port.
-h, --help: Show the help message and exit.
--version: Displays Circus version and exits.

Glossary: Circus-specific terms

arbiter: The arbiter is responsible for managing all the watchers within circus, ensuring all processes run correctly.
controller: A controller contains the set of actions that can be performed on the arbiter.
flapping: The flapping detection subscribes to events and detects when some processes are constantly restarting.
pub/sub: Circus has a pubsub that receives events from the watchers and dispatches them to all subscribers.
remote controller: The remote controller allows you to communicate with the controller via ZMQ to control Circus.
watcher, watchers: A watcher is the program you tell Circus to run. A single Circus instance can run one or more watchers.
worker, workers, process, processes: A process is an independent OS process instance of your program. A single watcher can run one or more processes. We also call them workers.

Copyright

Circus was initiated by Tarek Ziade and is licenced under APLv2

Benoit Chesneau was an early contributor and did many things, like most of the circus.commands work.

Licence

Copyright 2012 - Mozilla Foundation
Copyright 2012 - Benoit Chesneau
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at


    http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Contributors

See the full list at https://github.com/circus-tent/circus/blob/master/CONTRIBUTORS.txt

AUTHOR

Mozilla Foundation, Benoit Chesneau

November 30, 2015

0.12.1