Command-line

Configuration flags

All the command-line tools support a set of common configuration flags, defined in config.py. Use --help with any of them to see the full list.

The following general flags can be passed as command-line arguments to either mrq-worker,mrq-run, or mrq-dashboard:

--trace_greenlets: Collect stats about each greenlet execution time and switches. Defaults to false.
--trace_memory: Collect stats about memory for each task. Incompatible with --greenlets > 1. Defaults to false.
--trace_io: Collect stats about all I/O operations. Defaults to true.
--print_mongodb: Print all MongoDB requests. Defaults to false.
--trace_memory_type: Create a .png object graph in trace_memory_output_dir with a random object of this type.
--trace_memory_output_dir: Directory where to output .pngs with object graphs. Defaults to folder memory_traces.
--profile: Run profiling on the whole worker. Defaults to false.
--mongodb_jobs, --mongodb: MongoDB URI for the jobs, scheduled_jobs & workers database. Defaults to mongodb://127.0.0.1:27017/mrq.
--mongodb_logs :MongoDB URI for the logs database."0" will disable remote logs, "1" will use main MongoDB. Defaults to 1
--mongodb_logs_size: If provided, sets the log collection to capped to that amount of bytes.
--no_mongodb_ensure_indexes: If provided, skip the creation of MongoDB indexes at worker startup.
--redis: Redis URI. Defaults to redis://127.0.0.1:6379.
--redis_prefix: Redis key prefix. Defaults to "mrq".
--redis_max_connections: Redis max connection pool size. Defaults to 1000.
--redis_timeout: Redis connection pool timeout to wait for an available connection. Defaults to 30.
--name: Specify a different name.
--quiet: Don't output task logs. Defaults to false.
--config, -c: Path of a config file.
--worker_class: Path to a custom worker class. Defaults to "mrq.worker.Worker".
--version, -v: Prints current MRQ version. Defaults to false.
--no_import_patch: Skips patching import to fix gevent bug #108. Defaults to false.
--add_network_latency: Adds random latency to the network calls, zero to N seconds. Can be a range (1-2)'). Defaults to 0.
--default_job_result_ttl: Seconds the results are kept in MongoDB when status is success. Defaults to 604800 seconds which is 7 days.
--default_job_abort_ttl: Seconds the tasks are kept in MongoDB when status is abort. Defaults to 86400 seconds which is 1 day.
--default_job_cancel_ttl: Seconds the tasks are kept in MongoDB when status is cancel. Defaults to 86400 seconds which is 1 day.
--default_job_timeout: In seconds, delay before interrupting the job. Defaults to 3600 seconds which is 1 hour.
--default_job_max_retries: Set the status to "maxretries" after retrying that many times. Defaults to 3 seconds.
--default_job_retry_delay: Seconds before a job in retry status is requeued again. Defaults to 3 seconds.
--use_large_job_ids: Do not use compacted job IDs in Redis. For compatibility with 0.1.x only. Defaults to false.

mrq-worker

mrq-worker starts a new worker and takes one argument list:

queues: The queues to listen on.Defaults to default , which will listen on all queues.

You can pass additional configuration flags:

--max_jobs: Gevent:max number of jobs to do before quitting. Use as a workaround for memory leaks in your tasks. Defaults to 0
--max_memory: Max memory (in Mb) after which the process will be shut down. Use with --processes [1-N] to have supervisord automatically respawn the worker when this happens. Defaults to 1
--grenlets, --gevent, --g: Max number of greenlets to use. Defaults to 1.
--processes, --p: Number of processes to launch with supervisord. Defaults to 0 (no supervisord).
--supervisord_template: Path of supervisord template to use. Defaults to supervisord_templates/default.conf.
--scheduler: Run the scheduler. Defaults to false.
--scheduler_interval: Seconds between scheduler checks. Defaults to 60 seconds, only ints are acceptable.
--report_interval: Seconds between worker reports to MongoDB. Defaults to 10 seconds, floats are acceptable too.
--report_file: Filepath of a json dump of the worker status. Disabled if none.
--subqueues_refresh_interval: Seconds between worker refreshes of the known subqueues.
--subqueues_delimiter: Delimiter between main queue and subqueue names.
--paused_queues_refresh_interval: Seconds between worker refreshes of the paused queues list.
--admin_port: Start an admin server on this port, if provided. Incompatible with --processes. Defaults to 0
--admin_ip: IP for the admin server to listen on. Use "0.0.0.0" to allow access from outside. Defaults to 127.0.0.1.
--local_ip: Overwrite the local IP, to be displayed in the dashboard.
--max_latency: Max seconds while worker may sleep waiting for a new job. Can be < 1 and a float value.
--dequeue_strategy: Strategy for dequeuing multiple queues. Default is sequential, to dequeue them in command-line order.

Worker concurrency

The default is to run tasks one at a time. You should obviously change this behaviour to use Gevent's full capabilities with something like:

mrq-worker --processes 3 --greenlets 10 queue-highpriority queue-default

This will start 30 greenlets over 3 UNIX processes. Each of them will run 10 jobs at the same time.

As soon as you use the --processes option (even with --processes=1) then supervisord will be used to control the processes. It is quite useful to manage long-running instances.

Simulating network latency

Sometimes it is helpful in local development to simulate an environment with higher network latency.

To do this we added a --add_network_latency=0.1 config option that will add (in this case) a random delay between 0 and 0.1 seconds to every network call.

mrq-dashboard

mrq-dashboard starts the web dashboard on the default port and takes these arguments:

--dashboard_httpauth: HTTP Auth for the Dashboard. Format is user:pass.
--dashboard_queue: Default queue for dashboard actions.
--dashboard_port: Use this port for mrq-dashboard. Defaults to port 5555.
--dashboard_ip: Bind the dashboard to this IP. Default is 0.0.0.0, use 127.0.0.1 to restrict access.

mrq-run

mrq-run runs a one-off task. If you add the --queue option that will enqueue it to be later ran by a worker.

taskpath: Task to run.
taskargs: JSON-encoded arguments, or "key value" pairs.
--queue: Queue the task on this queue instead of running it right away.

Typical usage is:

$ mrq-run tasks.mylib.myfile.MyTask '{"param1": 1, "param2": True}'

# Shorter syntax which casts all values as strings (equivalent to '{"param1": "1", "param2": "ok"}')
$ mrq-run tasks.mylib.myfile.MyTask param1 1 param2 ok