Configuration and defaults¶
This document describes the configuration options available.
If you’re using the default loader, you must create the celeryconfig.py
module and make sure it’s available on the Python path.
- Example configuration file
- New lowercase settings
- Configuration Directives
- General settings
- Time and date settings
- Task settings
- Task execution settings
- Task result backend settings
- Database backend settings
- RPC backend settings
- Cache backend settings
- Redis backend settings
- Cassandra backend settings
- Elasticsearch backend settings
- Riak backend settings
- IronCache backend settings
- Couchbase backend settings
- CouchDB backend settings
- File-system backend settings
- Consul K/V store backend settings
- Message Routing
- Broker Settings
- Worker
- Events
- Remote Control Commands
- Logging
- Security
- Custom Component Classes (advanced)
- Beat Settings (celery beat)
Example configuration file¶
This is an example configuration file to get you started. It should contain all you need to run a basic Celery set-up.
## Broker settings.
broker_url = 'amqp://guest:guest@localhost:5672//'
# List of modules to import when the Celery worker starts.
imports = ('myapp.tasks',)
## Using the database to store task state and results.
result_backend = 'db+sqlite:///results.db'
task_annotations = {'tasks.add': {'rate_limit': '10/s'}}
New lowercase settings¶
Version 4.0 introduced new lower case settings and setting organization.
The major difference from previous versions, apart from the lower case names, is the renaming of some prefixes: celerybeat_ to beat_, celeryd_ to worker_, and most of the top level celery_ settings have been moved into a new task_ prefix.
Celery will still be able to read old configuration files, so there’s no rush in moving to the new settings format.
Setting name | Replace with
---|---
CELERY_ACCEPT_CONTENT | accept_content
CELERY_ENABLE_UTC | enable_utc
CELERY_IMPORTS | imports
CELERY_INCLUDE | include
CELERY_TIMEZONE | timezone
CELERYBEAT_MAX_LOOP_INTERVAL | beat_max_loop_interval
CELERYBEAT_SCHEDULE | beat_schedule
CELERYBEAT_SCHEDULER | beat_scheduler
CELERYBEAT_SCHEDULE_FILENAME | beat_schedule_filename
CELERYBEAT_SYNC_EVERY | beat_sync_every
BROKER_URL | broker_url
BROKER_TRANSPORT | broker_transport
BROKER_TRANSPORT_OPTIONS | broker_transport_options
BROKER_CONNECTION_TIMEOUT | broker_connection_timeout
BROKER_CONNECTION_RETRY | broker_connection_retry
BROKER_CONNECTION_MAX_RETRIES | broker_connection_max_retries
BROKER_FAILOVER_STRATEGY | broker_failover_strategy
BROKER_HEARTBEAT | broker_heartbeat
BROKER_LOGIN_METHOD | broker_login_method
BROKER_POOL_LIMIT | broker_pool_limit
BROKER_USE_SSL | broker_use_ssl
CELERY_CACHE_BACKEND | cache_backend
CELERY_CACHE_BACKEND_OPTIONS | cache_backend_options
CASSANDRA_COLUMN_FAMILY | cassandra_table
CASSANDRA_ENTRY_TTL | cassandra_entry_ttl
CASSANDRA_KEYSPACE | cassandra_keyspace
CASSANDRA_PORT | cassandra_port
CASSANDRA_READ_CONSISTENCY | cassandra_read_consistency
CASSANDRA_SERVERS | cassandra_servers
CASSANDRA_WRITE_CONSISTENCY | cassandra_write_consistency
CELERY_COUCHBASE_BACKEND_SETTINGS | couchbase_backend_settings
CELERY_MONGODB_BACKEND_SETTINGS | mongodb_backend_settings
CELERY_EVENT_QUEUE_EXPIRES | event_queue_expires
CELERY_EVENT_QUEUE_TTL | event_queue_ttl
CELERY_EVENT_QUEUE_PREFIX | event_queue_prefix
CELERY_EVENT_SERIALIZER | event_serializer
CELERY_REDIS_DB | redis_db
CELERY_REDIS_HOST | redis_host
CELERY_REDIS_MAX_CONNECTIONS | redis_max_connections
CELERY_REDIS_PASSWORD | redis_password
CELERY_REDIS_PORT | redis_port
CELERY_RESULT_BACKEND | result_backend
CELERY_MAX_CACHED_RESULTS | result_cache_max
CELERY_MESSAGE_COMPRESSION | result_compression
CELERY_RESULT_EXCHANGE | result_exchange
CELERY_RESULT_EXCHANGE_TYPE | result_exchange_type
CELERY_TASK_RESULT_EXPIRES | result_expires
CELERY_RESULT_PERSISTENT | result_persistent
CELERY_RESULT_SERIALIZER | result_serializer
CELERY_RESULT_DBURI | Use result_backend instead.
CELERY_RESULT_ENGINE_OPTIONS | database_engine_options
[...]_DB_SHORT_LIVED_SESSIONS | database_short_lived_sessions
CELERY_RESULT_DB_TABLE_NAMES | database_table_names
CELERY_SECURITY_CERTIFICATE | security_certificate
CELERY_SECURITY_CERT_STORE | security_cert_store
CELERY_SECURITY_KEY | security_key
CELERY_ACKS_LATE | task_acks_late
CELERY_TASK_ALWAYS_EAGER | task_always_eager
CELERY_ANNOTATIONS | task_annotations
CELERY_MESSAGE_COMPRESSION | task_compression
CELERY_CREATE_MISSING_QUEUES | task_create_missing_queues
CELERY_DEFAULT_DELIVERY_MODE | task_default_delivery_mode
CELERY_DEFAULT_EXCHANGE | task_default_exchange
CELERY_DEFAULT_EXCHANGE_TYPE | task_default_exchange_type
CELERY_DEFAULT_QUEUE | task_default_queue
CELERY_DEFAULT_RATE_LIMIT | task_default_rate_limit
CELERY_DEFAULT_ROUTING_KEY | task_default_routing_key
CELERY_TASK_EAGER_PROPAGATES | task_eager_propagates
CELERY_IGNORE_RESULT | task_ignore_result
CELERY_TASK_PUBLISH_RETRY | task_publish_retry
CELERY_TASK_PUBLISH_RETRY_POLICY | task_publish_retry_policy
CELERY_QUEUES | task_queues
CELERY_ROUTES | task_routes
CELERY_SEND_TASK_SENT_EVENT | task_send_sent_event
CELERY_TASK_SERIALIZER | task_serializer
CELERYD_TASK_SOFT_TIME_LIMIT | task_soft_time_limit
CELERYD_TASK_TIME_LIMIT | task_time_limit
CELERY_TRACK_STARTED | task_track_started
CELERYD_AGENT | worker_agent
CELERYD_AUTOSCALER | worker_autoscaler
CELERYD_CONCURRENCY | worker_concurrency
CELERYD_CONSUMER | worker_consumer
CELERY_WORKER_DIRECT | worker_direct
CELERY_DISABLE_RATE_LIMITS | worker_disable_rate_limits
CELERY_ENABLE_REMOTE_CONTROL | worker_enable_remote_control
CELERYD_HIJACK_ROOT_LOGGER | worker_hijack_root_logger
CELERYD_LOG_COLOR | worker_log_color
CELERYD_LOG_FORMAT | worker_log_format
CELERYD_WORKER_LOST_WAIT | worker_lost_wait
CELERYD_MAX_TASKS_PER_CHILD | worker_max_tasks_per_child
CELERYD_POOL | worker_pool
CELERYD_POOL_PUTLOCKS | worker_pool_putlocks
CELERYD_POOL_RESTARTS | worker_pool_restarts
CELERYD_PREFETCH_MULTIPLIER | worker_prefetch_multiplier
CELERYD_REDIRECT_STDOUTS | worker_redirect_stdouts
CELERYD_REDIRECT_STDOUTS_LEVEL | worker_redirect_stdouts_level
CELERYD_SEND_EVENTS | worker_send_task_events
CELERYD_STATE_DB | worker_state_db
CELERYD_TASK_LOG_FORMAT | worker_task_log_format
CELERYD_TIMER | worker_timer
CELERYD_TIMER_PRECISION | worker_timer_precision
Configuration Directives¶
General settings¶
accept_content
¶
Default: {'json'}
(set, list, or tuple).
A white-list of content-types/serializers to allow.
If a message is received that’s not in this list then the message will be discarded with an error.
By default only json is enabled. If you add other content types, such as pickle or yaml, make sure untrusted parties don't have access to your broker. See Security for more.
Example:
# using serializer name
accept_content = ['json']
# or the actual content-type (MIME)
accept_content = ['application/json']
Time and date settings¶
enable_utc
¶
New in version 2.5.
Default: Enabled (since version 3.0).
If enabled dates and times in messages will be converted to use the UTC timezone.
Note that workers running Celery versions below 2.5 will assume a local timezone for all messages, so only enable if all workers have been upgraded.
timezone
¶
New in version 2.5.
Default: "UTC"
.
Configure Celery to use a custom time zone. The timezone value can be any time zone supported by the pytz library.
If not set the UTC timezone is used. For backwards compatibility there's also an enable_utc setting; when this is set to false the system local timezone is used instead.
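For example, to run with a specific local time zone instead of UTC (the zone name here is just an illustration; any pytz zone name works):
timezone = 'Europe/London'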
Task settings¶
task_annotations
¶
New in version 2.5.
Default: None
.
This setting can be used to rewrite any task attribute from the configuration. The setting can be a dict, or a list of annotation objects that filter for tasks and return a map of attributes to change.
This will change the rate_limit
attribute for the tasks.add
task:
task_annotations = {'tasks.add': {'rate_limit': '10/s'}}
or change the same for all tasks:
task_annotations = {'*': {'rate_limit': '10/s'}}
You can change methods too, for example the on_failure
handler:
def my_on_failure(self, exc, task_id, args, kwargs, einfo):
    print('Oh no! Task failed: {0!r}'.format(exc))

task_annotations = {'*': {'on_failure': my_on_failure}}
If you need more flexibility then you can use objects instead of a dict to choose the tasks to annotate:
class MyAnnotate(object):

    def annotate(self, task):
        if task.name.startswith('tasks.'):
            return {'rate_limit': '10/s'}

task_annotations = (MyAnnotate(), {other,})
task_compression
¶
Default: None
Default compression used for task messages.
Can be gzip
, bzip2
(if available), or any custom
compression schemes registered in the Kombu compression registry.
The default is to send uncompressed messages.
task_protocol
¶
Default: 2 (since 4.0).
Set the default task message protocol version used to send tasks. Supports protocols: 1 and 2.
Protocol 2 is supported by 3.1.24 and 4.x+.
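For example, if some clients or workers in your cluster still run a version older than 3.1.24, you could pin the protocol to the old version:
task_protocol = 1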
task_serializer
¶
Default: "json"
(since 4.0, earlier: pickle).
A string identifying the default serialization method to use. Can be
json (default), pickle, yaml, msgpack, or any custom serialization
methods that have been registered with kombu.serialization.registry
.
See also Serializers for information about supported serialization formats.
task_publish_retry
¶
New in version 2.2.
Default: Enabled.
Decides if publishing task messages will be retried in the case
of connection loss or other connection errors.
See also task_publish_retry_policy
.
task_publish_retry_policy
¶
New in version 2.2.
Default: See Message Sending Retry.
Defines the default policy when retrying publishing a task message in the case of connection loss or other connection errors.
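A minimal sketch of a custom policy, using the retry-policy keys described in Message Sending Retry (values illustrative):
task_publish_retry_policy = {
    'max_retries': 3,     # give up after three attempts
    'interval_start': 0,  # first retry is immediate
    'interval_step': 0.2, # add 0.2 seconds for each subsequent retry
    'interval_max': 0.2,  # cap the wait at 0.2 seconds
}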
Task execution settings¶
task_always_eager
¶
Default: Disabled.
If this is True
, all tasks will be executed locally by blocking until
the task returns. apply_async()
and Task.delay()
will return
an EagerResult
instance that emulates the API
and behavior of AsyncResult
, except the result
is already evaluated.
That is, tasks will be executed locally instead of being sent to the queue.
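For example, a test configuration might combine this with task_eager_propagates so exceptions surface in the test run (a sketch; where you keep such settings is up to you):
# e.g. in a test settings module
task_always_eager = True
task_eager_propagates = True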
task_eager_propagates
¶
Default: Disabled.
If this is True
, eagerly executed tasks (applied by task.apply(),
or when the task_always_eager
setting is enabled), will
propagate exceptions.
It’s the same as always running apply()
with throw=True
.
task_remote_tracebacks
¶
Default: Disabled.
If enabled task results will include the worker's stack when re-raising task errors.
This requires the tblib library, which can be installed using pip:
$ pip install celery[tblib]
See Bundles for information on combining multiple extension requirements.
task_ignore_result
¶
Default: Disabled.
Whether to store the task return values or not (tombstones).
If you still want to store errors, just not successful return values,
you can set task_store_errors_even_if_ignored
.
task_store_errors_even_if_ignored
¶
Default: Disabled.
If set, the worker stores all task errors in the result store even if
Task.ignore_result
is on.
task_track_started
¶
Default: Disabled.
If True
the task will report its status as ‘started’ when the
task is executed by a worker. The default value is False
as
the normal behavior is to not report that level of granularity. Tasks
are either pending, finished, or waiting to be retried. Having a ‘started’
state can be useful for when there are long running tasks and there’s a
need to report what task is currently running.
task_time_limit
¶
Default: No time limit.
Task hard time limit in seconds. The worker processing the task will be killed and replaced with a new one when this is exceeded.
task_soft_time_limit
¶
Default: No soft time limit.
Task soft time limit in seconds.
The SoftTimeLimitExceeded
exception will be
raised when this is exceeded. For example, the task can catch this to
clean up before the hard time limit comes:
from celery.exceptions import SoftTimeLimitExceeded

@app.task
def mytask():
    try:
        return do_work()
    except SoftTimeLimitExceeded:
        cleanup_in_a_hurry()
task_acks_late
¶
Default: Disabled.
Late ack means the task messages will be acknowledged after the task has been executed, not just before (the default behavior).
See also the FAQ entry “Should I use retry or acks_late?”.
task_reject_on_worker_lost
¶
Default: Disabled.
Even if task_acks_late
is enabled, the worker will
acknowledge tasks when the worker process executing them abruptly
exits or is signaled (e.g., KILL
/INT
, etc).
Setting this to true allows the message to be re-queued instead, so that the task will be executed again by the same worker, or another worker.
Warning
Enabling this can cause message loops; make sure you know what you’re doing.
task_default_rate_limit
¶
Default: No rate limit.
The global default rate limit for tasks.
This value is used for tasks that don't have a custom rate limit set.
See also
The worker_disable_rate_limits setting can disable all rate limits.
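Rate limits use the same string format as per-task rate limits, for example (value illustrative):
task_default_rate_limit = '100/m'  # at most 100 such tasks per minute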
Task result backend settings¶
result_backend
¶
Default: No result backend enabled by default.
The backend used to store task results (tombstones). Can be one of the following:
- rpc: Send results back as AMQP messages. See RPC backend settings.
- database: Use a relational database supported by SQLAlchemy. See Database backend settings.
- redis: Use Redis to store the results. See Redis backend settings.
- cache: Use Memcached to store the results. See Cache backend settings.
- cassandra: Use Cassandra to store the results. See Cassandra backend settings.
- elasticsearch: Use Elasticsearch to store the results. See Elasticsearch backend settings.
- ironcache: Use IronCache to store the results. See IronCache backend settings.
- couchbase: Use Couchbase to store the results. See Couchbase backend settings.
- couchdb: Use CouchDB to store the results. See CouchDB backend settings.
- filesystem: Use a shared directory to store the results. See File-system backend settings.
- consul: Use the Consul K/V store to store the results. See Consul K/V store backend settings.
result_serializer
¶
Default: json
since 4.0 (earlier: pickle).
Result serialization format.
See Serializers for information about supported serialization formats.
result_compression
¶
Default: No compression.
Optional compression method used for task results.
Supports the same options as the task_compression setting.
result_expires
¶
Default: Expire after 1 day.
Time (in seconds, or a timedelta object) after which stored task tombstones will be deleted.
A built-in periodic task will delete the results after this time
(celery.backend_cleanup
), assuming that celery beat
is
enabled. The task runs daily at 4am.
A value of None
or 0 means results will never expire (depending
on backend specifications).
Note
For the moment this only works with the AMQP, database, cache, and Redis backends.
When using the database backend, celery beat
must be
running for the results to be expired.
result_cache_max
¶
Default: Disabled by default.
Enables client caching of results.
This can be useful for the old deprecated ‘amqp’ backend where the result is unavailable as soon as one result instance consumes it.
This is the total number of results to cache before older results are evicted.
A value of 0 or None means no limit, and a value of -1
will disable the cache.
Database backend settings¶
Database URL Examples¶
To use the database backend you have to configure the
result_backend
setting with a connection URL and the db+
prefix:
result_backend = 'db+scheme://user:password@host:port/dbname'
Examples:
# sqlite (filename)
result_backend = 'db+sqlite:///results.sqlite'
# mysql
result_backend = 'db+mysql://scott:tiger@localhost/foo'
# postgresql
result_backend = 'db+postgresql://scott:tiger@localhost/mydatabase'
# oracle
result_backend = 'db+oracle://scott:tiger@127.0.0.1:1521/sidname'
Please see Supported Databases for a table of supported databases,
and Connection String for more information about connection
strings (this is the part of the URI that comes after the db+
prefix).
database_engine_options
¶
Default: {}
(empty mapping).
To specify additional SQLAlchemy database engine options you can use the database_engine_options setting:
# echo enables verbose logging from SQLAlchemy.
app.conf.database_engine_options = {'echo': True}
database_short_lived_sessions
¶
Default: Disabled by default.
Short lived sessions are disabled by default. If enabled they can drastically reduce performance, especially on systems processing lots of tasks. This option is useful on low-traffic workers that experience errors as a result of cached database connections going stale through inactivity. For example, intermittent errors like (OperationalError) (2006, ‘MySQL server has gone away’) can be fixed by enabling short lived sessions. This option only affects the database backend.
database_table_names
¶
Default: {}
(empty mapping).
When SQLAlchemy is configured as the result backend, Celery automatically creates two tables to store result meta-data for tasks. This setting allows you to customize the table names:
# use custom table names for the database result backend.
database_table_names = {
    'task': 'myapp_taskmeta',
    'group': 'myapp_groupmeta',
}
RPC backend settings¶
result_persistent
¶
Default: Disabled by default (transient messages).
If set to True
, result messages will be persistent. This means the
messages won’t be lost after a broker restart.
Example configuration¶
result_backend = 'rpc://'
result_persistent = False
Cache backend settings¶
Note
The cache backend supports the pylibmc and python-memcached libraries. The latter is used only if pylibmc isn’t installed.
Using a single Memcached server:
result_backend = 'cache+memcached://127.0.0.1:11211/'
Using multiple Memcached servers:
result_backend = """
cache+memcached://172.19.26.240:11211;172.19.26.242:11211/
""".strip()
The “memory” backend stores the cache in memory only:
result_backend = 'cache'
cache_backend = 'memory'
cache_backend_options
¶
Default: {}
(empty mapping).
You can set pylibmc options using the cache_backend_options
setting:
cache_backend_options = {
    'binary': True,
    'behaviors': {'tcp_nodelay': True},
}
cache_backend
¶
This setting is no longer used as it’s now possible to specify
the cache backend directly in the result_backend
setting.
Redis backend settings¶
Configuring the backend URL¶
Note
The Redis backend requires the redis library.
To install this package use pip:
$ pip install celery[redis]
See Bundles for information on combining multiple extension requirements.
This backend requires the result_backend
setting to be set to a Redis URL:
result_backend = 'redis://:password@host:port/db'
For example:
result_backend = 'redis://localhost/0'
is the same as:
result_backend = 'redis://'
The fields of the URL are defined as follows:
password
Password used to connect to the database.
host
Host name or IP address of the Redis server (e.g., localhost).
port
Port to the Redis server. Default is 6379.
db
Database number to use. Default is 0. The db can include an optional leading slash.
redis_max_connections
¶
Default: No limit.
Maximum number of connections available in the Redis connection pool used for sending and retrieving results.
redis_socket_connect_timeout
¶
New in version 5.0.1.
Default: None.
Socket timeout for connections to Redis from the result backend in seconds (int/float).
redis_socket_timeout
¶
Default: 5.0 seconds.
Socket timeout for reading/writing operations to the Redis server in seconds (int/float), used by the redis result backend.
Cassandra backend settings¶
Note
This Cassandra backend driver requires cassandra-driver.
To install, use pip:
$ pip install celery[cassandra]
See Bundles for information on combining multiple extension requirements.
This backend requires the following configuration directives to be set.
cassandra_servers
¶
Default: []
(empty list).
List of Cassandra server hosts. For example:
cassandra_servers = ['localhost']
cassandra_keyspace
¶
Default: None.
The key-space in which to store the results. For example:
cassandra_keyspace = 'tasks_keyspace'
cassandra_table
¶
Default: None.
The table (column family) in which to store the results. For example:
cassandra_table = 'tasks'
cassandra_read_consistency
¶
Default: None.
The read consistency used. Values can be ONE, TWO, THREE, QUORUM, ALL, LOCAL_QUORUM, EACH_QUORUM, or LOCAL_ONE.
cassandra_write_consistency
¶
Default: None.
The write consistency used. Values can be ONE, TWO, THREE, QUORUM, ALL, LOCAL_QUORUM, EACH_QUORUM, or LOCAL_ONE.
cassandra_entry_ttl
¶
Default: None.
Time-to-live for status entries. They will expire and be removed that many seconds after being added. A value of None (default) means they will never expire.
cassandra_auth_provider
¶
Default: None
.
AuthProvider class within the cassandra.auth module to use. Values can be PlainTextAuthProvider or SaslAuthProvider.
cassandra_auth_kwargs
¶
Default: {}
(empty mapping).
Named arguments to pass into the authentication provider. For example:
cassandra_auth_kwargs = {
    'username': 'cassandra',
    'password': 'cassandra',
}
Example configuration¶
cassandra_servers = ['localhost']
cassandra_keyspace = 'celery'
cassandra_table = 'tasks'
cassandra_read_consistency = 'ONE'
cassandra_write_consistency = 'ONE'
cassandra_entry_ttl = 86400
Elasticsearch backend settings¶
To use Elasticsearch as the result backend you simply need to
configure the result_backend
setting with the correct URL.
Example configuration¶
result_backend = 'elasticsearch://example.com:9200/index_name/doc_type'
Riak backend settings¶
Note
The Riak backend requires the riak library.
To install this package use pip:
$ pip install celery[riak]
See Bundles for information on combining multiple extension requirements.
This backend requires the result_backend
setting to be set to a Riak URL:
result_backend = 'riak://host:port/bucket'
For example:
result_backend = 'riak://localhost/celery'
is the same as:
result_backend = 'riak://'
The fields of the URL are defined as follows:
host
Host name or IP address of the Riak server (e.g., ‘localhost’).
port
Port to the Riak server using the protobuf protocol. Default is 8087.
bucket
Bucket name to use. Default is celery. The bucket needs to be a string with ASCII characters only.
Alternatively, this backend can be configured with the following configuration directives.
riak_backend_settings
¶
Default: {}
(empty mapping).
This is a dict supporting the following keys:
host
The host name of the Riak server. Defaults to "localhost".
port
The port the Riak server is listening to. Defaults to 8087.
bucket
The bucket name to connect to. Defaults to "celery".
protocol
The protocol to use to connect to the Riak server. This isn't configurable via result_backend.
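A minimal sketch configuring the backend through this dict instead of a URL, using the keys above (values illustrative):
riak_backend_settings = {
    'host': 'localhost',
    'port': 8087,
    'bucket': 'celery',
}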
IronCache backend settings¶
Note
The IronCache backend requires the iron_celery library:
To install this package use pip:
$ pip install iron_celery
IronCache is configured via the URL provided in result_backend
, for example:
result_backend = 'ironcache://project_id:token@'
Or to change the cache name:
ironcache://project_id:token@/awesomecache
For more information, see: https://github.com/iron-io/iron_celery
Couchbase backend settings¶
Note
The Couchbase backend requires the couchbase library.
To install this package use pip:
$ pip install celery[couchbase]
See Bundles for information on combining multiple extension requirements.
This backend can be configured by setting result_backend to a Couchbase URL:
result_backend = 'couchbase://username:password@host:port/bucket'
couchbase_backend_settings
¶
Default: {}
(empty mapping).
This is a dict supporting the following keys:
host
Host name of the Couchbase server. Defaults to localhost.
port
The port the Couchbase server is listening to. Defaults to 8091.
bucket
The default bucket the Couchbase server is writing to. Defaults to default.
username
User name to authenticate to the Couchbase server as (optional).
password
Password to authenticate to the Couchbase server (optional).
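A minimal sketch using this dict instead of putting the credentials in the URL (values illustrative, credentials hypothetical):
couchbase_backend_settings = {
    'host': 'localhost',
    'port': 8091,
    'bucket': 'default',
    'username': 'someuser',  # hypothetical credentials
    'password': 'somepass',
}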
CouchDB backend settings¶
Note
The CouchDB backend requires the pycouchdb library:
To install this package use pip:
$ pip install celery[couchdb]
See Bundles for information on combining multiple extension requirements.
This backend can be configured by setting result_backend to a CouchDB URL:
result_backend = 'couchdb://username:password@host:port/container'
The URL is formed out of the following parts:
username
User name to authenticate to the CouchDB server as (optional).
password
Password to authenticate to the CouchDB server (optional).
host
Host name of the CouchDB server. Defaults to localhost.
port
The port the CouchDB server is listening to. Defaults to 8091.
container
The default container the CouchDB server is writing to. Defaults to default.
File-system backend settings¶
This backend can be configured using a file URL, for example:
result_backend = 'file:///var/celery/results'
The configured directory needs to be shared and writable by all servers using the backend.
If you’re trying Celery on a single system you can simply use the backend without any further configuration. For larger clusters you could use NFS, GlusterFS, CIFS, HDFS (using FUSE), or any other file-system.
Consul K/V store backend settings¶
The Consul backend can be configured using a URL, for example:
result_backend = 'consul://localhost:8500/'
The backend will store results in the K/V store of Consul as individual keys.
The backend supports auto expire of results using TTLs in Consul.
Message Routing¶
task_queues
¶
Default: None
(queue taken from default queue settings).
Most users will not want to specify this setting and should rather use the automatic routing facilities.
If you really want to configure advanced routing, this setting should
be a list of kombu.Queue
objects the worker will consume from.
Note that this setting can be overridden per worker via the -Q option, and individual queues from this list (by name) can be excluded using the -X option.
Also see Basics for more information.
The default is a queue/exchange/binding key of celery
, with
exchange type direct
.
See also task_routes
task_routes
¶
Default: None
.
A list of routers, or a single router used to route tasks to queues. When deciding the final destination of a task the routers are consulted in order.
A router can be specified as either:
- A function with the signature (name, args, kwargs, options, task=None, **kwargs)
- A string providing the path to a router function.
- A dict containing a router specification: will be converted to a celery.routes.MapRoute instance.
- A list of (pattern, route) tuples: will be converted to a celery.routes.MapRoute instance.
Examples:
task_routes = {
    'celery.ping': 'default',
    'mytasks.add': 'cpu-bound',
    'feed.tasks.*': 'feeds',                           # <-- glob pattern
    re.compile(r'(image|video)\.tasks\..*'): 'media',  # <-- regex
    'video.encode': {
        'queue': 'video',
        'exchange': 'media',
        'routing_key': 'media.video.encode',
    },
}

task_routes = ('myapp.tasks.route_task', {'celery.ping': 'default'})
Where myapp.tasks.route_task
could be:
def route_task(self, name, args, kwargs, options, task=None, **kw):
    if name == 'celery.ping':
        return {'queue': 'default'}
route_task
may return a string or a dict. A string then means
it’s a queue name in task_queues
, a dict means it’s a custom route.
When sending tasks, the routers are consulted in order. The first
router that doesn’t return None
is the route to use. The message options
is then merged with the found route settings, where the routers settings
have priority.
For example, if apply_async() has these arguments:
Task.apply_async(immediate=False, exchange='video',
                 routing_key='video.compress')
and a router returns:
{'immediate': True, 'exchange': 'urgent'}
the final message options will be:
immediate=True, exchange='urgent', routing_key='video.compress'
(and any default message options defined in the
Task
class)
Values defined in task_routes
have precedence over values defined in
task_queues
when merging the two.
With the following settings:
task_queues = {
    'cpubound': {
        'exchange': 'cpubound',
        'routing_key': 'cpubound',
    },
}

task_routes = {
    'tasks.add': {
        'queue': 'cpubound',
        'routing_key': 'tasks.add',
        'serializer': 'json',
    },
}
The final routing options for tasks.add
will become:
{'exchange': 'cpubound',
'routing_key': 'tasks.add',
'serializer': 'json'}
See Routers for more examples.
task_queue_ha_policy
¶
Supported brokers: RabbitMQ
Default: None
.
This will set the default HA policy for a queue, and the value
can either be a string (usually all
):
task_queue_ha_policy = 'all'
Using 'all' will replicate the queue to all current nodes. Or you can give it a list of nodes to replicate to:
task_queue_ha_policy = ['rabbit@host1', 'rabbit@host2']
Using a list will implicitly set x-ha-policy
to ‘nodes’ and
x-ha-policy-params
to the given list of nodes.
See http://www.rabbitmq.com/ha.html for more information.
worker_direct
¶
Default: Disabled.
This option enables a dedicated queue for every worker, so that tasks can be routed to specific workers.
The queue name for each worker is automatically generated based on
the worker hostname and a .dq
suffix, using the C.dq
exchange.
For example the queue name for the worker with node name w1@example.com
becomes:
w1@example.com.dq
Then you can route a task to a specific worker by specifying the hostname as the routing key and the C.dq exchange:
task_routes = {
    'tasks.add': {'exchange': 'C.dq', 'routing_key': 'w1@example.com'}
}
task_create_missing_queues
¶
Default: Enabled.
If enabled (default), any queues specified that aren’t defined in
task_queues
will be automatically created. See
Automatic routing.
task_default_queue
¶
Default: "celery"
.
The name of the default queue used by .apply_async if the message has no route or no custom queue has been specified.
This queue must be listed in task_queues
.
If task_queues
isn’t specified then it’s automatically
created containing one queue entry, where this name is used as the name of
that queue.
task_default_exchange
¶
Default: "celery"
.
Name of the default exchange to use when no custom exchange is
specified for a key in the task_queues
setting.
task_default_exchange_type
¶
Default: "direct"
.
Default exchange type used when no custom exchange type is specified
for a key in the task_queues
setting.
task_default_routing_key
¶
Default: "celery"
.
The default routing key used when no custom routing key
is specified for a key in the task_queues
setting.
task_default_delivery_mode
¶
Default: "persistent"
.
Can be transient (messages not written to disk) or persistent (written to disk).
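Putting the defaults together, a sketch that renames the default queue and makes its messages transient (names illustrative):
task_default_queue = 'myqueue'
task_default_exchange = 'myqueue'
task_default_exchange_type = 'direct'
task_default_routing_key = 'myqueue'
task_default_delivery_mode = 'transient'  # don't write messages to disk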
Broker Settings¶
broker_url
¶
Default: "amqp://"
Default broker URL. This must be a URL in the form of:
transport://userid:password@hostname:port/virtual_host
Only the scheme part (transport://) is required; the rest is optional, and defaults to the specific transport's default values.
The transport part is the broker implementation to use, and the default is amqp (using librabbitmq if installed, falling back to pyamqp). There are also other choices available, including redis://, sqs://, and qpid://.
The scheme can also be a fully qualified path to your own transport implementation:
broker_url = 'proj.transports.MyTransport://localhost'
More than one broker URL, of the same transport, can also be specified. The broker URLs can be passed in as a single string that’s semicolon delimited:
broker_url = 'transport://userid:password@hostname:port//;transport://userid:password@hostname:port//'
Or as a list:
broker_url = [
    'transport://userid:password@localhost:port//',
    'transport://userid:password@hostname:port//'
]
The brokers will then be used in the broker_failover_strategy
.
See URLs in the Kombu documentation for more information.
broker_read_url
/ broker_write_url
¶
Default: Taken from broker_url
.
These settings can be configured instead of broker_url, to specify different connection parameters for the broker connections used for consuming and producing.
Example:
broker_read_url = 'amqp://user:pass@broker.example.com:56721'
broker_write_url = 'amqp://user:pass@broker.example.com:56722'
Both options can also be specified as a list for failover alternates, see
broker_url
for more information.
broker_failover_strategy
¶
Default: "round-robin"
.
Default failover strategy for the broker Connection object. If supplied, may map to a key in ‘kombu.connection.failover_strategies’, or be a reference to any method that yields a single item from a supplied list.
Example:
import random
from itertools import repeat

# Random failover strategy
def random_failover_strategy(servers):
    it = list(servers)  # don't modify caller's list
    shuffle = random.shuffle
    for _ in repeat(None):
        shuffle(it)
        yield it[0]

broker_failover_strategy = random_failover_strategy
broker_heartbeat
¶
Transports supported: pyamqp
Default: 120.0
(negotiated by server).
Note: This value is only used by the worker, clients do not use a heartbeat at the moment.
It's not always possible to detect connection loss in a timely manner using TCP/IP alone, so AMQP defines something called heartbeats that's used both by the client and the broker to detect if a connection was closed.
If the heartbeat value is 10 seconds, then
the heartbeat will be monitored at the interval specified
by the broker_heartbeat_checkrate
setting (by default
this is set to double the rate of the heartbeat value,
so for the 10 seconds, the heartbeat is checked every 5 seconds).
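For example, to request the 10 second heartbeat discussed above instead of the negotiated default:
broker_heartbeat = 10.0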
broker_heartbeat_checkrate
¶
Transports supported: pyamqp
Default: 2.0.
At intervals the worker will monitor that the broker hasn’t missed
too many heartbeats. The rate at which this is checked is calculated
by dividing the broker_heartbeat
value with this value,
so if the heartbeat is 10.0 and the rate is the default 2.0, the check
will be performed every 5 seconds (twice the heartbeat sending rate).
broker_use_ssl
¶
Transports supported: pyamqp, redis
Default: Disabled.
Toggles SSL usage on broker connection and SSL settings.
If True
the connection will use SSL with default SSL settings.
If set to a dict, will configure SSL connection according to the specified
policy. The format used is Python’s ssl.wrap_socket()
options.
Note that SSL socket is generally served on a separate port by the broker.
Example providing a client cert and validating the server cert against a custom certificate authority:
import ssl

broker_use_ssl = {
    'keyfile': '/var/ssl/private/worker-key.pem',
    'certfile': '/var/ssl/amqp-server-cert.pem',
    'ca_certs': '/var/ssl/myca.pem',
    'cert_reqs': ssl.CERT_REQUIRED
}
Warning
Be careful using broker_use_ssl=True
. It’s possible that your default
configuration won’t validate the server cert at all. Please read Python
ssl module security
considerations.
broker_pool_limit
¶
New in version 2.3.
Default: 10.
The maximum number of connections that can be open in the connection pool.
The pool is enabled by default since version 2.5, with a default limit of ten connections. This number can be tweaked depending on the number of threads/green-threads (eventlet/gevent) using a connection. For example, if you're running eventlet with 1000 greenlets that each use a connection to the broker, contention can arise and you should consider increasing the limit.
If set to None
or 0 the connection pool will be disabled and
connections will be established and closed for every use.
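For example, a worker running many eventlet/gevent green threads might raise the limit (value illustrative):
broker_pool_limit = 100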
broker_connection_timeout
¶
Default: 4.0.
The default timeout in seconds before we give up establishing a connection to the AMQP server. This setting is disabled when using gevent.
broker_connection_retry
¶
Default: Enabled.
Automatically try to re-establish the connection to the AMQP broker if lost.
The time between retries is increased for each retry, and is
not exhausted before broker_connection_max_retries
is
exceeded.
broker_connection_max_retries
¶
Default: 100.
Maximum number of retries before we give up re-establishing a connection to the AMQP broker.
If this is set to 0
or None
, we’ll retry forever.
broker_transport_options
¶
New in version 2.2.
Default: {}
(empty mapping).
A dict of additional options passed to the underlying transport.
See your transport user manual for supported options (if any).
Example setting the visibility timeout (supported by Redis and SQS transports):
broker_transport_options = {'visibility_timeout': 18000} # 5 hours
Worker¶
imports
¶
Default: []
(empty list).
A sequence of modules to import when the worker starts.
This is used to specify the task modules to import, but also to import signal handlers and additional remote control commands, etc.
The modules will be imported in the original order.
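For example (module names illustrative):
imports = ('myproj.tasks', 'myproj.signal_handlers')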
include
¶
Default: []
(empty list).
Exact same semantics as imports
, but can be used as a means
to have different import categories.
The modules in this setting are imported after the modules in
imports
.
worker_concurrency
¶
Default: Number of CPU cores.
The number of concurrent worker processes/threads/green threads executing tasks.
If you’re doing mostly I/O you can have more processes, but if mostly CPU-bound, try to keep it close to the number of CPUs on your machine. If not set, the number of CPUs/cores on the host will be used.
worker_prefetch_multiplier
¶
Default: 4.
How many messages to prefetch at a time multiplied by the number of concurrent processes. The default is 4 (four messages for each process). The default setting is usually a good choice, however – if you have very long running tasks waiting in the queue and you have to start the workers, note that the first worker to start will receive four times the number of messages initially. Thus the tasks may not be fairly distributed to the workers.
To disable prefetching, set worker_prefetch_multiplier
to 1.
Changing that setting to 0 will allow the worker to keep consuming
as many messages as it wants.
For more on prefetching, read Prefetch Limits
Note
Tasks with ETA/countdown aren’t affected by prefetch limits.
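For example, a common sketch for long-running tasks is to disable prefetching and acknowledge late, so each worker process holds at most one unacknowledged message:
worker_prefetch_multiplier = 1
task_acks_late = True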
worker_lost_wait
¶
Default: 10.0 seconds.
In some cases a worker may be killed without proper cleanup,
and the worker may have published a result before terminating.
This value specifies how long we wait for any missing results before
raising a WorkerLostError
exception.
worker_max_tasks_per_child
¶
Maximum number of tasks a pool worker process can execute before it’s replaced with a new one. Default is no limit.
worker_max_memory_per_child
¶
Default: No limit. Type: int (kilobytes)
Maximum amount of resident memory, in kilobytes, that may be consumed by a worker before it will be replaced by a new worker. If a single task causes a worker to exceed this limit, the task will be completed, and the worker will be replaced afterwards.
Example:
worker_max_memory_per_child = 12000 # 12MB
worker_disable_rate_limits
¶
Default: Disabled (rate limits enabled).
Disable all rate limits, even if tasks have explicit rate limits set.
worker_state_db
¶
Default: None
.
Name of the file used to store persistent worker state (like revoked tasks). Can be a relative or absolute path, but be aware that the suffix .db may be appended to the file name (depending on Python version).
Can also be set via the celery worker --statedb
argument.
worker_timer_precision
¶
Default: 1.0 seconds.
Set the maximum time in seconds that the ETA scheduler can sleep between rechecking the schedule.
Setting this value to 1 second means the scheduler's precision will be 1 second. If you need near millisecond precision you can set this to 0.1.
worker_enable_remote_control
¶
Default: Enabled by default.
Specify if remote control of the workers is enabled.
Events¶
worker_send_task_events
¶
Default: Disabled by default.
Send task-related events so that tasks can be monitored using tools like
flower. Sets the default value for the workers
-E
argument.
task_send_sent_event
¶
New in version 2.2.
Default: Disabled by default.
If enabled, a task-sent
event will be sent for every task so tasks can be
tracked before they’re consumed by a worker.
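For example, to enable full event monitoring from the configuration side (both worker and sent events):
worker_send_task_events = True
task_send_sent_event = True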
event_queue_ttl
¶
Transports supported: amqp
Default: 5.0 seconds.
Message expiry time in seconds (int/float) for when messages sent to a monitor client's event queue are deleted (x-message-ttl).
For example, if this value is set to 10 then a message delivered to this queue will be deleted after 10 seconds.
event_queue_expires
¶
Transports supported: amqp
Default: 60.0 seconds.
Expiry time in seconds (int/float) for when a monitor client's event queue will be deleted (x-expires).
event_serializer
¶
Default: "json"
.
Message serialization format used when sending event messages.
See also Serializers for information about supported serialization formats.
Remote Control Commands¶
Note
To disable remote control commands see
the worker_enable_remote_control
setting.
control_queue_ttl
¶
Default: 300.0
Time in seconds, before a message in a remote control command queue will expire.
If using the default of 300 seconds, this means that if a remote control command is sent and no worker picks it up within 300 seconds, the command is discarded.
This setting also applies to remote control reply queues.
control_queue_expires
¶
Default: 10.0
Time in seconds, before an unused remote control command queue is deleted from the broker.
This setting also applies to remote control reply queues.
Logging¶
worker_hijack_root_logger
¶
New in version 2.2.
Default: Enabled by default (hijack root logger).
By default any previously configured handlers on the root logger will be removed. If you want to customize your own logging handlers, then you can disable this behavior by setting worker_hijack_root_logger = False.
Note
Logging can also be customized by connecting to the
celery.signals.setup_logging
signal.
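A minimal sketch of taking over logging configuration through that signal (the config file path is hypothetical):
import logging.config

from celery.signals import setup_logging

@setup_logging.connect
def configure_logging(**kwargs):
    # Celery skips its own logging setup when this signal has a receiver.
    logging.config.fileConfig('/etc/myapp/logging.conf')  # hypothetical path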
worker_log_color
¶
Default: Enabled if app is logging to a terminal.
Enables/disables colors in logging output by the Celery apps.
worker_log_format
¶
Default:
"[%(asctime)s: %(levelname)s/%(processName)s] %(message)s"
The format to use for log messages.
See the Python logging
module for more information about log
formats.
worker_task_log_format
¶
Default:
"[%(asctime)s: %(levelname)s/%(processName)s]
[%(task_name)s(%(task_id)s)] %(message)s"
The format to use for log messages logged in tasks.
See the Python logging
module for more information about log
formats.
worker_redirect_stdouts
¶
Default: Enabled by default.
If enabled stdout and stderr will be redirected to the current logger.
Used by celery worker and celery beat.
worker_redirect_stdouts_level
¶
Default: WARNING
.
The log level output to stdout and stderr is logged as.
Can be one of DEBUG
, INFO
, WARNING
,
ERROR
, or CRITICAL
.
Security¶
security_key
¶
Default: None
.
New in version 2.5.
The relative or absolute path to a file containing the private key used to sign messages when Message Signing is used.
security_certificate
¶
Default: None
.
New in version 2.5.
The relative or absolute path to an X.509 certificate file used to sign messages when Message Signing is used.
security_cert_store
¶
Default: None
.
New in version 2.5.
The directory containing X.509 certificates used for Message Signing. Can be a glob with wild-cards (for example /etc/certs/*.pem).
Custom Component Classes (advanced)¶
worker_pool
¶
Default: "prefork"
(celery.concurrency.prefork:TaskPool
).
Name of the pool class used by the worker.
Eventlet/Gevent
Never use this option to select the eventlet or gevent pool.
You must use the -P
option to
celery worker instead, to ensure the monkey patches
aren’t applied too late, causing things to break in strange ways.
worker_pool_restarts
¶
Default: Disabled by default.
If enabled the worker pool can be restarted using the
pool_restart
remote control command.
worker_autoscaler
¶
New in version 2.2.
Default: "celery.worker.autoscale:Autoscaler"
.
Name of the autoscaler class to use.
worker_consumer
¶
Default: "celery.worker.consumer:Consumer"
.
Name of the consumer class used by the worker.
worker_timer
¶
Default: "kombu.async.hub.timer:Timer"
.
Name of the ETA scheduler class used by the worker. The default is set by the pool implementation.
Beat Settings (celery beat)¶
beat_scheduler
¶
Default: "celery.beat:PersistentScheduler"
.
The default scheduler class. May be set to "django_celery_beat.schedulers:DatabaseScheduler" for instance, when used alongside the django-celery-beat extension.
Can also be set via the celery beat -S
argument.
beat_schedule_filename
¶
Default: "celerybeat-schedule"
.
Name of the file used by PersistentScheduler to store the last run times of periodic tasks. Can be a relative or absolute path, but be aware that the suffix .db may be appended to the file name (depending on Python version).
Can also be set via the celery beat --schedule
argument.
beat_sync_every
¶
Default: 0.
The number of periodic tasks that can be called before another database sync is issued. A value of 0 (default) means sync based on timing - default of 3 minutes as determined by scheduler.sync_every. If set to 1, beat will call sync after every task message sent.
beat_max_loop_interval
¶
Default: 0.
The maximum number of seconds beat
can sleep
between checking the schedule.
The default for this value is scheduler specific. For the default Celery beat scheduler the value is 300 (5 minutes), but for the django-celery-beat database scheduler it’s 5 seconds because the schedule may be changed externally, and so it must take changes to the schedule into account.
Also when running Celery beat embedded (-B
)
on Jython as a thread the max interval is overridden and set to 1 so
that it’s possible to shut down in a timely manner.