How To Use Celery for Background Tasks in Python
Install Celery with `pip install celery redis`, define tasks with the `@app.task` decorator, and call them with `.delay()` or `.apply_async()` for non-blocking execution. Use Redis or RabbitMQ as the message broker, and monitor tasks with the Flower web interface.
Understanding Celery and Background Tasks
Celery is a powerful asynchronous task queue system for Python that enables you to execute long-running operations without blocking your main application. Instead of making users wait for resource-intensive tasks to complete, you can defer them to background workers that process jobs independently.
Background tasks are essential for modern web applications. Without them, uploading a large file, sending emails, or processing videos would freeze your user interface. Celery solves this by allowing tasks to run in separate processes and even on different machines.
Installing and Setting Up Celery with Redis
First, install Celery along with Redis, which serves as the message broker:
```shell
pip install celery redis
```
Next, install and run Redis server. On macOS with Homebrew:
```shell
brew install redis
redis-server
```
On Ubuntu/Debian:
```shell
sudo apt-get install redis-server
sudo systemctl start redis-server
```
Verify Redis is running by connecting with the Redis CLI:
```shell
redis-cli ping
# Output: PONG
```
Creating Your First Celery Application
Create a file called celery_app.py to define your Celery application and tasks:
```python
from celery import Celery

# Create the Celery instance
app = Celery('myapp')

# Configure Celery to use Redis as broker and result backend
app.conf.update(
    broker_url='redis://localhost:6379/0',
    result_backend='redis://localhost:6379/0',
    task_serializer='json',
    accept_content=['json'],
    result_serializer='json',
    timezone='UTC',
    enable_utc=True,
)

# Define a simple task
@app.task
def add(x, y):
    return x + y

# Define a task that takes longer
@app.task
def send_email(email, subject, body):
    import time
    time.sleep(2)  # Simulate email sending delay
    print(f"Email sent to {email}: {subject}")
    return {"status": "sent", "email": email}
```
Calling Tasks Asynchronously
Now create a main application file that uses these tasks:
```python
from celery_app import add, send_email

# Method 1: .delay() for simple calls
result = add.delay(4, 6)
print(f"Task ID: {result.id}")
print(f"Task state: {result.state}")

# Method 2: .apply_async() for advanced options
result = send_email.apply_async(
    args=('user@example.com', 'Hello', 'Welcome!'),
    countdown=10  # Execute after 10 seconds
)

# Method 3: Get the task result (blocking)
print(f"Result: {result.get()}")  # Waits until the task completes

# Method 4: Non-blocking result check
if result.ready():
    print(f"Task completed: {result.result}")
else:
    print("Task still processing...")
```
The .delay() method is shorthand for .apply_async() with default options. For finer control, use .apply_async() with parameters such as:
- countdown: delay execution by N seconds
- expires: the task expires if not executed within N seconds
- priority: task priority (support and ordering depend on the broker)
- retry: retry publishing the message if the broker connection fails
Starting Celery Workers
Celery workers are separate processes that consume and execute tasks from the queue. Start a worker with:
```shell
celery -A celery_app worker --loglevel=info
```
You should see startup output like this (banner abridged):

```
 -------------- celery@MacBook-Pro.local v5.3.0 (emerald-rush)
--- ***** -----
-- ******* ----
...

[queues]
  .> celery           exchange=celery(direct) key=celery

[tasks]
  . celery_app.add
  . celery_app.send_email

[2024-01-15 10:30:45,123: WARNING/MainProcess] celery@MacBook-Pro.local ready.
```
For production, increase the number of worker processes with the --concurrency option (the default is the number of CPU cores):

```shell
celery -A celery_app worker --loglevel=info --concurrency=4
```
Task Chains, Groups, and Chords
Celery provides powerful constructs for composing complex workflows:
```python
from celery import chain, group, chord
from celery_app import add

# Chain: execute tasks sequentially; each result is passed as the
# first argument of the next signature
workflow = chain(add.s(2, 2), add.s(4), add.s(8))
result = workflow.apply_async()
print(result.get())  # Output: 16 (((2 + 2) + 4) + 8)

# Group: execute tasks in parallel
job = group(add.s(2, 2), add.s(4, 4), add.s(8, 8))
result = job.apply_async()
print(result.get())  # Output: [4, 8, 16]

# Chord: run parallel tasks, then pass the list of results to a
# callback. The callback must accept a list (add.s(0) would fail),
# so assume a summing task defined alongside add in celery_app.py:
#     @app.task
#     def tsum(numbers):
#         return sum(numbers)
from celery_app import tsum

header = group(add.s(2, 2), add.s(4, 4), add.s(8, 8))
result = chord(header)(tsum.s())
print(result.get())  # Output: 28 (4 + 8 + 16)
```
These constructs allow you to:
- Chain: Pass results between tasks sequentially
- Group: Execute multiple tasks in parallel
- Chord: Run parallel tasks, then process combined results
Error Handling and Retries
Celery provides built-in retry mechanisms for fault tolerance:
```python
from celery import Celery
import requests

app = Celery('myapp')
app.conf.broker_url = 'redis://localhost:6379/0'

@app.task(bind=True, max_retries=3)
def fetch_data(self, url):
    try:
        response = requests.get(url, timeout=5)
        response.raise_for_status()
        return response.json()
    except requests.ConnectionError as exc:
        # Retry with exponential backoff: 1s, 2s, 4s, ...
        raise self.retry(exc=exc, countdown=2 ** self.request.retries)
    except requests.Timeout as exc:
        # self.retry raises MaxRetriesExceededError on its own once
        # max_retries is exhausted, so no manual check is needed
        raise self.retry(exc=exc, countdown=60)

# Call with automatic retries
result = fetch_data.delay('https://api.example.com/data')
```
Key retry features:
- max_retries: maximum number of retry attempts
- countdown: seconds to wait before retrying
- autoretry_for: tuple of exception types to retry automatically
- bind=True: gives the task access to self for retry logic
Periodic Tasks with Celery Beat
Schedule recurring tasks using Celery Beat:
```python
from celery import Celery
from celery.schedules import crontab
import datetime

app = Celery('myapp')
app.conf.broker_url = 'redis://localhost:6379/0'

# Configure periodic tasks. The 'task' values must match the
# registered task names (module.function), here celery_app.py.
app.conf.beat_schedule = {
    'send-report-every-hour': {
        'task': 'celery_app.send_report',
        'schedule': 3600.0,  # Every 3600 seconds (1 hour)
    },
    'cleanup-database-daily': {
        'task': 'celery_app.cleanup_old_data',
        'schedule': crontab(hour=2, minute=0),  # Daily at 2 AM
    },
    'sync-data-every-30-minutes': {
        'task': 'celery_app.sync_external_api',
        'schedule': 1800.0,  # Every 30 minutes
    },
}

@app.task
def send_report():
    print(f"Report generated at {datetime.datetime.now()}")
    return "Report sent"

@app.task
def cleanup_old_data():
    print("Cleaning up old database records")
    # Delete old records here
    return "Cleanup complete"

@app.task
def sync_external_api():
    print("Syncing with external API")
    return "Sync complete"
```
Start the Celery Beat scheduler in its own process:

```shell
celery -A celery_app beat --loglevel=info
```

For development, you can run the worker and beat together in one process:

```shell
celery -A celery_app worker --beat --loglevel=info
```
Monitoring Tasks with Flower
Flower is a real-time web-based monitoring tool for Celery. Install and run it:
```shell
pip install flower
celery -A celery_app flower
```
Access Flower at http://localhost:5555 to see:
- Active tasks and their progress
- Worker status and capacity
- Task execution history and statistics
- Real-time task graphs
- Task failure details
You can also programmatically inspect tasks:
```python
from celery import Celery

app = Celery('myapp')
app.conf.broker_url = 'redis://localhost:6379/0'

# Get an inspection object via the app's control interface
i = app.control.inspect()

# Get active tasks
active = i.active()
print(f"Active tasks: {active}")

# Get registered tasks
registered = i.registered()
print(f"Registered tasks: {registered}")

# Get worker stats
stats = i.stats()
print(f"Worker stats: {stats}")
```
Troubleshooting Common Celery Issues
| Issue | Cause | Solution |
|---|---|---|
| Worker not processing tasks | Worker not running or Redis connection failed | Verify Redis is running: `redis-cli ping`. Start the worker: `celery -A celery_app worker --loglevel=info` |
| Task remains in PENDING state | Task never executed or worker crashed | Check worker logs, verify the broker connection, ensure the task is properly registered |
| Redis connection error | Redis not running or wrong connection string | Check Redis is running, verify `broker_url` in the Celery config |
| Task results not stored | Result backend not configured | Set `result_backend` in the Celery config to Redis or another backend |
| High memory usage | Too many tasks queued or workers consuming memory | Limit prefetch with `--prefetch-multiplier=1`, adjust concurrency |
| Tasks timing out | Task execution exceeds the time limit | Increase `task_time_limit` or optimize the task code |
Real-Life Example: Email Sending Queue
Here’s a fuller example of an email sending queue system (the SMTP credentials are placeholders):

```python
from celery import Celery, group
import smtplib
from email.mime.text import MIMEText
from email.mime.multipart import MIMEMultipart

app = Celery('email_app')
app.conf.update(
    broker_url='redis://localhost:6379/0',
    result_backend='redis://localhost:6379/0',
    task_serializer='json',
    accept_content=['json'],
    result_serializer='json',
)

@app.task(bind=True, max_retries=5)
def send_email_task(self, recipient, subject, body, html=None):
    """Send an email with automatic retries."""
    try:
        sender = 'noreply@example.com'
        password = 'your_app_password'  # Load from an env var in production

        msg = MIMEMultipart('alternative')
        msg['Subject'] = subject
        msg['From'] = sender
        msg['To'] = recipient

        # Attach text and HTML versions
        msg.attach(MIMEText(body, 'plain'))
        if html:
            msg.attach(MIMEText(html, 'html'))

        # Send the email
        with smtplib.SMTP_SSL('smtp.gmail.com', 465) as server:
            server.login(sender, password)
            server.sendmail(sender, recipient, msg.as_string())

        return {
            'status': 'success',
            'recipient': recipient,
            'subject': subject
        }
    except smtplib.SMTPException as exc:
        # Retry with exponential backoff, capped at 10 minutes
        raise self.retry(
            exc=exc,
            countdown=min(600, 2 ** self.request.retries)
        )

@app.task
def send_bulk_emails(recipients, subject, body):
    """Send emails to multiple recipients in parallel."""
    job = group(
        send_email_task.s(recipient, subject, body)
        for recipient in recipients
    )
    result = job.apply_async()
    return result.id

# Usage example
if __name__ == '__main__':
    # Send a single email
    result = send_email_task.delay(
        'user@example.com',
        'Welcome!',
        'Thanks for signing up!'
    )
    print(f"Task ID: {result.id}")

    # Send bulk emails
    recipients = [
        'user1@example.com',
        'user2@example.com',
        'user3@example.com',
    ]
    bulk_result = send_bulk_emails.delay(
        recipients,
        'Newsletter',
        'Check out our latest updates!'
    )
    print(f"Bulk send task ID: {bulk_result.id}")
```
This example demonstrates:
- Email sending with automatic retries and exponential backoff
- Both single and bulk email sending
- Error handling for SMTP failures
- Parallel task execution for multiple recipients
- Task result tracking and retrieval
Best Practices for Celery in Production
When deploying Celery to production, follow these guidelines:
- Use a robust broker: RabbitMQ for mission-critical tasks, Redis for lighter workloads
- Set a result TTL: configure result_expires to clean up old results automatically
- Monitor workers: use Flower or external monitoring to track worker health
- Use task routing: route different tasks to different workers for better resource allocation
- Implement task timeouts: set task_soft_time_limit and task_time_limit to prevent hanging tasks
- Pin the message protocol: use task_protocol to handle backward compatibility during upgrades
- Log comprehensively: configure proper logging for debugging production issues
- Test retries: verify retry logic works correctly before deploying
FAQ
Q: What is the difference between Celery and threading in Python?
A: Celery is a distributed task queue that runs tasks in separate worker processes and can scale across multiple machines, while threading runs tasks inside the same process, where Python’s GIL prevents CPU-bound threads from running in parallel. Celery is the better fit for long-running, CPU-intensive, or distributed work; threading remains fine for lightweight I/O-bound concurrency.
Q: Can Celery run tasks on different machines?
A: Yes, Celery is designed for distributed computing. You can run workers on different servers, and they will all consume tasks from the same message broker for true scalability.
Q: What message brokers does Celery support?
A: Celery supports RabbitMQ, Redis, AWS SQS, and others. RabbitMQ is recommended for production, while Redis is suitable for development and lighter workloads.
Q: How do I handle task failures in Celery?
A: Use the @app.task decorator with max_retries and implement retry logic using self.retry(). You can also use error callbacks with apply_async() to execute functions when tasks fail.
Q: Is Celery suitable for real-time processing?
A: Celery is better for background tasks with some latency tolerance. For true real-time processing, consider message streams like Kafka alongside Celery.