Troubleshooting Slack API Rate Limit (HTTP 429 Too Many Requests) and Timeout Errors
Resolve Slack API rate limit 429 Too Many Requests and timeout errors by implementing exponential backoff, respecting Retry-After headers, and optimizing API call patterns.
- HTTP 429 Too Many Requests occurs when your application exceeds Slack's tiered rate limits (Tier 1 to Tier 4) for specific Web API methods.
- The most critical diagnostic step is reading the 'Retry-After' HTTP header, which explicitly tells you how many seconds to wait before resuming requests.
- Slack API timeouts (often seen as client-side timeouts or HTTP 503/504) frequently happen when attempting to send excessively large payloads or making too many concurrent connections.
- Quick Fix: Implement a retry interceptor that pauses execution for the duration specified in the Retry-After header.
- Long-term Fix: Move from aggressive polling to the Events API, cache frequently accessed data (like user lists), and implement task queues (e.g., Redis/Celery or SQS) for outbound messages.
| Method | When to Use | Implementation Time | Risk / Scalability |
|---|---|---|---|
| Synchronous Sleep (Retry-After) | Simple scripts, low-volume bots, CLI tools | Low (< 1 hour) | High risk of blocking threads; poor scalability |
| Exponential Backoff with Jitter | General API wrappers, mid-tier applications | Medium (1-2 hours) | Medium risk; prevents thundering herd but doesn't solve high throughput needs |
| Asynchronous Message Queues | Enterprise apps, high-volume notifications, broadcast bots | High (Days) | Low risk; highly scalable, decouples app logic from API constraints |
| Caching & Payload Optimization | When frequently fetching users/channels or seeing timeouts | Medium (Hours) | Low risk; drastically reduces API footprint and prevents timeouts |
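The backoff-with-jitter row above can be sketched in a few lines. This is a minimal illustration; the 1-second base and 30-second cap are illustrative assumptions, not Slack-mandated values.

```python
import random

def backoff_delay(attempt, base=1.0, cap=30.0):
    """Exponential backoff with full jitter.

    Returns a random delay in [0, min(cap, base * 2**attempt)] so that
    many clients retrying at once do not wake up in lockstep.
    """
    return random.uniform(0, min(cap, base * (2 ** attempt)))
```

In practice you would `time.sleep(backoff_delay(attempt))` between retries; the randomness is what prevents the thundering-herd effect the table mentions.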
Understanding the Error
When developing integrations or operating production bots with the Slack API, encountering rate limits is a mathematical certainty as your user base or activity grows. The most explicit symptom is the HTTP 429 Too Many Requests status code. However, poorly managed rate limits can also manifest as Slack API timeout errors, either because your client's connection pool is exhausted while waiting for blocked requests, or because Slack's ingestion nodes drop connections during extreme burst traffic.
Slack does not have a single global rate limit. Instead, it employs a tiered rate limiting system where different Web API methods are assigned to different tiers (Tier 1 through Tier 4, plus special tiers). For example, posting a message (chat.postMessage) is governed by a special limit of roughly 1 message per second per channel (with short bursts tolerated), while fetching a user list (users.list) is a Tier 2 method allowing roughly 20 requests per minute.
When you exceed these limits, Slack's API gateway intercepts the request and returns a 429 status. Crucially, this response includes a Retry-After HTTP header. This header contains an integer representing the exact number of seconds your application must wait before making another request to that specific endpoint.
Step 1: Diagnose
The first step in troubleshooting a 429 Too Many Requests or timeout issue is isolating the offending API call and understanding the traffic pattern.
1. Analyze the Headers and Payload:
If you are simply logging the error message (e.g., Error: Slack API returned 429), you are missing the most critical piece of debugging data. You must log the HTTP response headers. The Retry-After header is your immediate diagnostic tool. If Retry-After is 1, you are likely slightly exceeding a Tier 4 limit. If Retry-After is 300 (5 minutes) or higher, you have severely violated a lower-tier limit or hit a special burst limit (like the workspace-wide burst limit).
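A sketch of that logging discipline, decoupled from any particular HTTP client (the status code and headers come from whatever client you use; the function name is illustrative):

```python
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

def diagnose_rate_limit(status_code, headers):
    """Return the Retry-After wait (seconds) for a 429 response, else None.

    A Retry-After of 1 suggests you barely exceeded a high-tier limit;
    values of 300+ indicate a severe or burst-limit violation.
    """
    if status_code != 429:
        return None
    retry_after = int(headers.get("Retry-After", 1))
    logger.warning("Rate limited; Slack asks for a %d second pause", retry_after)
    return retry_after
```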
2. Identify the Tier and Endpoint: Review your application logs to determine which Slack API method is failing.
- Are you calling users.info inside a loop for every message received? (A classic anti-pattern leading to Tier 4 exhaustion.)
- Are you attempting to broadcast a message to 1,000 users concurrently?
- Are you polling conversations.history every second instead of using the Events API?
3. Differentiate Between 429s and Timeouts:
If you are seeing timeouts (e.g., requests.exceptions.ReadTimeout in Python or ETIMEDOUT in Node.js), this might not be a direct rate limit response. Timeouts often occur when:
- Your application is sending a massive payload (e.g., a heavily nested Block Kit UI with thousands of elements).
- Your HTTP client's connection pool is exhausted because previous requests are hanging or being intentionally delayed by rudimentary retry logic.
- DNS resolution failures or proxy bottlenecks occur within your own infrastructure under high load.
Step 2: Immediate Fixes (Tactical)
If your production environment is currently failing due to rate limits, you need immediate tactical mitigation to restore service.
1. Respect the Retry-After Header (The "Stop Bleeding" Fix):
Update your API client to trap the 429 status code, read the Retry-After header, and halt execution for that specific thread or process. If you are using an official Slack SDK (like @slack/web-api for Node.js or slack_sdk for Python), this logic is often built-in but might need to be explicitly enabled or configured via retry policies. If you are making raw HTTP requests, you must implement this interceptor immediately. Failure to respect the Retry-After header and continuing to hammer the API will result in longer bans and potentially workspace-level rate limit locks.
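For raw HTTP clients, the interceptor can be a small wrapper. A sketch assuming only that the client returns an object exposing .status_code and .headers (requests and httpx responses both qualify):

```python
import time

def respect_retry_after(send, max_retries=3):
    """Wrap a request callable so 429 responses pause for Retry-After seconds.

    `send` is any callable returning a response-like object with
    .status_code and .headers attributes.
    """
    def wrapped(*args, **kwargs):
        resp = send(*args, **kwargs)
        for _ in range(max_retries):
            if resp.status_code != 429:
                break
            # Honor Slack's mandated pause before retrying
            time.sleep(int(resp.headers.get("Retry-After", 1)))
            resp = send(*args, **kwargs)
        return resp
    return wrapped
```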
2. Halt Bulk Operations: If a background cron job (e.g., a daily report generator) is causing the 429s, pause the job immediately. Bulk operations often consume the entire rate limit bucket, starving critical, synchronous user interactions (like responding to a slash command).
Step 3: Long-Term Architecture Fixes (Strategic)
To permanently resolve Slack API rate limits and timeouts, you must move from tactical retries to strategic traffic management.
1. Transition from Polling to the Events API:
If your application calls endpoints like conversations.history or users.list frequently to detect state changes, you are utilizing an anti-pattern. Slack's Events API is a push-based model. Instead of asking Slack "Did anything happen?" every 5 seconds, you register a webhook, and Slack POSTs a JSON payload to your server the millisecond an event occurs. This drops your API read requests to near zero.
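Under the Events API, your receiver only needs to handle two payload types: the one-time url_verification handshake and subsequent event_callback envelopes. A framework-agnostic sketch (handle_slack_payload would be called by your web layer with the parsed JSON body; process_message is a hypothetical downstream handler):

```python
def process_message(event):
    # Hypothetical handler for pushed message events
    print(f"message in {event.get('channel')}: {event.get('text')}")

def handle_slack_payload(payload):
    """Return the response body for a POST from Slack's Events API.

    Slack sends a url_verification payload once when the webhook URL is
    registered, then event_callback envelopes for each subscribed event.
    """
    if payload.get("type") == "url_verification":
        # Echo the challenge so Slack confirms ownership of the endpoint
        return {"challenge": payload["challenge"]}
    if payload.get("type") == "event_callback":
        event = payload.get("event", {})
        if event.get("type") == "message":
            process_message(event)
    return {"ok": True}
```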
2. Implement a Task Queue for Outbound Traffic:
For applications that send many messages (e.g., alert aggregators, CI/CD notification bots), synchronous API calls are dangerous. Implement a message broker like RabbitMQ, Redis (via Celery/Sidekiq), or AWS SQS.
When your app needs to send a Slack message, it publishes a job to the queue. A background worker consumes the queue at a controlled rate (e.g., 1 message per second per channel, matching chat.postMessage's posting limit). If the worker encounters a 429, it simply pauses processing or requeues the message with a delay equal to the Retry-After value. This completely isolates your core application from Slack's rate limits and eliminates client-side timeouts caused by blocked threads.
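A minimal in-process sketch of this worker pattern. A production deployment would use Redis/SQS and a real Slack client; here send_fn stands in for the API call and is assumed to return a dict with a "status" key (plus "Retry-After" on 429s):

```python
import queue
import time

def run_worker(jobs, send_fn, rate_per_sec=1.0):
    """Drain a job queue at a controlled rate, honoring 429 backpressure."""
    interval = 1.0 / rate_per_sec
    while True:
        try:
            payload = jobs.get_nowait()
        except queue.Empty:
            break  # queue drained; worker goes idle
        resp = send_fn(payload)
        if resp.get("status") == 429:
            # Pause for the mandated delay, then requeue for another attempt
            time.sleep(int(resp.get("Retry-After", 1)))
            jobs.put(payload)
        time.sleep(interval)  # pace outbound traffic below the rate limit
```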
3. Caching State and Identifiers:
A major source of rate limiting is redundant identity lookups. Applications often receive an email address and call users.lookupByEmail to get a Slack User ID, then call chat.postMessage. If you process 100 alerts for the same user, that's 100 redundant lookups. Implement a local cache (in-memory or Redis) to map internal user identifiers or emails to Slack User IDs and Channel IDs. Update this cache asynchronously or via the Events API (user_change events).
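A sketch of such a cache using only the standard library (lookup_fn stands in for a users.lookupByEmail call; the class name and one-hour TTL default are illustrative):

```python
import time

class SlackIdCache:
    """Cache email -> Slack user ID mappings with a TTL, cutting lookup calls."""

    def __init__(self, ttl=3600.0):
        self.ttl = ttl
        self._store = {}  # email -> (user_id, expiry timestamp)

    def get_user_id(self, email, lookup_fn):
        entry = self._store.get(email)
        now = time.monotonic()
        if entry is not None and entry[1] > now:
            return entry[0]  # cache hit: no API call issued
        user_id = lookup_fn(email)  # cache miss: one users.lookupByEmail call
        self._store[email] = (user_id, now + self.ttl)
        return user_id

    def invalidate(self, email):
        # Call this from a user_change event handler to keep the cache fresh
        self._store.pop(email, None)
```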
4. Optimizing Payloads to Prevent Timeouts: If you are experiencing timeouts rather than 429s, inspect your payload size. Slack's Block Kit is powerful but verbose. Ensure you are not sending thousands of blocks in a single message. Paginate your output, use thread replies for extended context, and ensure your HTTP client has sensible read and connect timeouts (e.g., 5-10 seconds) rather than waiting indefinitely. If your client times out, ensure your retry logic includes exponential backoff to prevent a thundering herd of retries from worsening the network congestion.
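Slack caps a single message at 50 layout blocks, so a simple defensive measure is to chunk long block lists before sending. A sketch; sending each chunk as a thread reply keeps extended context together:

```python
def chunk_blocks(blocks, max_per_message=50):
    """Split a long Block Kit block list into message-sized chunks.

    Slack rejects messages that exceed the per-message block cap, so each
    chunk should be sent as its own message (or as thread replies).
    """
    return [blocks[i:i + max_per_message] for i in range(0, len(blocks), max_per_message)]
```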
Reference Implementation (Python)
The following function combines the tactical fixes above: it respects Retry-After on 429 responses and applies exponential backoff to client-side timeouts.
```python
import time
import logging
import requests
from requests.exceptions import RequestException

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

def post_slack_message_with_retry(webhook_url, payload, max_retries=3):
    """
    Sends a message to Slack while respecting the HTTP 429 Retry-After header
    and handling potential timeouts with exponential backoff.
    """
    attempt = 0
    backoff_factor = 2
    while attempt < max_retries:
        try:
            # Set a strict timeout to prevent hanging threads
            response = requests.post(webhook_url, json=payload, timeout=5.0)
            if response.status_code == 200:
                logger.info("Message posted successfully.")
                return True
            if response.status_code == 429:
                # Crucial step: extract the Retry-After header
                retry_after = int(response.headers.get('Retry-After', 1))
                logger.warning(f"Rate limited (429). Sleeping for {retry_after} seconds as requested by Slack.")
                time.sleep(retry_after)
                attempt += 1
                continue
            # Handle other HTTP errors
            response.raise_for_status()
        except requests.exceptions.Timeout:
            # Handle client-side timeouts with exponential backoff
            sleep_time = backoff_factor ** attempt
            logger.error(f"Timeout occurred. Retrying in {sleep_time} seconds...")
            time.sleep(sleep_time)
            attempt += 1
        except RequestException as e:
            logger.error(f"Network or HTTP error: {e}")
            break
    logger.error("Max retries exceeded. Failed to deliver Slack message.")
    return False

# Example usage
# post_slack_message_with_retry('https://hooks.slack.com/services/T000/B000/XXX', {'text': 'Hello World'})
```
Error Medic Editorial
Error Medic Editorial is composed of senior Site Reliability Engineers and DevOps practitioners dedicated to solving complex infrastructure and API integration challenges.