Error Medic

Resolving Slack API HTTP 429 Rate Limits and Timeout Errors

Comprehensive guide to fixing Slack API 429 Too Many Requests and timeout errors. Learn tier limits, Retry-After handling, and event queueing architectures.

Key Takeaways
  • Slack enforces Tier 1 to Tier 4 rate limits, ranging from 1 to 100 requests per minute depending on the specific API method.
  • HTTP 429 'Too Many Requests' responses include a mandatory 'Retry-After' header indicating how many seconds to wait.
  • Ignoring the 'Retry-After' header and continuing to poll will result in prolonged temporary bans or app suspension.
  • Slack Events API requires an HTTP 200 acknowledgment within 3 seconds; failure to do so causes Slack to retry, exacerbating load.
  • Implement asynchronous processing (e.g., Celery, SQS) and exponential backoff to ensure reliable message delivery during traffic bursts.
Remediation Strategies for Slack API Rate Limits
| Method | When to Use | Implementation Time | Risk Profile |
| --- | --- | --- | --- |
| Retry-After Polling | Simple scripts or low-volume bots hitting occasional 429s | Low (< 1 hour) | Low |
| Exponential Backoff | Handling "slack api timeout" network instability or transient errors | Low (1-2 hours) | Low |
| Asynchronous Queues (Redis/SQS) | High-volume enterprise Slack apps with heavy Events API traffic | High (Days) | Medium (architecture change) |
| Caching Profiles/Channels | Apps making redundant calls to users.info or conversations.list | Medium (Hours) | Low |

Understanding Slack API Rate Limits and 429 Errors

When developing integrations, bots, or enterprise applications on top of the Slack API, encountering rate limits is a rite of passage. Slack protects its infrastructure using a strict, tier-based rate-limiting system. When your application exceeds the allowed number of requests for a specific tier, Slack rejects subsequent requests, returning an HTTP 429 Too Many Requests status code along with a specific JSON payload: {"ok": false, "error": "ratelimited"}.

Simultaneously, developers often experience generic "Slack API timeouts" (e.g., ReadTimeoutError or requests.exceptions.Timeout). While sometimes caused by network jitter, these timeouts frequently occur when a client library silently retries rate-limited requests behind the scenes, or when your application's connection pool is exhausted while requests wait for rate-limit windows to clear.

The Anatomy of the Error

When a rate limit is hit, the HTTP response headers contain the most critical piece of debugging information: the Retry-After header.

HTTP/1.1 429 Too Many Requests
Retry-After: 30
Content-Type: application/json; charset=utf-8

{
  "ok": false,
  "error": "ratelimited"
}

The Retry-After: 30 header explicitly tells your application to halt all requests to that specific API endpoint for the next 30 seconds.
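
For raw HTTP clients, a tiny helper (the name parse_retry_after is ours) can read the header defensively, falling back to a conservative default when it is missing or malformed:

```python
def parse_retry_after(headers, default=30):
    """Return the number of seconds to wait, per the Retry-After header.

    Falls back to `default` when the header is absent or not an integer
    (Slack sends plain seconds, e.g. "Retry-After: 30").
    """
    value = headers.get("Retry-After")
    try:
        return max(0, int(value))
    except (TypeError, ValueError):
        return default
```

Using a default rather than raising keeps the client from hammering the API if the header ever arrives in an unexpected shape.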

Step 1: Diagnosing the Root Cause

Before refactoring your application, you must identify why you are hitting the limits. Slack's rate limits are categorized into four primary tiers:

  • Tier 1 (~1 request per minute): Heavy operational tasks like files.upload or workspace-wide administrative functions.
  • Tier 2 (~20 requests per minute): List-style read methods, such as conversations.list or users.list.
  • Tier 3 (~50 requests per minute): Common workspace operations, such as conversations.join or conversations.history.
  • Tier 4 (~100 requests per minute): Methods designed for frequent use, such as users.info. Note that chat.postMessage carries its own special limit of roughly 1 message per second per channel, with short bursts tolerated.

Common Anti-Patterns Leading to 429s:

  1. Iterating over channels or users without delays: Running a for loop over 1,000 users to call users.info will burn through that method's per-minute budget almost immediately.
  2. Synchronous Events API processing: Slack requires your server to respond to incoming Event API webhooks with an HTTP 200 OK within 3 seconds. If your app processes the event (e.g., querying a database, calling a third-party LLM) before responding, Slack assumes the delivery failed and retries. This creates a cascading retry storm, rapidly depleting your rate limits.
  3. Webhook Abuse: Incoming Webhooks have a strict, non-tiered limit of 1 request per second. Bursting 5 messages to a webhook in 1 second will immediately trigger a 429.
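
The first anti-pattern is avoidable with simple pacing. As a sketch, a throttled() generator (the name is ours; the sleep function is injected so it can be tested) spaces iterations to fit a per-minute budget:

```python
import time

def throttled(items, per_minute, sleep=time.sleep):
    """Yield items no faster than `per_minute` allows.

    For a ~20 requests/minute method this inserts a ~3-second pause
    between iterations instead of bursting the whole loop at once.
    """
    interval = 60.0 / per_minute
    for i, item in enumerate(items):
        if i > 0:
            sleep(interval)  # pause before every call after the first
        yield item

# for user_id in throttled(user_ids, per_minute=20):
#     ...call users.info for user_id...
```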

Step 2: Implementing the 'Retry-After' Handler

The most immediate and required fix is to build middleware or wrappers around your HTTP client that respect the Retry-After header. If you are using the official Slack SDKs (like @slack/web-api for Node.js or slack-sdk for Python), this feature is often built-in but may require explicit configuration to enable RetryHandler classes.

If you are writing raw HTTP requests, your logic must catch the 429 status code, read the header, and invoke a blocking sleep or, preferably, schedule the task for later.
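
A sketch of that logic with the transport abstracted away (the names are ours; `call` stands in for whatever HTTP request you are making and returns a status code plus headers):

```python
import time

def call_respecting_retry_after(call, max_attempts=3, sleep=time.sleep):
    """Invoke `call()` until it stops returning 429, honoring Retry-After.

    `call` must return a (status_code, headers) tuple; `sleep` is
    injectable for testing. Returns the final (status_code, headers),
    which is still a 429 if every attempt was rate-limited.
    """
    for attempt in range(max_attempts):
        status, headers = call()
        if status != 429:
            return status, headers
        wait = int(headers.get("Retry-After", 30))
        if attempt < max_attempts - 1:
            sleep(wait)  # halt exactly as long as Slack asked
    return status, headers
```

Preferring a scheduled retry over a blocking sleep (e.g., re-enqueueing the task with a delay) keeps worker threads free, but the blocking version above is the simplest correct baseline.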

Step 3: Architecting for High-Volume Slack Apps

To permanently resolve rate limit and timeout issues in production environments, you must decouple message ingestion from message processing.

A. Acknowledge First, Process Later

When receiving an event from Slack, immediately return HTTP 200 OK. Place the event payload onto an asynchronous message broker like Redis, RabbitMQ, or AWS SQS. Background workers can then process these events at a controlled concurrency level that aligns with Slack's API tiers.
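
A minimal in-process sketch of this pattern (a real deployment would use Redis, RabbitMQ, or SQS rather than queue.Queue, and the handler names are illustrative):

```python
import queue
import threading

event_queue = queue.Queue()

def handle_slack_event(payload):
    """Webhook handler: enqueue and return immediately (the HTTP 200)."""
    event_queue.put(payload)
    return 200  # Slack sees the acknowledgment well inside 3 seconds

def worker(process):
    """Background consumer: drains events at a controlled pace."""
    while True:
        payload = event_queue.get()
        if payload is None:  # sentinel to shut the worker down
            event_queue.task_done()
            break
        process(payload)     # slow work (DB queries, LLM calls) happens here
        event_queue.task_done()

# threading.Thread(target=worker, args=(my_processor,), daemon=True).start()
```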

B. Caching Frequently Accessed Data

Do not call users.info or conversations.list repeatedly for the same entities. Implement an LRU (Least Recently Used) cache or a Redis key-value store. When you receive a user ID in an event, check the cache first. If it's a cache miss, fetch from Slack and store it with a reasonable TTL (Time To Live), such as 24 hours.
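
A sketch of such a cache with an injectable clock (the class and helper names are ours; a shared Redis instance is the better fit when multiple workers are involved):

```python
import time

class TTLCache:
    """Tiny TTL cache: check here before calling users.info again."""

    def __init__(self, ttl_seconds=86400, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self._store = {}  # key -> (value, expiry timestamp)

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self.clock() >= expires_at:  # stale entry: evict and miss
            del self._store[key]
            return None
        return value

    def set(self, key, value):
        self._store[key] = (value, self.clock() + self.ttl)

def get_user(cache, user_id, fetch):
    """Return a cached profile, calling `fetch` (the Slack API) only on a miss."""
    profile = cache.get(user_id)
    if profile is None:
        profile = fetch(user_id)
        cache.set(user_id, profile)
    return profile
```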

C. Batching and Pagination

When dealing with endpoints that return lists (like channel history or user lists), rely on pagination (cursor) instead of bulk brute-force queries. Ensure your pagination loops have deliberate micro-sleeps (e.g., time.sleep(1)) between iterative calls to respect Tier 2 limits.
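
A sketch of such a paced pagination loop (fetch_page stands in for a call like conversations.history with a cursor argument; the sleep function is injected for testability):

```python
import time

def paginate(fetch_page, pause_seconds=1.0, sleep=time.sleep):
    """Collect all items from a cursor-paginated endpoint, pausing between pages.

    `fetch_page(cursor)` must return (items, next_cursor), with next_cursor
    falsy ("" or None) on the final page -- mirroring Slack's
    response_metadata.next_cursor convention.
    """
    items, cursor = [], None
    while True:
        page, cursor = fetch_page(cursor)
        items.extend(page)
        if not cursor:
            return items
        sleep(pause_seconds)  # deliberate micro-sleep between pages
```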

Step 4: Resolving Slack API Timeouts

If you are experiencing timeouts rather than explicit 429s (e.g., the connection drops before a response is received):

  1. Adjust Client Timeout Settings: Many HTTP clients default to no timeout at all, hanging indefinitely on a dead connection. Explicitly set connection timeouts (e.g., 5 seconds) and read timeouts (e.g., 15 seconds).
  2. Network Egress Constraints: In environments like AWS Lambda or Kubernetes, sudden bursts of outbound connections to slack.com can exhaust NAT Gateway ports, resulting in network-level timeouts. Implement connection pooling (using requests.Session() in Python or keepAlive agents in Node.js) to reuse TCP connections rather than opening a new one for every Slack message.
  3. Implement Exponential Backoff: For network-related timeouts, use an exponential backoff strategy. Retry the request after 1 second, then 2 seconds, then 4 seconds, up to a maximum number of retries, adding a random "jitter" to prevent the thundering herd problem.
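
The schedule in step 3 can be written as a small helper; the 0.5-1.0 jitter multiplier below is one common choice (full and equal jitter are well-known variants):

```python
import random

def backoff_delay(attempt, base=1.0, cap=30.0, rng=random.random):
    """Delay before retry `attempt` (0-based): capped exponential plus jitter.

    attempt 0 -> ~1s, attempt 1 -> ~2s, attempt 2 -> ~4s, never above `cap`;
    the random multiplier in [0.5, 1.0] desynchronizes competing clients,
    avoiding the thundering herd problem.
    """
    raw = min(cap, base * (2 ** attempt))
    return raw * (0.5 + 0.5 * rng())
```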

Complete Example: Handling 429s and Timeouts in Python

The example below combines connection pooling, explicit timeouts, exponential backoff, and strict Retry-After handling into a single helper.
import time
import logging
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

# Configure logging
logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

def create_slack_session():
    """
    Creates a robust requests Session with built-in connection pooling,
    exponential backoff for timeouts, and automatic HTTP 429 handling.
    """
    session = requests.Session()
    
    # Configure retry strategy for network errors and 5xx responses
    # Note: urllib3 Retry doesn't perfectly handle Slack's custom Retry-After by default in all versions,
    # so we will also implement a manual wrapper below.
    retry_strategy = Retry(
        total=3,
        backoff_factor=1, # 1s, 2s, 4s
        status_forcelist=[413, 500, 502, 503, 504],
        allowed_methods=["POST", "GET"]
    )
    adapter = HTTPAdapter(max_retries=retry_strategy, pool_connections=10, pool_maxsize=10)
    session.mount("https://", adapter)
    return session

def post_slack_message_with_rate_limit_handling(session, token, channel, text):
    """
    Sends a message to Slack while strictly respecting the Retry-After header.
    """
    url = "https://slack.com/api/chat.postMessage"
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json"
    }
    payload = {
        "channel": channel,
        "text": text
    }

    max_attempts = 3
    for attempt in range(max_attempts):
        try:
            # Explicitly set timeouts: 5s for connection, 10s for reading response
            response = session.post(url, headers=headers, json=payload, timeout=(5, 10))
            
            if response.status_code == 429:
                # Extract the Retry-After header. Default to 30 seconds if missing.
                retry_after = int(response.headers.get("Retry-After", 30))
                logger.warning(f"[Attempt {attempt+1}] Rate limited. Sleeping for {retry_after} seconds.")
                time.sleep(retry_after)
                continue # Retry the loop
                
            response.raise_for_status()
            data = response.json()
            
            if not data.get("ok"):
                logger.error(f"Slack API Error: {data.get('error')}")
                return False
                
            logger.info("Message posted successfully.")
            return True
            
        except requests.exceptions.Timeout:
            # Handle standard network timeouts with our own backoff
            sleep_time = 2 ** attempt
            logger.warning(f"[Attempt {attempt+1}] Connection timeout. Backing off for {sleep_time} seconds.")
            time.sleep(sleep_time)
        except requests.exceptions.RequestException as e:
            logger.error(f"Critical HTTP error: {e}")
            break
            
    logger.error("Exhausted all retries. Message failed.")
    return False

# Usage Example:
# session = create_slack_session()
# post_slack_message_with_rate_limit_handling(session, "xoxb-your-token", "#general", "Hello, World!")

Error Medic Editorial

Error Medic Editorial comprises senior SREs, DevOps engineers, and cloud architects dedicated to untangling the internet's most frustrating infrastructure and API errors.
