Error Medic

How to Fix Twilio Rate Limit (Error 20429 & 503 Service Unavailable) - A Comprehensive Guide

Resolve Twilio rate limit errors (20429) and 503 Service Unavailable. Learn how to implement exponential backoff, queues, and optimize your API requests.

Key Takeaways
  • Root cause 1: Exceeding the maximum allowed REST API requests per second (RPS) or concurrent connections to Twilio endpoints.
  • Root cause 2: Sending messages at a rate higher than your specific Sender ID's allocated Messages Per Second (MPS) throughput.
  • Quick fix summary: Implement an exponential backoff strategy for API retries and introduce a queuing system (like Redis or RabbitMQ) to control the flow of outbound requests.
Fix Approaches Compared
Method | When to Use | Time | Risk
Exponential Backoff | Handling intermittent 429/503 errors and immediate retry scenarios. | Low | Low
Message Queuing (Redis/RabbitMQ) | High-volume outgoing campaigns, bulk notifications, or sustained traffic spikes. | High | Medium
Upgrading Sender Throughput | When your baseline business needs simply exceed current account MPS limits (e.g., Toll-Free verification or Short Code). | Medium | Low
Connection Pooling | When running into 503 errors due to TCP connection exhaustion or TLS handshake limits. | Medium | Low

Understanding the Twilio Rate Limit Error

When scaling applications that rely on the Twilio API for SMS, voice, or WhatsApp messaging, encountering rate limit errors is a common rite of passage. These typically manifest as Error 20429 (Too Many Requests) or, in more severe cases of connection flooding, 503 Service Unavailable.

To effectively troubleshoot and resolve these issues, it is crucial to understand that Twilio enforces two distinct types of limits: API Request Limits and Message Throughput Limits (MPS).

1. API Request Limits (The 429 Too Many Requests)

Twilio's REST API is protected by rate limiters to ensure platform stability. If you send too many HTTP requests in a short period, Twilio responds with an HTTP 429 status code and a JSON body containing code: 20429. This limit applies to all API actions: creating messages, fetching call logs, or buying phone numbers. Typically, Twilio allows up to 100 concurrent requests, but sustained high-frequency requests will trigger the limiter.
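One cheap safeguard is to cap your own concurrency client-side, below that ceiling. A minimal sketch (the wrapper function and its limit of 50 are illustrative choices, not part of the Twilio SDK):

```python
import threading

# Cap the number of in-flight Twilio calls made by this process,
# keeping a safety margin below the 100-connection ceiling.
MAX_CONCURRENT = 50
_slots = threading.BoundedSemaphore(MAX_CONCURRENT)

def call_twilio(fn, *args, **kwargs):
    """Run any Twilio SDK call while holding a concurrency slot."""
    with _slots:
        return fn(*args, **kwargs)

# e.g. call_twilio(client.messages.create, to="+1...", from_="+1...", body="hi")
```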

2. Message Throughput Limits (MPS)

Even if you successfully submit a request to the Twilio API, your message enters a queue. How fast messages leave this queue and reach the carrier depends on your sender type's Messages Per Second (MPS):

  • US/Canada Local Numbers: 1 MPS
  • Toll-Free Numbers: Starts at 3 MPS (can be upgraded)
  • Short Codes: 100+ MPS

If your Twilio queue length exceeds 4 hours' worth of throughput, Twilio will reject further API requests to send messages, resulting in a rate limit error.
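That rejection threshold scales with your sender's MPS. A quick back-of-the-envelope calculation (plain arithmetic, no Twilio calls):

```python
def max_queue_depth(mps: float) -> int:
    """Messages Twilio will hold for one sender: 4 hours of throughput."""
    return int(mps * 4 * 60 * 60)

print(max_queue_depth(1))   # US/Canada local number: 14400 messages
print(max_queue_depth(3))   # Toll-free starting tier: 43200 messages
```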

3. The 503 Service Unavailable Error

A 503 error from Twilio generally indicates that the specific edge location or API endpoint is temporarily overloaded, or your application is opening and closing TCP/TLS connections too rapidly without reusing them (connection churn). When Twilio's load balancers detect a flood of new connection requests, they may drop them, resulting in a 503.


Step 1: Diagnose the Bottleneck

Before writing code, identify which limit you are hitting.

Check the Response Headers

When you receive a 429 response, Twilio includes specific headers that tell you exactly when you can retry. Look for the Retry-After header in your HTTP response logs. This header indicates the number of seconds you must wait before making another request to that endpoint.
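A small helper for honoring that header (a sketch; the function name and the 1-second fallback are our own choices):

```python
def retry_delay(headers, default=1.0):
    """Seconds to wait, taken from a 429 response's Retry-After header."""
    try:
        return float(headers.get("Retry-After", default))
    except (TypeError, ValueError):
        return default

# With the `requests` library: time.sleep(retry_delay(response.headers))
print(retry_delay({"Retry-After": "5"}))  # 5.0
print(retry_delay({}))                    # falls back to 1.0
```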

Review the Twilio Error Logs

Navigate to the Twilio Console -> Monitor -> Logs -> Error Logs. Search for 20429. The log details will tell you if the rate limit was hit due to concurrent connections, total account request rates, or a queue overflow for a specific phone number.

Analyze Your Connection Management

If you are seeing 503s, verify your HTTP client configuration. Are you creating a new HTTP client instance for every single request? If so, you are forcing a new DNS lookup, TCP handshake, and TLS negotiation every time, which can exhaust local ephemeral ports and trigger Twilio's anti-abuse mechanisms.


Step 2: Fix the Rate Limiting Issues

Fix 1: Implement Exponential Backoff with Jitter

If you are hitting standard API rate limits (429s), the immediate technical fix is to catch the exception and retry the request after a delay. However, a static delay (e.g., waiting exactly 2 seconds) can cause a "thundering herd" problem where all blocked requests retry simultaneously.

Instead, use Exponential Backoff with Jitter. This means the delay increases exponentially with each failure (e.g., 1s, 2s, 4s, 8s), and a random variation (jitter) is added to spread out the retries.

Most official Twilio SDKs have built-in retry mechanisms, but you must ensure they are enabled and configured correctly for your workload.
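If you roll your own retry loop instead, the delay calculation looks roughly like this ("full jitter" variant; the base and cap values are arbitrary examples):

```python
import random

def backoff_delay(attempt, base=1.0, cap=30.0):
    """Pick a random delay in [0, min(cap, base * 2**attempt)]."""
    return random.uniform(0.0, min(cap, base * (2 ** attempt)))

# Successive attempts spread out: up to 1s, 2s, 4s, 8s, 16s, then capped at 30s
for attempt in range(6):
    delay = backoff_delay(attempt)
```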

Fix 2: Utilize a Message Broker (Redis/RabbitMQ/Celery)

For bulk SMS campaigns or high-traffic notifications, you should never send messages synchronously within your application's main request-response cycle. If you loop through 10,000 users and call the Twilio API, you will inevitably hit the rate limit.

Instead, offload the sending process to a background worker system:

  1. Your application pushes 10,000 "send_sms" tasks to a Redis queue.
  2. A background worker (like Celery in Python, or Bull in Node.js) consumes these tasks.
  3. The worker is configured with a strict rate limit (e.g., processing no more than 50 tasks per second).

This completely decouples your application's speed from Twilio's API limits.

Fix 3: Implement Connection Pooling

To resolve 503 errors related to connection exhaustion, ensure your HTTP client utilizes connection pooling (Keep-Alive).

  • In Node.js, ensure your http.Agent or https.Agent has keepAlive: true.
  • In Python's requests library, use a Session object (requests.Session()) across multiple API calls rather than using requests.post() directly every time. This maintains a persistent connection to Twilio's servers, drastically reducing overhead.
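A sketch of the Python side (the pool sizes are arbitrary; `requests` mounts a pooling adapter by default, this just makes the sizing explicit):

```python
import requests

# One long-lived Session per process: urllib3 keeps sockets open (Keep-Alive)
# and reuses them instead of renegotiating TCP/TLS on every call.
session = requests.Session()
adapter = requests.adapters.HTTPAdapter(pool_connections=10, pool_maxsize=10)
session.mount("https://", adapter)

# All calls through `session` now reuse warm connections, e.g.:
# session.post("https://api.twilio.com/2010-04-01/...", auth=(sid, token), data=payload)
```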

Fix 4: Address Message Queue Overflows

If you are hitting the 4-hour queue limit, software fixes won't help—you need more throughput. You must upgrade your infrastructure on Twilio's side:

  • If using a standard 10DLC (10-Digit Long Code), ensure your brand and campaign are fully registered in the Trust Hub to get the maximum allowed throughput for your use case.
  • If using Toll-Free, apply for High-Throughput Toll-Free.
  • If your volume is consistently massive, migrate to a Short Code or use Twilio's Messaging Services to automatically distribute traffic across a pool of multiple phone numbers (Number Pooling).

Conclusion

Handling Twilio rate limits is about respecting the platform's constraints while designing resilient systems. By implementing connection pooling to prevent 503s, adding exponential backoff to gracefully handle 429s, and utilizing background queues for bulk sending, you can ensure high deliverability and system stability.

Complete Python Example

The script below combines the Twilio SDK with the tenacity retry library to send SMS with exponential backoff:
```python
import os
import logging

from twilio.rest import Client
from twilio.base.exceptions import TwilioRestException
from tenacity import retry, wait_exponential, retry_if_exception, stop_after_attempt

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

# Reuse a single client instance so the underlying HTTP session can
# keep its connection pool warm (see Fix 3).
client = Client(os.environ['TWILIO_ACCOUNT_SID'], os.environ['TWILIO_AUTH_TOKEN'])

def is_rate_limit_error(exception):
    """Return True for a Twilio 429 (Error 20429) or 503 response."""
    if isinstance(exception, TwilioRestException):
        return exception.status in (429, 503) or exception.code == 20429
    return False

# Retry only on rate-limit errors: wait 2^x seconds between attempts
# (capped at 10 seconds) and give up after 5 total attempts.
@retry(
    retry=retry_if_exception(is_rate_limit_error),
    wait=wait_exponential(multiplier=1, min=2, max=10),
    stop=stop_after_attempt(5),
    before_sleep=lambda retry_state: logger.warning(
        f"Rate limited by Twilio. Retrying in {retry_state.next_action.sleep} seconds..."
    )
)
def send_sms_with_backoff(to_number, from_number, body_text):
    try:
        message = client.messages.create(
            body=body_text,
            from_=from_number,
            to=to_number
        )
        logger.info(f"Message {message.sid} successfully dispatched.")
        return message
    except TwilioRestException as e:
        if not is_rate_limit_error(e):
            # Non-retriable errors (e.g., invalid number, unsubscribed recipient)
            # are logged and re-raised without triggering a retry.
            logger.error(f"Failed to send message due to a non-retriable error: {e}")
        raise

# Example usage
# send_sms_with_backoff("+1234567890", "+0987654321", "Hello, your server is down!")
```

Error Medic Editorial

Error Medic Editorial provides battle-tested troubleshooting guides for modern cloud infrastructure and APIs. Our articles are researched and written by senior Site Reliability Engineers who have spent years managing high-throughput production systems.
