Error Medic

How to Fix Twilio Rate Limit (Error 20429 & 503 Service Unavailable)

Resolving Twilio rate limit errors (20429) and 503 responses. Learn how to implement exponential backoff, manage message queueing, and increase API concurrency.

Key Takeaways
  • Twilio enforces a strict API concurrency limit (typically 100 concurrent connections), triggering HTTP 429 and Error 20429 when exceeded.
  • HTTP 503 Service Unavailable often indicates you have exceeded your number's Messages Per Second (MPS) throughput and Twilio's internal queues are overflowing.
  • Immediate fix: Implement client-side exponential backoff with jitter on HTTP 429 responses.
  • Long-term fix: Decouple API requests using an asynchronous message broker (SQS, Redis, RabbitMQ) to strictly control worker concurrency.
  • Scale throughput by upgrading sender types (Toll-Free, Short Code) or utilizing Twilio Messaging Services to load-balance across a number pool.
Fix Approaches Compared
| Method | When to Use | Implementation Time | Risk Level |
| --- | --- | --- | --- |
| Exponential Backoff | Occasional API spikes causing intermittent 429s | Fast (< 1 day) | Low |
| Message Queueing (SQS/Redis) | Sustained high volume or bulk sending operations | Medium (1-3 days) | Low |
| Messaging Services (Number Pool) | Distributing load to bypass the 1 MPS limit on long codes | Fast (hours) | Low |
| Upgrade Sender Type (Short Code) | Consistently hitting hard MPS limits, requiring high throughput | Slow (weeks for approval) | Medium (high cost) |

Troubleshooting Twilio Rate Limits: Error 20429 and 503 Service Unavailable

When scaling communication systems, encountering Twilio rate limits is a rite of passage for DevOps and backend engineering teams. You're likely here because your application logs are suddenly flooded with HTTP 429 (Too Many Requests) or HTTP 503 (Service Unavailable) status codes, often accompanied by Twilio's specific Error 20429.

This guide provides a comprehensive, senior-level deep dive into identifying, mitigating, and architecturally resolving Twilio rate limiting issues to ensure your messaging and voice pipelines remain resilient.

Understanding the Errors: 429 vs. 503

Before implementing a fix, you must distinguish between API concurrency limits and carrier throughput limits.

Error 20429: Too Many Requests (API Rate Limit)

Twilio enforces a strict limit on the number of concurrent REST API requests. By default, most Twilio accounts are limited to 100 concurrent API requests. If your application attempts to open a 101st connection while 100 are still processing, Twilio will immediately reject it with an HTTP 429 and Error Code 20429.

This has nothing to do with how many SMS messages you are sending per second (MPS), but rather how many HTTP requests your servers are holding open with api.twilio.com.

HTTP 503: Service Unavailable (Queue Overload)

An HTTP 503 error from Twilio typically indicates one of two things:

  1. Twilio Internal Issues: A localized degradation on Twilio's API nodes.
  2. Queue Overflow: Twilio automatically queues outbound messages when you send them faster than your number's MPS (Messages Per Second) limit. Standard US Long Codes have a limit of 1 MPS. Twilio's internal queue holds up to 4 hours of messages. If your burst exceeds this 4-hour queue threshold, the queue drops the messages and returns a 503 or 429 depending on the exact endpoint.
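A quick back-of-envelope check tells you how close a burst comes to that 4-hour ceiling: the queue holds roughly your sender's MPS multiplied by 14,400 (the number of seconds in 4 hours). A minimal sketch, with the function name being illustrative:

```shell
# Rough capacity of Twilio's 4-hour outbound queue for a given sender:
# messages-per-second * 14400 seconds. A burst larger than this overflows.
queue_capacity() {
  mps=$1
  echo $((mps * 14400))
}

queue_capacity 1    # standard US long code -> 14400 messages
queue_capacity 3    # toll-free number     -> 43200 messages
```

If a single bulk job can emit more messages than this figure, you need queueing on your side of the API, not just retries.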

Step 1: Diagnosis & Log Analysis

The first step is to isolate the exact bottleneck. Are you hitting the API concurrency limit, or the MPS throughput limit?

Search your application logs or API gateway for the specific response payloads. A classic Error 20429 looks like this:

{
  "code": 20429,
  "message": "Too many requests",
  "more_info": "https://www.twilio.com/docs/errors/20429",
  "status": 429
}

If you use Datadog, Kibana, or CloudWatch, query your peak concurrent connections to the Twilio API (or estimate concurrency as requests per second multiplied by average request latency). If your peak connection count exceeds 100, you have an API concurrency issue.
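Without a log platform, a first pass over raw access logs works too. A sketch, assuming each log line carries an ISO-8601 timestamp followed by the HTTP status (the log format and function name are assumptions, not a standard):

```shell
# spikes_per_minute LOGFILE STATUS: count log lines carrying the given HTTP
# status, bucketed by minute (the first 16 characters of an ISO-8601
# timestamp, e.g. "2024-05-01T12:00"), busiest minutes first.
spikes_per_minute() {
  grep " $2" "$1" | cut -c1-16 | sort | uniq -c | sort -rn | head
}
```

Running `spikes_per_minute app.log 429` quickly shows whether 429s cluster around a cron job or bulk-send window.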

Step 2: Immediate Mitigation (Client-Side)

If you are actively experiencing dropped messages, you need to implement immediate retry logic with exponential backoff and jitter. Do not simply retry the request immediately, as this will compound the rate limit and cause a thundering herd problem.

When Twilio returns a 429, it may include a Retry-After HTTP header indicating how many seconds to wait. If that header is missing, fall back to a standard exponential backoff algorithm.

  1. Base Delay: Start with a 500ms delay.
  2. Multiplier: Multiply by 2 for each subsequent failure.
  3. Jitter: Add a random millisecond variance to prevent synchronized retries.
  4. Max Retries: Cap out at 5-7 retries before failing gracefully or dead-lettering the message.
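The rules above condense into a small helper; this is a minimal sketch (the function name and the 250ms jitter range are choices, not a Twilio API):

```shell
# next_delay ATTEMPT [RETRY_AFTER]
# Print the seconds to sleep before retry ATTEMPT (0-based): honor a
# Retry-After value when supplied, otherwise double a 500ms base delay per
# attempt and add up to 250ms of random jitter.
next_delay() {
  attempt=$1
  retry_after=$2
  if [ -n "$retry_after" ]; then
    echo "$retry_after"
    return
  fi
  awk -v a="$attempt" 'BEGIN { srand(); printf "%.3f\n", 0.5 * (2 ^ a) + rand() * 0.25 }'
}
```

A retry loop would then call `sleep "$(next_delay "$RETRY_COUNT" "$RETRY_AFTER")"` and give up (or dead-letter) after the retry cap.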

Step 3: Architectural Fixes (Long-Term)

To permanently solve Twilio rate limits in high-throughput environments, you must decouple your application's message generation from the actual Twilio API calls.

1. Implement Message Queuing (SQS / RabbitMQ / Redis)

Instead of your web workers calling the Twilio API synchronously during a user request or bulk cron job, push the message payload to a message broker like AWS SQS or Redis (via Celery/Sidekiq).

Configure a dedicated pool of worker processes to consume from this queue. You can strictly control the concurrency of these workers (e.g., limit them to 50), ensuring you never breach the 100 concurrent API connection limit.
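The worker-pool idea can be sketched even without a full broker. The hypothetical layout below spools one JSON payload file per message into a directory and uses `xargs -P` to cap concurrency; `send_one.sh`, standing in for a single Twilio API call, is an assumption:

```shell
# drain_outbox DIR WORKERS: process every queued payload file in DIR,
# running at most WORKERS concurrent sends via xargs -P.
drain_outbox() {
  dir=$1
  workers=$2
  find "$dir" -name '*.json' -print0 | xargs -0 -r -P "$workers" -n 1 ./send_one.sh
}
```

Calling `drain_outbox outbox 50` keeps you safely under the 100-connection ceiling even if another service shares the same Twilio account.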

2. Upgrade Your Sender Type (MPS Limits)

If your queue is backing up because you are fundamentally exceeding the 1 MPS limit of a standard Long Code, you need a larger pipe. Consider upgrading your sender type:

  • Toll-Free Numbers: ~3 MPS (requires verification).
  • High-Volume Short Codes: 100 to 1,000+ MPS (requires lengthy carrier approval and higher costs).
  • A2P 10DLC (US Only): Throughput is determined by your Trust Score, ranging from 1 to 60+ MPS.

3. Use Twilio Messaging Services (Copilot)

A Messaging Service allows you to pool multiple Twilio phone numbers together. Twilio will automatically load-balance your outbound API requests across the numbers in the pool. If you add ten Long Codes to a Messaging Service pool, your effective throughput increases from 1 MPS to 10 MPS, completely abstracting the load balancing logic away from your application codebase.
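In practice, the only application-side change is the request body: pass MessagingServiceSid instead of From. A hedged sketch (the wrapper function and its DRY_RUN switch are illustrative; the SIDs are placeholders you would replace with real values):

```shell
# twilio_send_pooled TO BODY: send an SMS through a Messaging Service so
# Twilio load-balances across the number pool. Requires TWILIO_ACCOUNT_SID,
# TWILIO_AUTH_TOKEN, and MESSAGING_SERVICE_SID in the environment.
# Set DRY_RUN=1 to print the request instead of sending it.
twilio_send_pooled() {
  to=$1
  body=$2
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "POST Messages.json To=$to MessagingServiceSid=$MESSAGING_SERVICE_SID"
    return 0
  fi
  curl -s -X POST "https://api.twilio.com/2010-04-01/Accounts/$TWILIO_ACCOUNT_SID/Messages.json" \
    --data-urlencode "To=$to" \
    --data-urlencode "MessagingServiceSid=$MESSAGING_SERVICE_SID" \
    --data-urlencode "Body=$body" \
    -u "$TWILIO_ACCOUNT_SID:$TWILIO_AUTH_TOKEN"
}
```

Because the sender selection happens on Twilio's side, no queue or routing logic in your codebase needs to change when you add numbers to the pool.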

Step 4: Monitoring and Alerting

Finally, implement proactive alerting. Do not wait for user complaints to know you are hitting limits.

  • Set a Datadog/Prometheus monitor on HTTP 429/503 responses from your Twilio HTTP clients.
  • Set CloudWatch Alarms on your SQS Dead Letter Queue (DLQ) where failed Twilio dispatches are stored.
  • Monitor how long messages sit in Twilio's internal queues before carrier delivery (the gap between a message's date_created and date_sent timestamps on the Message resource) to catch sustained backlogs.
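The Message resource exposes date_created and date_sent timestamps, and the gap between them approximates how long a message sat in Twilio's queue. A sketch using GNU date (the -d flag is GNU-specific; the RFC 2822 timestamps match the format Twilio's REST API returns):

```shell
# queue_seconds DATE_CREATED DATE_SENT: seconds a message spent queued on
# Twilio's side, computed from the RFC 2822 timestamps on the Message
# resource. Requires GNU date for the -d flag.
queue_seconds() {
  created=$(date -d "$1" +%s)
  sent=$(date -d "$2" +%s)
  echo $((sent - created))
}
```

Feeding this from your status-callback handler and alerting when the value grows for minutes at a time catches queue backlogs before they become 4-hour overflows.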

By combining synchronous backoff, asynchronous message queuing, and appropriate sender provisioning, you can build a communication pipeline capable of scaling to millions of messages without dropping a single API payload.

Appendix: Retry Script (Bash)

The following script wraps a single Twilio SMS send with exponential backoff and jitter, retrying on 429 and 503 responses.
#!/bin/bash
# A robust bash wrapper to send Twilio SMS with Exponential Backoff handling 429/503 errors

TWILIO_ACCOUNT_SID="your_account_sid"
TWILIO_AUTH_TOKEN="your_auth_token"
TO_NUMBER="+1234567890"
FROM_NUMBER="+0987654321"
BODY="Urgent DevOps Alert: Rate Limit Detected"

MAX_RETRIES=5
RETRY_COUNT=0
BASE_DELAY=1 # Start with 1 second

while [ $RETRY_COUNT -lt $MAX_RETRIES ]; do
  # Execute curl; print the HTTP status code on its own line after the body
  HTTP_RESPONSE=$(curl -s -w "\n%{http_code}" -X POST "https://api.twilio.com/2010-04-01/Accounts/$TWILIO_ACCOUNT_SID/Messages.json" \
    --data-urlencode "To=$TO_NUMBER" \
    --data-urlencode "From=$FROM_NUMBER" \
    --data-urlencode "Body=$BODY" \
    -u "$TWILIO_ACCOUNT_SID:$TWILIO_AUTH_TOKEN")

  # The status code is the last line; the body is everything before it
  HTTP_STATUS=$(echo "$HTTP_RESPONSE" | tail -n1)
  RESPONSE_BODY=$(echo "$HTTP_RESPONSE" | sed '$d')

  if [ "$HTTP_STATUS" -eq 200 ] || [ "$HTTP_STATUS" -eq 201 ]; then
    echo "Message sent successfully!"
    exit 0
  elif [ "$HTTP_STATUS" -eq 429 ] || [ "$HTTP_STATUS" -eq 503 ]; then
    echo "Rate limited ($HTTP_STATUS). Error: $RESPONSE_BODY"
    
    # Calculate exponential backoff with a bit of random jitter
    JITTER=$(awk -v min=0 -v max=1 'BEGIN{srand(); print min+rand()*(max-min)}')
    SLEEP_TIME=$(echo "($BASE_DELAY * (2 ^ $RETRY_COUNT)) + $JITTER" | bc)
    
    echo "Waiting for $SLEEP_TIME seconds before retry $((RETRY_COUNT+1))/$MAX_RETRIES..."
    sleep $SLEEP_TIME
    
    RETRY_COUNT=$((RETRY_COUNT+1))
  else
    echo "Fatal Error: HTTP $HTTP_STATUS - $RESPONSE_BODY"
    exit 1
  fi
done

echo "Failed to send message after $MAX_RETRIES attempts."
exit 1

Error Medic Editorial Team

Our editorial team consists of senior Site Reliability Engineers and DevOps architects with over a decade of experience operating high-throughput messaging pipelines, cloud infrastructure, and distributed systems at scale.
