Error Medic

Plaid Rate Limit Error: RATE_LIMIT_EXCEEDED Troubleshooting Guide

Fix Plaid RATE_LIMIT_EXCEEDED (HTTP 429) errors with exponential backoff, request queuing, and caching strategies. Step-by-step guide for devs.

Key Takeaways
  • Root cause 1: Burst traffic exceeding Plaid's per-minute or per-second request quotas on endpoints like /transactions/get or /auth/get, triggering HTTP 429 with error_code RATE_LIMIT_EXCEEDED.
  • Root cause 2: Missing retry logic causes repeated hammering of the API after a rate limit is hit, compounding the problem and extending the cooldown window.
  • Root cause 3: Shared client IDs across multiple application environments (staging + production) consuming the same quota pool simultaneously.
  • Quick fix summary: Implement exponential backoff with jitter on all Plaid API calls, cache responses aggressively (especially /transactions/get), move bulk operations to off-peak hours, and audit shared credentials across environments.
Fix Approaches Compared
| Method | When to Use | Time to Implement | Risk |
|---|---|---|---|
| Exponential backoff with jitter | Any environment hitting 429s under burst load | 1-2 hours | Low — pure retry logic, no architectural change |
| Response caching (Redis/Memcached) | High-read workloads repeatedly fetching same user data | 4-8 hours | Medium — stale data window must match business tolerance |
| Request queue with rate limiter (e.g., Bottleneck.js, token bucket) | Background jobs or batch ingestion pipelines | 4-12 hours | Low — adds latency but prevents rate exhaustion entirely |
| Plaid Webhooks + event-driven updates | Apps polling /transactions/get on a schedule | 1-3 days | Low long-term — eliminates polling root cause permanently |
| Separate client IDs per environment | Shared credentials across staging and production | 30 minutes | Low — requires Plaid dashboard access and env var update |
| Upgrade Plaid plan / request limit increase | Legitimate traffic exceeding current tier limits | 1-5 business days (support ticket) | None — pure capacity expansion |

Understanding the Plaid RATE_LIMIT_EXCEEDED Error

When your application exceeds the request quota assigned to your Plaid client ID, the Plaid API returns an HTTP 429 response with the following JSON body:

{
  "display_message": null,
  "error_code": "RATE_LIMIT_EXCEEDED",
  "error_message": "rate limit exceeded for this item",
  "error_type": "RATE_LIMIT_ERROR",
  "request_id": "HNlA3",
  "suggested_action": null
}

Plaid enforces rate limits at multiple levels: per-item (per linked bank account), per-client-id, and per-endpoint. The most commonly hit limits are on /transactions/get, /auth/get, and /identity/get — all of which return large payloads that developers tend to poll aggressively.
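The fields above are all you need to classify the error programmatically. Here is a minimal sketch (the `isPlaidRateLimit` helper name is my own, not part of the Plaid SDK) that works with the axios-style errors thrown by plaid-node:

```javascript
// Classify a Plaid API error by inspecting the HTTP status and error_code.
// Works with axios-style errors (err.response.status / err.response.data)
// as thrown by the plaid-node SDK.
function isPlaidRateLimit(err) {
  const status = err?.response?.status;
  const body = err?.response?.data || {};
  return status === 429 || body.error_code === 'RATE_LIMIT_EXCEEDED';
}

// Example error shaped like the JSON body shown above
const sampleErr = {
  response: {
    status: 429,
    data: {
      error_code: 'RATE_LIMIT_EXCEEDED',
      error_type: 'RATE_LIMIT_ERROR',
      error_message: 'rate limit exceeded for this item',
      request_id: 'HNlA3',
    },
  },
};

console.log(isPlaidRateLimit(sampleErr)); // true
```

Checking both the status code and the error_code is deliberate: some HTTP clients surface only one or the other depending on how the response was parsed.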

Plaid does not publish exact numeric rate limits in their public documentation, but community reports and Plaid support responses indicate that most production environments are limited to roughly 10-15 requests per second per client ID, with per-item limits that are significantly lower (sometimes as few as 2-3 requests per minute for transaction-heavy endpoints).


Step 1: Diagnose the Source of Rate Limiting

1.1 Identify which endpoint is being rate limited

All Plaid API errors include a request_id. Log this alongside the endpoint path, timestamp, and item_id (if applicable). A 5-minute spike in 429s on a single endpoint points to burst traffic; a continuous drip of 429s on a single item points to a polling loop.
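One way to capture those fields consistently is a small structured-log helper. The shape below is illustrative, not a Plaid convention; the function name and field set are assumptions:

```javascript
// Build one JSON log entry per Plaid error so 429s can later be
// aggregated by endpoint, item, and time bucket.
function plaidErrorLogEntry(endpoint, err, itemId) {
  const body = err?.response?.data || {};
  return {
    ts: new Date().toISOString(),
    endpoint,
    status: err?.response?.status ?? null,
    error_code: body.error_code ?? null,
    request_id: body.request_id ?? null,
    item_id: itemId || null,
  };
}

const entry = plaidErrorLogEntry(
  '/transactions/get',
  { response: { status: 429, data: { error_code: 'RATE_LIMIT_EXCEEDED', request_id: 'HNlA3' } } },
  'item-abc'
);
console.log(JSON.stringify(entry));
```

Emitting one JSON line per error makes the grep/awk aggregation shown below trivial, and keeps the request_id available for Plaid support tickets.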

In your application logs, search for the pattern:

grep -E 'RATE_LIMIT_EXCEEDED|429' /var/log/app/api.log | \
  awk '{print $1, $2}' | \
  sort | uniq -c | sort -rn | head -20

1.2 Check for shared credentials

If you have multiple services or environments using the same PLAID_CLIENT_ID and PLAID_SECRET, their quotas are pooled. Verify:

# List all services referencing Plaid credentials
grep -r 'PLAID_CLIENT_ID' /etc/environment /etc/app/*.env ~/.env* 2>/dev/null

# Check Kubernetes secrets if applicable
kubectl get secrets -A | grep plaid
kubectl describe secret plaid-credentials -n production
kubectl describe secret plaid-credentials -n staging

If both staging and production point to the same client ID, you're sharing quota — this is a common silent killer.

1.3 Identify polling anti-patterns

Search your codebase for scheduled calls to transaction or balance endpoints:

# Find cron-style patterns near Plaid client calls
grep -rn 'setInterval\|cron\|schedule\|polling' src/ | \
  grep -v node_modules | grep -v '.test.'

If you're calling /transactions/get every 30 seconds for 1000 users, you're generating ~120,000 requests per hour — almost certainly over any tier limit.
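That arithmetic generalizes into a quick budget check. Before shipping any scheduled job, it is worth computing the request rate it implies; the helper below is a back-of-envelope sketch:

```javascript
// Requests per hour generated by polling `users` items every `intervalSeconds`.
function pollingRequestsPerHour(users, intervalSeconds) {
  return Math.round(users * (3600 / intervalSeconds));
}

console.log(pollingRequestsPerHour(1000, 30)); // 120000
console.log(pollingRequestsPerHour(1000, 3600)); // 1000 — hourly polling is 120x cheaper
```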


Step 2: Implement Exponential Backoff

The fastest mitigation is wrapping all Plaid API calls in a retry handler with exponential backoff and jitter. This prevents the thundering-herd problem where all retries fire simultaneously.

Node.js example using the official plaid-node SDK:

// No SDK import needed; the handler inspects the error's HTTP response directly.

async function plaidCallWithBackoff(fn, maxRetries = 5) {
  let attempt = 0;
  while (attempt < maxRetries) {
    try {
      return await fn();
    } catch (err) {
      const isRateLimit =
        err?.response?.data?.error_code === 'RATE_LIMIT_EXCEEDED' ||
        err?.response?.status === 429;

      if (!isRateLimit || attempt === maxRetries - 1) throw err;

      const baseDelay = Math.pow(2, attempt) * 1000; // 1s, 2s, 4s, 8s, 16s
      const jitter = Math.random() * 500;
      const delay = baseDelay + jitter;

      console.warn(
        `Plaid rate limited. Attempt ${attempt + 1}/${maxRetries}. Retrying in ${Math.round(delay)}ms`
      );
      await new Promise(res => setTimeout(res, delay));
      attempt++;
    }
  }
}

// Usage
const response = await plaidCallWithBackoff(() =>
  plaidClient.transactionsGet({
    access_token: accessToken,
    start_date: '2024-01-01',
    end_date: '2024-12-31',
  })
);

Python example:

import time
import random
from plaid.exceptions import ApiException

def plaid_call_with_backoff(fn, max_retries=5):
    for attempt in range(max_retries):
        try:
            return fn()
        except ApiException as e:
            if e.status != 429 or attempt == max_retries - 1:
                raise
            base_delay = (2 ** attempt)
            jitter = random.uniform(0, 0.5)
            delay = base_delay + jitter
            print(f"Rate limited. Attempt {attempt + 1}/{max_retries}. Retrying in {delay:.2f}s")
            time.sleep(delay)

Step 3: Cache Plaid Responses

For most use cases, bank account data does not change second-to-second. Cache aggressively:

  • Transactions: Safe to cache for 15-60 minutes. Use Plaid Transactions webhooks (e.g., SYNC_UPDATES_AVAILABLE) to invalidate on real changes.
  • Auth / Account numbers: Rarely change. Cache for 24 hours or longer.
  • Balances: Cache for 5-15 minutes depending on UX requirements.

Redis caching example (Node.js):

const crypto = require('crypto');

// Never build cache keys from the raw access token; hash it first.
function hashToken(token) {
  return crypto.createHash('sha256').update(token).digest('hex').slice(0, 16);
}

async function getCachedTransactions(accessToken, startDate, endDate) {
  const cacheKey = `plaid:txn:${hashToken(accessToken)}:${startDate}:${endDate}`;
  const cached = await redis.get(cacheKey);
  if (cached) return JSON.parse(cached);

  const response = await plaidCallWithBackoff(() =>
    plaidClient.transactionsGet({ access_token: accessToken, start_date: startDate, end_date: endDate })
  );

  await redis.setEx(cacheKey, 900, JSON.stringify(response.data)); // 15 min TTL
  return response.data;
}
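The TTLs from the list above can live in one place instead of being hardcoded at each call site. A sketch, with names of my own choosing:

```javascript
// Per-data-type cache TTLs in seconds, matching the guidance above.
const PLAID_CACHE_TTLS = {
  transactions: 15 * 60,  // 15 min; webhooks can invalidate earlier
  auth: 24 * 60 * 60,     // 24 h; account numbers rarely change
  balances: 5 * 60,       // 5 min; tune to your UX tolerance
};

function ttlFor(dataType) {
  const ttl = PLAID_CACHE_TTLS[dataType];
  if (ttl === undefined) throw new Error(`No cache policy for ${dataType}`);
  return ttl;
}

console.log(ttlFor('transactions')); // 900
```

Centralizing the policy means `redis.setEx(cacheKey, ttlFor('transactions'), ...)` stays correct if the business later tightens the staleness window.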

Step 4: Migrate Polling to Webhooks

This is the permanent fix for transaction polling anti-patterns. Configure a Plaid webhook endpoint in your dashboard (https://dashboard.plaid.com/developers/webhooks) and handle TRANSACTIONS webhook events:

// Express webhook handler
app.post('/webhooks/plaid', express.json(), async (req, res) => {
  const { webhook_type, webhook_code, item_id } = req.body;

  if (webhook_type === 'TRANSACTIONS') {
    if (webhook_code === 'SYNC_UPDATES_AVAILABLE') {
      // Queue a single sync job for this item
      await transactionSyncQueue.add({ item_id });
    }
  }
  res.sendStatus(200);
});

This eliminates polling entirely: you only call /transactions/sync when Plaid tells you new data exists.
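The queued sync job itself drives /transactions/sync with a stored cursor, paging until has_more is false. The sketch below hedges on persistence: loadCursor, saveCursor, and applyUpdates are hypothetical stand-ins for your own storage layer, injected here so the loop stays testable:

```javascript
// Drain /transactions/sync for one item, paging with next_cursor until
// has_more is false. loadCursor/saveCursor/applyUpdates are hypothetical
// stand-ins for your persistence layer.
async function syncItemTransactions(plaidClient, accessToken, loadCursor, saveCursor, applyUpdates) {
  let cursor = await loadCursor();
  let hasMore = true;
  while (hasMore) {
    const { data } = await plaidClient.transactionsSync({
      access_token: accessToken,
      cursor: cursor || undefined, // omit cursor on the very first sync
    });
    await applyUpdates({ added: data.added, modified: data.modified, removed: data.removed });
    cursor = data.next_cursor;
    hasMore = data.has_more;
  }
  await saveCursor(cursor); // persist only after a full, successful drain
  return cursor;
}
```

Persisting the cursor only after the loop completes means a crashed job simply re-syncs from the last saved cursor instead of losing updates.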


Step 5: Request Queuing for Bulk Operations

For bulk data ingestion (e.g., syncing 10,000 users at startup), use a token-bucket rate limiter:

// Install first: npm install bottleneck
const Bottleneck = require('bottleneck');

const limiter = new Bottleneck({
  maxConcurrent: 5,     // max parallel Plaid calls
  minTime: 200,         // minimum 200ms between requests (5 req/sec)
  reservoir: 100,       // token bucket: 100 requests
  reservoirRefreshAmount: 100,
  reservoirRefreshInterval: 60 * 1000  // refill every 60 seconds
});

// Wrap Plaid calls
const throttledGet = limiter.wrap(async (accessToken) => {
  return plaidClient.accountsGet({ access_token: accessToken });
});

// Process users without overwhelming the API
await Promise.all(users.map(u => throttledGet(u.plaidAccessToken)));

Complete Diagnostic Script

The bash script below bundles the checks from Step 1 into a single audit you can run from your application server:
#!/usr/bin/env bash
# Plaid Rate Limit Diagnostic Script
# Run from your application server to identify rate limiting patterns

set -euo pipefail

LOG_FILE="${1:-/var/log/app/api.log}"
OUTPUT_DIR="/tmp/plaid-ratelimit-audit-$(date +%Y%m%d-%H%M%S)"
mkdir -p "$OUTPUT_DIR"

echo "[1/5] Scanning for RATE_LIMIT_EXCEEDED errors in logs..."
grep -E 'RATE_LIMIT_EXCEEDED|429' "$LOG_FILE" 2>/dev/null | \
  awk '{print $1" "$2}' | sort | uniq -c | sort -rn | head -30 \
  > "$OUTPUT_DIR/rate_limit_timeline.txt"
cat "$OUTPUT_DIR/rate_limit_timeline.txt"

echo ""
echo "[2/5] Checking for shared Plaid credentials across environments..."
echo "--- Files referencing PLAID_CLIENT_ID ---"
grep -rn 'PLAID_CLIENT_ID' /etc/ ~/.env* ./env* ./.env* 2>/dev/null | \
  grep -v '.bak' | grep -v Binary \
  > "$OUTPUT_DIR/credential_files.txt" || true
cat "$OUTPUT_DIR/credential_files.txt"

echo ""
echo "[3/5] Identifying polling patterns in codebase (if ./src exists)..."
if [ -d ./src ]; then
  grep -rn --include='*.js' --include='*.ts' --include='*.py' \
    'setInterval\|cron\|schedule\|polling\|setTimeout' ./src/ | \
    grep -i plaid | grep -v '.test.' | grep -v node_modules \
    > "$OUTPUT_DIR/polling_patterns.txt" 2>/dev/null || true
  echo "Found $(wc -l < "$OUTPUT_DIR/polling_patterns.txt") potential polling patterns:"
  cat "$OUTPUT_DIR/polling_patterns.txt"
else
  echo "Skipping: ./src directory not found"
fi

echo ""
echo "[4/5] Counting unique item_ids being rate limited (last 1000 errors)..."
grep -E 'RATE_LIMIT_EXCEEDED' "$LOG_FILE" 2>/dev/null | \
  grep -oP '"item_id":"[^"]+"' | sort | uniq -c | sort -rn | head -20 \
  > "$OUTPUT_DIR/affected_items.txt" || true
cat "$OUTPUT_DIR/affected_items.txt"

echo ""
echo "[5/5] Checking if Bottleneck or rate-limiter is installed..."
if [ -f ./package.json ]; then
  echo "Node.js project detected:"
  grep -E 'bottleneck|rate-limiter|p-queue|async-throttle' ./package.json || \
    echo "WARNING: No rate limiting library found in package.json"
fi
if [ -f ./requirements.txt ]; then
  echo "Python project detected:"
  grep -iE 'ratelimit|backoff|tenacity|retry' ./requirements.txt || \
    echo "WARNING: No rate limiting library found in requirements.txt"
fi

echo ""
echo "=== Audit complete. Results saved to: $OUTPUT_DIR ==="
echo "Next steps:"
echo "  1. Review rate_limit_timeline.txt for burst patterns"
echo "  2. Verify credential_files.txt shows unique IDs per environment"
echo "  3. Refactor polling_patterns.txt to use Plaid webhooks"
echo "  4. Items in affected_items.txt need per-item backoff"

Error Medic Editorial

Error Medic Editorial is a team of senior DevOps engineers and SREs with experience designing high-throughput financial data pipelines on top of open banking APIs including Plaid, Yodlee, and MX. We specialize in API reliability, rate limit mitigation strategies, and incident post-mortems for fintech infrastructure.
