Why does ArgoCD say the application is 'Synced' but the health is 'Degraded'?

A 'Synced' status means ArgoCD successfully applied the manifests from Git to the Kubernetes API. 'Degraded' means the resources were accepted, but they are failing to reach a healthy state in the cluster (e.g., Pods are CrashLoopBackOff, a Deployment has 0 available replicas, or an Ingress cannot provision a load balancer).

What is the difference between a Replace Sync and a Force Sync?

A 'Replace' sync uses `kubectl replace` instead of `kubectl apply`, which is useful if the resource is in a corrupted state but the fields aren't immutable. A 'Force' sync actually executes a `kubectl delete` followed by a `kubectl create`. Force sync is highly disruptive but necessary for immutable field changes.

How do I fix the 'metadata.annotations: Too long' error without enabling Server-Side Apply?

If you cannot enable Server-Side Apply, you must reduce the size of the resource. For ConfigMaps and Secrets, you can split the data across multiple resources or mount the configuration via an external secrets provider or volume instead of embedding massive strings in Git.

Why does my sync fail on the first run, but succeed on the second?

This is almost always a dependency issue, typically involving Custom Resource Definitions (CRDs) and Custom Resources (CRs). On the first run, the CR fails because the CRD isn't registered yet. By the second run, the CRD exists. Use ArgoCD Sync Waves to explicitly order the deployment of CRDs before CRs.

Can I ignore specific fields that cause constant OutOfSync issues?

Yes. You can use the `ignoreDifferences` block in your Application manifest. This allows you to specify API groups, kinds, and JSON pointers to fields that should be ignored during state comparison, which is essential for working alongside mutating admission webhooks.

Troubleshooting ArgoCD Sync Failed: Root Causes and Production Fixes

Comprehensive guide to diagnosing and fixing ArgoCD sync failures. Learn to resolve immutable field errors, CRD dependencies, Sync Hook failures, and RBAC issue

Last updated: February 24, 2026

Last verified: February 24, 2026

1,537 words

Key Takeaways

Always inspect the application controller logs and the 'operationState' of the Application CR for exact error messages.
Immutable field errors (e.g., StatefulSet volume claims, Deployment selectors) require a Force Sync or manual resource deletion.
CRD dependency issues can be resolved using Sync Waves or the 'SkipDryRunOnMissingResource' sync option.
Enable Server-Side Apply (SSA) to bypass the 262KB annotation limit for large Custom Resources or ConfigMaps.
Ensure the ArgoCD service account has the necessary RBAC permissions in the destination namespace.

ArgoCD Sync Failure Mitigation Strategies
Mitigation Method	Best Used For	Execution Time	Risk Level
Standard Retry	Transient API server unavailability or network timeouts	< 1m	Low
Sync Waves / Phases	Managing dependencies (e.g., deploying CRDs before CRs)	1-5m	Low
Server-Side Apply	Fixing 'metadata.annotations: Too long' errors on large manifests	1m	Low
Force Sync (--force)	Overcoming immutable field change rejections (deletes & recreates)	1-2m	High
Replace Sync (--replace)	Fixing severely corrupted resource states without triggering hooks	1-2m	High

Understanding ArgoCD Sync Failures

ArgoCD is the cornerstone of modern GitOps workflows, continuously monitoring your Git repositories and synchronizing state into your Kubernetes clusters. However, when an argocd sync failed error occurs, your CI/CD pipeline halts, and the cluster state drifts from your declarative source of truth.

A sync failure means ArgoCD attempted to apply the manifests generated from your Git repository using the equivalent of kubectl apply, but the Kubernetes API server rejected the transaction, or a pre-condition within ArgoCD itself was not met. Understanding the specific mechanism of failure is critical to choosing the correct remediation strategy.

Step 1: Diagnosing the Exact Error

The first step in troubleshooting is moving beyond the generic 'Sync Failed' UI badge. You need the exact error message returned by the Kubernetes API.

Via the ArgoCD CLI:

argocd app get <your-app-name> --output wide

Look specifically at the Sync Status and Conditions output.

Via kubectl: Inspect the Application Custom Resource (CR) directly in the ArgoCD namespace:

kubectl get application <your-app-name> -n argocd -o yaml

Scroll down to the status.operationState and status.conditions arrays. You will often see detailed error messages logged here.

If the Application CR lacks detail, check the logs of the argocd-application-controller:

kubectl logs -n argocd -l app.kubernetes.io/name=argocd-application-controller | grep <your-app-name>

Step 2: Common Root Causes and Fixes

1. Immutable Field Changes

The Error: Invalid value: "...": field is immutable

The Cause: Kubernetes prevents updates to certain fields after a resource is created. Common examples include modifying a Deployment's spec.selector.matchLabels, changing the volumeClaimTemplates of a StatefulSet, or altering the clusterIP of a Service. A standard kubectl apply (which ArgoCD uses by default) cannot process these changes.

The Fix: You must delete the existing resource and let ArgoCD recreate it with the new specifications. You can do this via the ArgoCD UI by clicking 'Delete' on the specific resource (ensure cascading delete is off if you want to keep underlying pods running temporarily), or by using the --force flag during sync:

argocd app sync <app-name> --force

Warning: Force syncing deletes and recreates the resource, which will cause downtime for the affected workload.

2. The "Resource Too Large" Error

The Error: metadata.annotations: Too long: must have at most 262144 bytes

The Cause: Client-side apply (the default) stores the previous state of the resource in an annotation (kubectl.kubernetes.io/last-applied-configuration). If you have a massive ConfigMap, Secret, or a heavily populated Custom Resource (like a Prometheus rule set), this annotation exceeds Kubernetes' 262KB limit.

The Fix: Switch to Server-Side Apply (SSA). SSA stores the managed fields on the server side, eliminating the massive client-side annotation. You can enable this per-application in the sync options:

apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
  name: large-app
spec:
  syncPolicy:
    syncOptions:
    - ServerSideApply=true

3. Missing Custom Resource Definitions (CRDs)

The Error: no matches for kind "MyCustomResource" in version "v1alpha1" or the server could not find the requested resource

The Cause: Your Git repository contains both a CRD and a Custom Resource (CR) that relies on that CRD. ArgoCD attempts to apply both simultaneously. The API server accepts the CRD, but immediately rejects the CR because the API server's discovery cache hasn't updated yet to recognize the new CRD.

The Fix: Utilize ArgoCD Sync Waves and Phases. Assign the CRD a lower sync wave than the CR, ensuring ArgoCD waits for the CRD to be fully established before attempting to apply the CR.

Annotate your CRD:

metadata:
  annotations:
    argocd.argoproj.io/sync-wave: "-1"

Annotate your CR:

metadata:
  annotations:
    argocd.argoproj.io/sync-wave: "0"

Additionally, you can add SkipDryRunOnMissingResource=true to your sync options to prevent ArgoCD from failing the initial dry-run phase.

4. Sync Hook Failures

The Error: Hook <hook-name> failed

The Cause: ArgoCD allows you to run Jobs at specific phases of the lifecycle (PreSync, Sync, PostSync, SyncFail). If a Job annotated with argocd.argoproj.io/hook: PreSync fails (e.g., exits with a non-zero code, such as a database migration script failing), ArgoCD aborts the entire sync process.

The Fix: Investigate the specific Job that failed.

kubectl get pods -n <target-namespace> | grep <hook-name>
kubectl logs <hook-pod-name> -n <target-namespace>

Resolve the issue in your hook script. If the hook job is stuck, you may need to manually delete the Job resource, adjust your hook-delete-policy annotations (e.g., hook-succeeded,hook-failed), and re-trigger the sync.

5. RBAC and Permission Denied Errors

The Error: Error from server (Forbidden): error when creating "...": roles.rbac.authorization.k8s.io "..." is forbidden: user "system:serviceaccount:argocd:argocd-application-controller" cannot create resource "roles" in API group "rbac.authorization.k8s.io" in the namespace "target-ns"

The Cause: When ArgoCD manages multiple clusters or strictly segregated namespaces, the argocd-application-controller ServiceAccount must be granted explicit RBAC permissions to manage resources in the destination cluster/namespace. It defaults to cluster-admin in simple setups, but in hardened environments, its permissions are restricted.

The Fix: Review the ClusterRole bindings associated with the ArgoCD controller. Ensure it has the necessary RBAC verbs (get, list, watch, create, update, patch, delete) for the API groups and resources you are trying to deploy.

Step 3: Best Practices to Prevent Sync Failures

Enable Auto-Sync Carefully: Auto-sync is powerful but can lead to infinite sync loops if manifests are dynamically mutating or if admission controllers are altering applied manifests. If using auto-sync, ensure you understand the implications of selfHeal and prune.
Use IgnoreExtraneous for Mutating Webhooks: If an external system (like a service mesh injector or an operator) injects fields into your manifests, ArgoCD will constantly see the resource as OutOfSync. Use the IgnoreDifferences configuration in your Application spec to ignore these specific fields.
Validate Locally: Before pushing to your Git repository, validate your manifests against your cluster's API version using tools like kubeconform or by running kubectl apply --dry-run=server -f ..

Frequently Asked Questions

bash

#!/bin/bash
# Diagnostic script for ArgoCD sync failures

APP_NAME="my-failing-app"
ARGOCD_NAMESPACE="argocd"

echo "=== Checking Application Status ==="
kubectl get application $APP_NAME -n $ARGOCD_NAMESPACE -o jsonpath='{.status.operationState.message}' | jq .

echo -e "\n=== Checking Sync Conditions ==="
kubectl get application $APP_NAME -n $ARGOCD_NAMESPACE -o jsonpath='{.status.conditions}' | jq .

echo -e "\n=== Fetching Application Controller Logs for the last 15 minutes ==="
kubectl logs -n $ARGOCD_NAMESPACE deployment/argocd-application-controller --since=15m | grep -i "$APP_NAME" | grep -i error

# Example: Triggering a sync with Server Side Apply enabled via CLI
# argocd app sync $APP_NAME --server-side

# Example: Triggering a force sync (deletes and recreates resources)
# argocd app sync $APP_NAME --force

Error Medic Editorial

Error Medic Editorial comprises seasoned DevOps engineers, Site Reliability Experts, and Cloud Architects dedicated to creating actionable, code-first troubleshooting guides for modern infrastructure.

Sources

Explore More DevOps Config Guides

Ansible Failed: Fix Connection Refused, Permission Denied & Timeout Errors

Fix Ansible failures including connection refused, permission denied, and timeout errors. Step-by-step diagnosis with real commands and verified solutions.

AWS ECS Timeout: Task Failed ELB Health Checks & Container Startup Timeouts — Complete Fix Guide

Fix AWS ECS timeout errors including failed ELB health checks, container startup timeouts, and deployment stalls with step-by-step CLI commands and config fixes

CircleCI Build Failed: How to Fix OOM, Permission Denied, and Timeout Errors

CircleCI build failed? Diagnose and fix out-of-memory, permission denied, and timeout errors with step-by-step commands and config fixes. Resolve in minutes.