Module: GitOps and Delivery Control

ADVANCED · Hard · 70m

GitOps with Argo CD

Platform Engineering Road

Lesson notes

Goal

Use Git as the source of truth for deployments and let Argo CD keep clusters in sync.

Why GitOps works

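  • every change is reviewed through a pull request (PR) before it reaches a cluster
  • rollback is easy: revert the commit and the cluster follows
  • drift is detected: Argo CD flags changes made to the cluster outside Git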
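
To make drift detection concrete, here is a minimal sketch of the idea in Python: compare what Git declares with what the cluster reports. This is a conceptual illustration only. Argo CD's real diffing is more involved, and the function name `find_drift` and the sample objects are hypothetical.

```python
def find_drift(desired, live, path=""):
    """Return the dotted paths where the live object differs from the desired one.

    Only fields declared in `desired` are checked, so extra fields that the
    cluster adds on its own (defaults, status) are not reported as drift.
    """
    drift = []
    for key, want in desired.items():
        here = f"{path}.{key}" if path else key
        have = live.get(key)
        if isinstance(want, dict):
            # Recurse into nested sections; a missing section counts as empty.
            drift.extend(find_drift(want, have if isinstance(have, dict) else {}, here))
        elif want != have:
            drift.append(here)
    return drift


# Hypothetical example: someone scaled the Deployment by hand with kubectl.
desired = {"spec": {"replicas": 3, "template": {"image": "shop/api:1.4.2"}}}
live = {
    "spec": {"replicas": 5, "template": {"image": "shop/api:1.4.2"}},
    "status": {"readyReplicas": 5},
}

print(find_drift(desired, live))  # ['spec.replicas']
```

The `status` block is ignored because Git never declared it. Only the hand-edited `replicas` field shows up as drift.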

Core concepts

  • Application: points to a repo/path (the source) and a target cluster/namespace (the destination)
  • Sync: apply the desired state from Git to the cluster
  • Health: whether the deployed objects are ready and running as expected
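
As a rough sketch of what an Application looks like, the Python below builds one as a dictionary and prints it as JSON, which `kubectl apply -f` accepts alongside YAML. The field names follow the Argo CD `Application` resource. The helper `build_application`, the repo URL, the path, and the namespaces are assumptions made up for illustration.

```python
import json


def build_application(name, repo_url, path, dest_namespace, revision="main"):
    """Build an Argo CD Application manifest as a plain dictionary."""
    return {
        "apiVersion": "argoproj.io/v1alpha1",
        "kind": "Application",
        # Assumes Argo CD is installed in the conventional "argocd" namespace.
        "metadata": {"name": name, "namespace": "argocd"},
        "spec": {
            "project": "default",
            # Source: where the desired state lives in Git.
            "source": {
                "repoURL": repo_url,
                "targetRevision": revision,
                "path": path,
            },
            # Destination: the cluster and namespace to deploy into.
            # This server URL refers to the cluster Argo CD itself runs in.
            "destination": {
                "server": "https://kubernetes.default.svc",
                "namespace": dest_namespace,
            },
        },
    }


# Hypothetical values, for illustration only.
app = build_application(
    name="shop-api",
    repo_url="https://example.com/platform/deployments.git",
    path="apps/shop-api/overlays/prod",
    dest_namespace="shop",
)
print(json.dumps(app, indent=2))
```

Keeping the source and destination side by side in one object is what lets Argo CD answer the Sync and Health questions for that pairing.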

Practical workflow

  1. Change manifests in Git
  2. PR review
  3. Merge to main
  4. Argo CD syncs automatically or after manual approval
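
The choice in step 4 comes down to the `syncPolicy` section of the Application. The sketch below shows both variants, assuming the common setup where automatic sync also prunes and self-heals. The helper name `sync_policy` is hypothetical. Which options to enable is a team decision, not something this lesson prescribes.

```python
import json


def sync_policy(automatic):
    """Return the syncPolicy section for an Argo CD Application spec.

    automatic=False: no `automated` block, so Argo CD only reports the app as
    OutOfSync and waits for someone to trigger the sync (UI or `argocd app sync`).
    automatic=True: Argo CD applies changes as soon as it sees them in Git.
    """
    if not automatic:
        return {}
    return {
        "automated": {
            "prune": True,     # delete resources that were removed from Git
            "selfHeal": True,  # undo changes made directly in the cluster
        }
    }


# One possible split: automatic for staging, manual approval for production.
for env, automatic in [("staging", True), ("production", False)]:
    print(env, json.dumps(sync_policy(automatic)))
```

A manual policy on production keeps a human in the loop after the merge, while staging picks up every commit on its own.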

Next Step

Add guardrails: policies, approvals, and limits so teams can self-serve safely.

