Module: Kubernetes Operations

ADVANCED · Medium · 60m

Cluster mental model (control plane, nodes, workloads)

Platform Engineering Road

Lesson notes

Goal

Understand the Kubernetes architecture well enough that debugging becomes a matter of logic rather than guesswork.

The mental model

control plane (API server + etcd + scheduler + controllers)
nodes (kubelet + container runtime)
workloads (pods, deployments)

Flow:

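kubectl -> API server (stores state in etcd) -> controllers (create pods) -> scheduler (assigns a node) -> kubelet (starts containers) -> pods

Each component talks only to the API server and reacts to the changes it watches there; no component calls the next one directly.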
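The flow above can be observed hop by hop on a test cluster. The following is a minimal sketch, not a definitive procedure: trace_flow is a hypothetical helper name, and the deployment name "demo", the "default" namespace, and the nginx image are placeholder assumptions that do not come from the lesson.

```shell
#!/usr/bin/env bash
# Sketch: watch one Deployment move through the flow, hop by hop.
# "demo", "default" and the nginx image are placeholders, not from the lesson.
# KUBECTL can be overridden (for example with "microk8s kubectl").
KUBECTL="${KUBECTL:-kubectl}"

trace_flow() {
  local name="${1:-demo}"
  local ns="${2:-default}"

  # kubectl -> API server: the Deployment object is accepted and stored in etcd.
  $KUBECTL create deployment "$name" --image=nginx -n "$ns"

  # Controllers: the Deployment controller creates a ReplicaSet,
  # and the ReplicaSet controller creates the pods.
  $KUBECTL get deployment,replicaset,pod -l "app=$name" -n "$ns"

  # Scheduler: the NODE column shows where each pod was placed.
  $KUBECTL get pods -l "app=$name" -o wide -n "$ns"

  # kubelet: recent events include Scheduled, Pulling, Pulled, Created, Started.
  $KUBECTL get events --sort-by=.lastTimestamp -n "$ns"
}
```

Call it as: trace_flow demo default. Right after creation the pods may still be Pending, so rerunning the last two commands a few seconds later shows the scheduler and kubelet steps completing. Clean up afterwards with: kubectl delete deployment demo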

Commands you will use daily

kubectl get nodes
kubectl get pods -A
kubectl describe pod <name>
kubectl logs <pod> -c <container> --tail=200
kubectl get events -A --sort-by=.lastTimestamp | tail
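When a single pod misbehaves, the commands above are usually run in roughly the same order. The sketch below bundles them into one function as an illustration; triage_pod is a hypothetical helper name, and the ordering is one reasonable choice rather than a prescribed procedure. It adds two standard kubectl options that the list does not show: --previous (logs from the last crashed container) and --field-selector (events for one object only).

```shell
#!/usr/bin/env bash
# Sketch: run the daily commands against one pod, in a fixed order.
# triage_pod is a hypothetical helper name; the kubectl flags are standard.
KUBECTL="${KUBECTL:-kubectl}"

triage_pod() {
  local pod="${1:-}"
  local ns="${2:-default}"
  local container="${3:-}"
  local cflag=""

  if [ -z "$pod" ]; then
    echo "usage: triage_pod <pod> [namespace] [container]" >&2
    return 2
  fi
  if [ -n "$container" ]; then
    cflag="-c $container"
  fi

  # Status, restart count, and the node the pod runs on.
  $KUBECTL get pod "$pod" -o wide -n "$ns"
  # Conditions, probe configuration, last state, and per-pod events.
  $KUBECTL describe pod "$pod" -n "$ns"
  # Logs from the current container.
  $KUBECTL logs "$pod" $cflag --tail=200 -n "$ns"
  # Logs from the previous container instance; errors are ignored
  # because a pod that never restarted has no previous logs.
  $KUBECTL logs "$pod" $cflag --tail=200 --previous -n "$ns" || true
  # Events for this pod only, sorted by time.
  $KUBECTL get events --field-selector "involvedObject.name=$pod" --sort-by=.lastTimestamp -n "$ns"
}
```

Call it as: triage_pod <pod> <namespace> <container>. The namespace defaults to "default" and the container argument can be omitted for single-container pods.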

What breaks most often

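image pull issues (ErrImagePull, ImagePullBackOff: wrong image name or tag, or missing registry credentials)
bad env vars/secrets (CreateContainerConfigError or CrashLoopBackOff)
missing permissions (RBAC: "forbidden" errors from the API server)
readiness probe failing (pod is Running but not Ready, so it receives no traffic)
resource limits too low (OOMKilled containers, CPU throttling)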
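Each failure class in the list above tends to surface as a recognizable reason string in pod status, events, or API errors. The sketch below maps those strings to a first thing to check. The reason strings are ones Kubernetes reports; likely_cause is a hypothetical helper name, and the advice text is a rule of thumb, not an exhaustive diagnosis.

```shell
#!/usr/bin/env bash
# Sketch: map a status or event reason to the failure class it usually signals.
# likely_cause is a hypothetical helper; the advice is a starting point only.
likely_cause() {
  case "${1:-}" in
    ErrImagePull|ImagePullBackOff)
      echo "image pull: check image name, tag, and registry credentials" ;;
    CreateContainerConfigError)
      echo "config: a referenced Secret or ConfigMap key is missing" ;;
    CrashLoopBackOff)
      echo "crash on start: read logs with --previous, then check env vars" ;;
    Forbidden)
      echo "RBAC: the service account lacks a Role or RoleBinding" ;;
    Unhealthy)
      echo "probe: readiness or liveness check failing, see describe pod" ;;
    OOMKilled)
      echo "resources: memory limit too low for the workload" ;;
    *)
      echo "unknown: start with describe pod and recent events" ;;
  esac
}
```

The waiting reason of a pod's first container can be read with: kubectl get pod <name> -o jsonpath='{.status.containerStatuses[0].state.waiting.reason}' and passed to likely_cause as its argument.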

Next Step

Deploy and roll out safely with probes and rollout strategies.
