What Is a Kubernetes Job?

Q: What Is a Kubernetes Job?

A Kubernetes Job creates one or more Pods and ensures they run to successful completion. Unlike Deployments that keep Pods running indefinitely, Jobs are designed for finite tasks like database migrations, batch processing, or data exports.

Detailed Answer

A Job is a Kubernetes workload controller designed for tasks that should run to completion and then stop. While Deployments keep Pods running forever, Jobs ensure a specified number of Pods successfully terminate.

Basic Job Example

apiVersion: batch/v1
kind: Job
metadata:
  name: db-migration
spec:
  template:
    spec:
      containers:
        - name: migrate
          image: myapp/migrate:v3.2
          command: ["python", "manage.py", "migrate"]
          env:
            - name: DATABASE_URL
              valueFrom:
                secretKeyRef:
                  name: db-credentials
                  key: url
          resources:
            requests:
              cpu: "250m"
              memory: "256Mi"
            limits:
              cpu: "1"
              memory: "512Mi"
      restartPolicy: Never
  backoffLimit: 3

Key requirements:

restartPolicy must be Never or OnFailure (not Always)
backoffLimit controls how many times the Job retries on failure

Job Lifecycle

Job is created — the controller creates a Pod
Pod runs — executes the workload
Pod succeeds (exit code 0) — Job is marked Complete
Pod fails (non-zero exit code) — controller retries up to backoffLimit
All retries exhausted — Job is marked Failed

# Check Job status
kubectl get jobs
# NAME           COMPLETIONS   DURATION   AGE
# db-migration   1/1           45s        2m

# View the Pod created by the Job
kubectl get pods -l job-name=db-migration
# NAME                 READY   STATUS      RESTARTS   AGE
# db-migration-x7k2q   0/1     Completed   0          2m

# View logs from the completed Pod
kubectl logs db-migration-x7k2q

restartPolicy Behavior

The restartPolicy controls what happens when a container fails:

| Policy | Behavior | Use Case | |---|---|---| | Never | Job controller creates a new Pod | When you need a fresh environment on retry | | OnFailure | kubelet restarts the container in the same Pod | When restarting in-place is safe and faster |

With restartPolicy: Never, each retry creates a new Pod, so you can inspect failed Pods for debugging. With OnFailure, the container restarts in the same Pod, which is faster but loses the previous container's filesystem.

Common Use Cases

| Task | Description | |---|---| | Database migrations | Run schema changes before deploying a new version | | Data processing | Process a batch of files, records, or messages | | Report generation | Generate periodic reports and export to storage | | Backups | Create database or volume snapshots | | ML training | Train a model to completion | | Integration tests | Run a test suite against a staging environment |

Cleanup

Completed Jobs and their Pods remain in the cluster by default. Clean them up with:

# Delete a specific Job and its Pods
kubectl delete job db-migration

# Delete all completed Jobs
kubectl delete jobs --field-selector status.successful=1

Or use automatic TTL-based cleanup (covered in the job-ttl-after-finished question).

Jobs vs CronJobs

A Job runs once when created. A CronJob creates Jobs on a schedule, similar to cron in Linux. If you need periodic batch tasks, use a CronJob that creates Jobs at specified intervals.

Detailed Answer

Basic Job Example

Job Lifecycle

restartPolicy Behavior

Common Use Cases

Cleanup

Jobs vs CronJobs

Why Interviewers Ask This

Common Follow-Up Questions

Key Takeaways

Related Questions

You Might Also Like