Day-2 Kubernetes, handled

KubeCare is ongoing managed operations for your Kubernetes clusters — 24/7 monitoring, security patching, upgrades, cost optimisation, and incident response, all SLA-bound. Always-on coverage, not a one-off.

SRE on-call · Lifecycle stage 3 / 5

What's covered

The scope of "handled"

Everything that keeps a production estate healthy, carried by our SRE team instead of yours — continuously, and under SLA.

24/7 Monitoring

Our SRE team monitors your cluster around the clock. Alerts, incidents, and escalations handled before you wake up.

Managed Upgrades

Kubernetes and OpenShift upgrades planned, tested, and executed with zero downtime. Never be surprised by an EOL version.

Security Patching

CVE monitoring and patching with SLA-bound response times. We patch, you sleep.

Cost Optimisation

Monthly rightsizing analysis. We identify wasted resources and give you actionable savings recommendations.

Incident Response

P1 response in under 15 minutes. War room support, root cause analysis, and post-mortems with action items.

Monthly Health Reports

Cluster health, security posture, cost breakdown, and recommendations — delivered monthly.

The SLA

The SLA is the product

Response times you can plan around, committed in writing. While your cluster runs, our SRE team is on-call against these targets — continuously.

Uptime target: 99.9%
Coverage: 24/7

Severity  Description                Response
P1        Critical: production down  < 15 min
P2        Major: degraded service    < 2 hr
P3        Minor: non-urgent          < 24 hr

Response-time commitments per the KubeCare agreement.

How it works

From your cluster to ours, in three steps

We take an existing production estate under management without a rebuild — onboard, baseline, then run it.

01 / Step

Onboarding

We audit your cluster and stand up our monitoring and alerting stack against it.

02 / Step

Baseline

We establish runbooks, escalation paths, and the SLA agreement: your coverage, in writing.

03 / Step

Operate & Report

Continuous monitoring, patching, upgrades, and optimisation, with a monthly health report on metrics, incidents, and recommendations.

Deliverables

What the contract puts in writing

Coverage you can hold us to: response times committed, a monthly report on the estate, a planned upgrade cadence, and a live view of your security posture.

SLA-bound · reviewed monthly

01

SLA-backed incident response

Committed response times: P1 in under 15 minutes, P2 in under 2 hours, P3 in under 24 hours.

02

Monthly health report

Cluster metrics, security posture, cost breakdown, and recommendations.

03

Managed upgrade plan

Quarterly upgrade schedule with change windows and rollback plans.

04

Security posture dashboard

Real-time CVE tracking and patching status.

KubeCare

Ready to hand off cluster operations?

Tell us about your cluster and we'll scope a KubeCare plan.