About the Role
Provide L1 operational support for Red Hat OpenShift / Kubernetes platforms.
Perform daily health checks on OpenShift clusters, nodes, pods, operators, routes, storage, and platform services.
Monitor cluster availability, resource usage, alerts, and workload status using Prometheus, Grafana, Alertmanager, ELK, Splunk, or equivalent tools.
Perform first-level troubleshooting for common platform issues, including pod failures, CrashLoopBackOff, image pull errors, node resource pressure, route / ingress access issues, DNS-related issues, persistent volume mount issues, certificate expiry alerts, and basic authentication or access issues.
Support routine platform operations such as user and group access requests, namespace / project checks, pod restarts and rollout validation, log collection, basic YAML review, alert validation, and evidence collection for incidents and changes.
Escalate complex issues to L2/L3 platform teams with clear initial findings, logs, timestamps, affected components, and business impa...