Kubernetes knowledge and management The Sr. Site Reliability Engineer is responsible for ensuring operations for infrastructure... and management of Kubernetes clusters. 8+ years of experience in technical support and troubleshooting of multiple systems including...
/Cloud Infrastructure platforms. * SLO/SLA management and implementation experience Deep UNIX/Linux systems knowledge..., and designed to scale. As an SRE organization, we take pride in handling “operations as an engineering problem” with automation first...
in a large-scale production environment Experience with managing large numbers of diverse systems with configuration management... environment Experience with Linux/Unix, Networking, Systems Management, Systems Security Experience using modern object storage...