Job Description:
- 5+ years is needed, and 2 years must be with Kubernetes hands on (they need people with lifecycle of Kubernetes clusters, creating, and maintaining them, not just managing them)
- They are a platform team, so this needs to be very focused on Kubernetes.
- DevOps are a highly sought after focus as well, but applicant should have experience supporting infrastructure / platforms / environments.
- Team manages the shared Kubernetes platform that serves the application teams.
- Their focus is on Azure and AKS (Azure Kubernetes services) will consider someone with AWS / EKS managed Kubernetes offering.
- The main role is to support lifecycle and engineering new features in Kubernetes platform.
- Infrastructure as Code, experience with tech stack beyond Kubernetes, git hub actions, Argo CD, SDO service mesh, observability tools like Grafana and Dynatrace and operating systems /Linux management.
- They are focused on making sure someone has strong verbal and written communication skills, and all classic soft skills are there.
- This list is intended to reflect the current job but there may be additional essential functions (and certainly non-essential job functions) that are not referenced.
- Management will modify the job or require other tasks be performed whenever it is deemed appropriate to do so, observing, of course, any legal obligations including any collective bargaining obligations.
- Design, deploy, and manage Cloud Infrastructure solution.
- Build scalable, secure, and resilient cloud solutions, such as Kubernetes Platform in Azure or other public cloud.
- Collaborate with Cybersecurity, enterprise architects to define and Implement cloud solution best practices following Well-Architect Framework best practices for security, reliability, performance, and cost efficiency.
- Ensure customer needs (product strategy objectives, user requirements, and technical environment compatibility) are met.
- Communicate product strategy, roadmap, and change effectively to customers and stakeholders.
- Drive innovation by automating infrastructure via your hands-on expertise with Infrastructure as code technologies.
- Manage lifecycle of cloud solutions and platforms that keep the system stack up to date.
- Writes, tests, and documents technical work products (e.g., code, scripts, processes) according to organizational standards and practices.
- Monitor cloud systems for availability, performance, and cost efficiency, optimizing usage and resources.
- Diagnose and resolve issues in cloud environments, providing support for internal teams and stakeholders.
- 24/7 on call rotation supporting business critical applications or systems.
- Conducts root cause analysis to identify domain level problems and prescribes action items to mitigate.
Minimum Qualifications:
- Bachelor’s degree in computer science, Computer Engineering, Technology, Information Systems (CIS/MIS), Engineering or related technical discipline, or equivalent experience/training
- Five years of working experience as a technology professional.
- Two years of platform operations experience, specifically supporting Kubernetes, microservices, and containerization.
- Two years of experience implementing infrastructure as code (Terraform)
Preferred Qualifications:
- Azure, AWS, or Kubernetes Technical Certifications
- 2+ years working with Microsoft Azure Kubernetes Service (AKS) or Amazon Elastic Kubernetes Service (EKS), specifically administering and maintaining cluster lifecycle
Skills, Licenses & Certifications
- Experience keeping production environments operating at peak performance on the cloud and in containers
- Experience managing production Kubernetes infrastructure
- Hands-on experience with infrastructure as code and configuration as code
- Deep understanding of DevOps best practices and CI/CD, particularly utilizing Github Actions and ArgoCD
- Experience with Service Mesh technologies such as Istio
- Experience with using Rancher as a control plane
- Experience with Observability technologies and optimizing monitoring using technologies such as Grafana, Dynatrace, and Prometheus
- Deep knowledge of Linux OS, network, and Azure PaaS solutions