Evangelize automation, DevOps best practices, and enterprise IT infrastructure
Execute maintenance, development, and troubleshooting of our cloud/on-prem hybrid distributed architecture platform
A Linux based on-prem application platform, using IoT-like design patterns, enabling a variety of new and innovative capabilities driving new product functionality
Change Data Capture (CDC) platform tooling and operations automation
Aid multiple teams in the development of Infrastructure as Code
Work on AWS account separation, migration and environment capabilities
Perform Linux system maintenance
Develop the tooling and processes to aid in building a team on call support rotation to handle early morning checks, improve observability, etc.
Regularly perform Relational Database Services (RDS) upgrades
Build and maintain internal tooling systems to support and improve organizational needs:
Slack commands
Internal services and APIs that help drive support, monitoring, alerting, etc
OpenTelemetry collectors and infrastructure to support an internal platform for observability
Responsible for participating in IT change management and asset management processes
Provides expert guidance and leadership to less-experienced cloud services personnel
May serve as a task or project leader and may consult with executives and customers to deliver complex cloud-based solutions
Participate in continuous improvement planning for cloud operations, including technology reviews, root cause analysis, action reports analysis, and cloud portfolio review to ensure adequate infrastructure to provide a suitable operating environment(s) for critical applications
Other IT duties as assigned
General systems troubleshooting and understanding of overall architecture
Ability to travel as needed
Extensive AWS experience (EC2, Lambda, RDS, Fargate, Route53, VPC, CloudFront, AppSync, API Gateway, Cognito, CloudWatch, DynamoDB, SNS, SQS, S3, etc.)
4-6 years of professional experience in infrastructure architecture and development Terraform, Ansible, Docker, and other related technologies
Knowledge of Linux system administration, maintenance, and deployment
Experience with scripting languages such as Bash, Python, Ruby, etc.
Demonstrated understanding of one or more programming languages, such as Typescript, Javascript, Go, etc.
Experience with PostgreSQL or other relational databases
Experience with modern observability solutions, allowing the management of “cattle, not pets” and other ephemeral distributed processing technologies
Ability to manage multiple tasks across several projects
Highly motivated and self directed
Excellent analytical, planning, organizational, and technical skills
Understanding of emerging technology trends/disruptions and their practical application to new and existing technologies
Excellent understanding of application support and development methodologies, infrastructure, and network
Good understanding of product management, agile principles and development methodologies and capability of supporting agile teams by providing guidance on opportunities, impact, and risks, taking account of technical and architectural debt