*IQGeo will consider any qualified applicants in the US and Canada
Job Summary
The Systems Reliability Engineering (SRE) department serves to support IQGEO’s hosted customers in AWS, provide consultative services alongside Implementation Services for external deployments in all three cloud providers and some on-prem scenarios. SRE provides a CI/CD pipeline to deliver solutions from Engineering and Implementation Services to customers. The SRE team works closely with the Engineering, Support, Implementation Services, and Pre-Sales departments to ensure that they are successful with cloud infrastructure and Kubernetes deployments.
Job Description
We are looking for a Systems Reliability Engineer Professional, who will be working closely with Engineering, Implementation Services, Support, and the rest of the Operations Team to enable rapid delivery of capabilities through a secure Continuous Integration/Continuous Delivery (CI/CD) Pipeline to production systems. You will be responsible for building and maintaining the entire customer pipeline and infrastructure needs for the hosted offerings of IQGEO. You will be a key member of the team, where the rubber meets the road, doing work that matters for our customers, and utilizing the latest technologies in Cloud Computing, GIS, containers, and Kubernetes. Providing backend support and ensuring that support and Implementation Services are setup for success from an infrastructure and pipeline perspective. Be available for an on-call rotation.
Duties/Responsibilities
Actively and consistently supports all efforts to simplify and enhance the customer experience.
- Build resilient, self-healing systems that could scale seamlessly (high-performance) and improve system reliability (always available)
- Monitor system health using various charts, graphs and logs, detect and trace problems and react to issues at scale
- Write post-mortems, participate in forensic root cause analysis to implement corrective measures preventing issue(s) from reoccurring
- Create, modify, evolve and document risk-mitigation strategies to eliminate potential risks that could impact performance, scalability and reliability of systems and services,
- Create, modify, evolve, repair and/or maintain scripts to secure CI/CD pipelines across Multiple Domains
- Create, modify or evolve current processes for source control, build, integration, automated test, security scanning, and delivery of applications
- Will be required to interact with Product House and Implementation Services to deliver the end-to-end pipelines of software delivery
- Create automated development and operations scripts/processes to ensure reliability, scalability, repeatability of pipelines without error, bugs or with very minimal customer impact
- Leverage Infrastructure as Code and Configuration as Code to automate deployments
- Be held to MTTR or other SLAs
Required Skills & Abilities
- Expert with GitHub or similar source repository and CI/CD collaboration platform
- Expert with platforms such as Kubernetes, EKS, or another container orchestration platform
- Expert with scripting language such as Python or bash; sed/awk is preferred
- Experience with Docker or similar container technology
- Experience with PostgreSQL
- Understanding of a well architected Cloud Infrastructure deployment in AWS, Including:
- Compute
- Storage
- Database
- Networking
- VPC
- Subnets
- CIDRs
- NACLs
- Security Groups
- VPN
- Transit Gateway
- Transit Gateway Attachments
Desirable
- Experience with Cloud Computing and Hybrid On-Prem solutions
- AWS Certification (Developer, DevOps, Architect, etc.)
- Experience with Ansible, Chef, Puppet or similar Configuration Management technology
- Experience with ArgoCD or similar continuous delivery technology
- Experience with Terraform or similar Infrastructure as code tool
- Experience with Docker or similar container technology
- Experience with Kubernetes or another container orchestration platform
Education and Experience
- Bachelor's degree from a four-year college or university, and engineering Degree is preferred
- 5+ years of telecom industry, wireless industry, or customer management (or relevant) experience
- An equivalent combination of education and experience will be considered
How We Work
- We deploy, support, and administer the IQGEO Cloud Infrastructure for our clients, training, support, and other departments.
- We establish pipelines for use by Implementation Services and Engineering for providing customers with solutions.
- We seek to collaborate with respect, build trust and learn continuously while selflessly sharing knowledge and experience with team members
What’s In it For You
- Medical, Dental, Vision, Life insurance: monthly premiums are paid 100% for employee, spouse, and family! No employee contribution to
benefit plan required!
- STD/LTD insurance fully paid.
- Generous PTO with 8 paid holidays plus 2 “floating” holidays.
- Paid charity/volunteering day each year.
- Enhanced maternity leave policy (full-pay 3 months, half-pay 3 additional months) after 2 years of service.
- 401k Safe Harbor contribution, fully vested day one.
- Mentor program.
- Home office support for remote workers.
Flexible Working
We support hybrid and flexible working arrangements for all employees. We understand that life for many people involves school runs, caregiving, or exercising!
Work Permits & Visas
You must already have the right to work permanently in United States.
IQGeo is not able to sponsor work permits.
About IQGeo
IQGeo™ is based in Cambridge, UK with regional offices in the United States, Canada, Belgium, Germany, Malaysia, and Japan. We are supported by a global network of highly skilled partners. Originally founded as Ubisense Ltd in 2002, the IQGeo brand was launched in January 2019 after the company was split into two separate businesses. Led by a team of geospatial technology pioneers, the IQGeo Platform software was first launched in 2010 and has an impressive pedigree in the telco, communications, and utility industries. In 2020, IQGeo acquired OSPInsight, a provider of fiber network management software, and in 2022 IQGeo acquired Comsof, a world leader in automated network design, headquartered in Belgium.
Today, IQGeo is the leader in introducing modern web and mobile geospatial applications into the communications and utility industries.