Role: Site Reliability Engineer (SRE)
Location: Indianapolis, IN (Hybrid)
Type: Long Term Contract
Required Skills: For this role, I am looking for a high-level Site Reliability Engineer. This is a HYBRID position based on the southwest side of downtown Indianapolis. Candidates MUST be able to work onsite 3 days a week (Tuesday - Thursday) to be considered. Since the role does not start until the beginning of the year, I can take relocation candidates, but they must be willing to relocate by or around January 6th. The following experience is required (non-negotiable): Site Reliability Engineering, Life Sciences experience, AWS, and regulatory experience. Please make sure all candidates have this experience before sending them over.
Our company Systems, a global leader in IT Services, is looking for a Sr. SRE for a large client in Indianapolis, IN. This role is hybrid, so we are looking for candidates willing to work out of Indianapolis 3 days a week. You will be working in the life sciences industry, so experience with GMP and GCP data is preferred. Please see below for details!
Qualifications
- Bachelor’s Degree in IT or equivalent field
- 3-5 years of relevant work experience (internships also count)
- Ability to effectively communicate and influence key stakeholders to support proposed strategies, process improvements and operational decisions
- Knowledge of DevSecOps, quality, project management, and software development approaches.
- Experience testing and mapping various processes to propose improvements
- Strong understanding of technology for gathering data and solving problems
- Excellent communication and leadership skills
Role And Responsibilities
- Collaborate within product team(s) to promote the concept of reliability engineering during all phases of the software lifecycle to detect and correct performance issues and meet availability goals.
- Work with stakeholders (e.g., product owners) to define service level objectives (SLOs) for system operations.
- Track performance against SLOs in partnership with others to ensure systems meet SLOs over time.
- Create software to improve system performance, scalability, and stability, and to automate manual operational work (i.e., “toil”).
- Participate in operational support and drive blameless post-mortems to troubleshoot and resolve priority incidents.
- Evaluate and recommend methods for improving automation, security, and system observability.
- Work with your product team as well as security and privacy experts to build, monitor, and maintain assurance techniques.
- Identify, assess, and mitigate key risks inherent to any dependencies your product has on third parties who develop, support, use, and/or assist with the lifecycle of your product.
Skillset Required
- Strong problem-solving and analytical skills; highly adaptable to changing circumstances.
- Experience in reliability engineering and monitoring practices, environments, and tools (e.g., a cloud ecosystem, preferably AWS; monitoring/observability; configuration management; CI/CD).
- Experience working with testing tools such as MABL and ALM.
- Experience with programming and scripting languages (e.g., Java, Python).
- Experience with large-scale databases, data movement, and analytics tools (e.g., RDS, DynamoDB).
- Experience with the ITIL v4 framework and processes, and the tools that support them (e.g., ServiceNow).
- Strong knowledge of agile practices and agile planning tools (e.g., Jira).
- Effective written and verbal communication skills for both technical and non-technical audiences.
- Experience working in a highly regulated environment, with a deep understanding of how quality standards and procedures are applied to high-risk applications, including protecting privacy-related data and identifying security threats.