Peraton is seeking a Cloud Reliability Systems Engineer in Chantilly, VA to support our Department of Defense customer as part of a highly talented, highly motivated and high-performing team. As part of the Infrastructure Operations and Maintenance Support team you will be responsible for the availability, performance, monitoring, and incident response, among other things, of the Cloud Infrastructure that we support in a 24x7 environment.
What you'll do:
- Ensure the 24x7 uptime of our multi-tenant cloud infrastructure
- Work closely with the engineering teams to improve our platforms and eliminate complexity from architecture and processes
- Configure and use state-of-the-art monitoring tools to gather insights and then act upon the results
- Conduct incident response and in-depth root cause analysis
- This position is hands-on, requiring the ability to provide first-level system and network support and problem resolution identification
- Responsible for the monitoring the daily software and network operations in a distributed environment
- Responsible for monitoring, working with users on fault isolation and resolution, as well as system analysis and reporting
- This job will include shift work to allow for complete 24x7 monitoring of software systems. Will need to have flexibility to work multiple shifts (day, mid, swing), as needed.
- Job is on-site at Peraton Chantilly, VA facility. No remote work allowed.