Responsible for the monitoring, provisioning, resiliency, and customer interactions.
Working extensively with Windows including patching and certificate provisioning and renewals.
Able to map out existing infrastructure flows and dependencies.
Leading root cause analysis meetings.
Responding to incidents.
Understanding of logs and monitoring tools (Splunk, Sumo Logic, New Relic, Dyanatrace, etc)
Strong scripting in at least one language (Perl, PowerShell, Go, Python, etc.)
Other Must Haves/Nice to Haves:
• Experience working in a high capacity, highly scalable mission-critical web serving environment
• Proven ability to participate with other functional teams in systems integration and design including writing operational specifications, test plans and requirements management with attention to detail
• UNIX/LINUX and Windows and server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
• Web (IIS, Apache), .Net & Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
• Experience in at least two relevant scripting or programming languages (Ruby, Perl, Python, Shell, PowerShell, etc.)
• Experience with Configuration Management platforms (Chef, Ansible, CFEngine, Puppet, etc.)
• Database Administration – setup, configuration and basic database troubleshooting skills
• Understanding of internet standards such as HTTP, DNS, FTP, SSH, HTML, XML, JDBC, ODBC, SNMP and other protocols
• Understanding of high availability hardware and database systems design and implementation including cluster management, redundancy and failover testing
• Knowledge of storage systems (SAN, NAS, RAID Array, etc)
• Experience hardening and maintaining secure systems (Safe Harbor or PCI experience a plus!)
• Network hardware architecting experience with load balancing equipment, switches, routers, and network troubleshooting
• Ability to produce system documentation, including writing requirements, operational specifications, system architecture, test plans and as-built documentation, all with attention to detail
• Experience working with ITIL and Service Management best practices is a plus.
• Ability to build strong relationships and influence others across the organization
• Demonstrated knowledge of agile project methodologies
• 5+ years experience designing, supporting and deploying Internet-based products or services
• 4+ years operating complex, large-scale Enterprise guest-facing Applications or web sites