Title : Sr. Network Engineer
Location: Seattle, WA (Onsite)
Duration: 6 Months
Primary Skills :
- BS/MS with 6+ years of experience in network architecture, deployment and troubleshooting of routers/switches, network interconnects, etc
- Prior experience in L2/L3 technologies, and routing protocols: Ethernet, RDMA/RoCE, IPv4/IPv6, BGP, etc
- Robust understanding and usage of networking tools
- programming languages: Python, scripting, etc. (Nice to have)
Cluster Network Engineering is looking for experienced Network Developers interested in designing, implementing, and automating our RDMA fabrics to support our GPU business. We're looking for innovative thinkers who enjoy tackling new and exciting challenges.
Description
Supports the design, deployment, and operations of a large-scale global cloud computing environment (Cloud Infrastructure - OCI). Primarily focused on development and support of network fabric and systems through a combination of a deep level understanding of networking at the protocol level coupled with programming skills to support the intensive automation required to operate a production environment. As OCI is a cloud-based network with a global footprint, this support will include hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure and the Internet.
Career Level - Senior
Responsibilities
Participate in Network lifecycle management through network build and/or upgrade projects. Collaborate with program/project managers to develop milestones and deliverables. Will primarily use existing procedures and tools to develop and safely execute network change. However, may have to develop new procedures from time to time. Serve as technical lead for team projects. Contributes to the development of roadmap issues. Leads development of new runbooks and method of procedures. Mentors junior engineers. Participates in network solution and architecture design process. Responsibility for developing standalone features. Participate in operational rotations as either primary or secondary. Provide break-fix support for events. Serve as escalation point for event remediation. Lead post-event root cause analysis. Coordinate with networking automation services for the development and integration of support tooling. Frequently develops scripts to automate routine tasks for team and business unit. Serves as SME on software development projects for network automation. Supports network vendor software bug fixes. Collaborate with network vendor technical account team and internal Quality Assurance team to drive bug resolution and assist in the qualification of new firmware and/or operating systems.
Internal Contact Email