DescriptionSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
ResponsibilitiesThe Hospitality Cloud SRE team is focused on maximizing service reliability for our hotel product service offerings across global Oracle data centres. Our team runs with a start-up like approach, leaving room for creative freedom. We have worked to assemble the smartest people in the industry to build and grow this revolutionary and disruptive team.
We are looking to add new members to this dynamic team and are seeking subject matter experts for designing and continuously improving reliability for all components within our solution portfolio.
About The Job
As part of the SRE team, you will be continually challenged and directly contribute to the success of our Oracle Hospitality cloud service offerings, every day, working closely with product and Infrastructure partners.
In this role, which is a mix of software development, service architectural design and operational readiness, you will solve interesting technical challenges by defining, designing deploying and troubleshooting key Oracle Cloud services, platforms, and infrastructure, always and thinking about reliability, scalability, resilience, security, and performance.
Ideal Qualification/ Experience
- BS or MS in Computer Science, or equivalent work experience
- Must have hands-on experience developing and deploying large-scale HA Cloud Native enterprise solutions according to MAA best practices.
- Competency in at least 2 languages (Java, JS, Shell/Bash, Node.js, Python).
- Competency in Microservices (Java & JS) Platforms (Node, Redis, Spring, Quarkus)
- Experience with Observability Driven Development
- Ability to create, manage and administer Production/UAT/development Cloud Native environments for an enterprise application.
- Good understanding and appreciation of Cloud Native Computing Foundation Charter (CNCF) and Cloud Native Technologies
- Knowledge of networking and security i.e. DNS records, Load Balancers (F5 / LbaaS /NGINX/HAProxy), subnets, TLS, SSL, SAML etc.
- Knowledge of Container technology (Docker etc.) and developing software to work in containers and container orchestration technologies (Kubernetes, Docker Swarm)
- Conducting performance tuning to maintain system reliability and stability.
- CI/CD pipeline development experience is a must (GitLab, Jenkins)
- Knowledge and experience of Monitoring and Observability pipelines and Observability enabling tools (Prometheus, Grafana, Thanos, ELK, Datadog etc) as well as Tracing (OpenTelemetry)
- Broad and deep experience with technology transformation(monolith to microservice) and significant Cloud solution experience in three areas:
- Multi Tier Application Modernization with K8S, Cloud Infrastructure Economics (DR, Dev/Test and Infrastructure as a Code).
- Multi -tier Application Transformation – Decomposition into a microservices architecture.
- Integration and API Development.
- Experience working on other Cloud providers including AWS, IBM and/or GCP.
- Experience with security API’s/Micro Services with OAuth and OpenID
- Methodical approach to troubleshooting complex problems
- Defining and documenting technical architecture of complex and highly scalable products
- Most importantly, the aptitude to be a good team player and the willingness to learn and implement the latest Cloud technologies