Responsibilities include diagnosing and resolving production incidents, writing Python and Bash scripts on the fly to support live troubleshooting and automation, and maintaining operational reliability across cloud networking environments.
Candidates should have hands-on expertise with remote access technologies such as FastConnect and IPsec, and with BGP for secure and scalable route distribution. A strong understanding of Linux system processes, memory utilisation, disk and log management, network functionality, containerisation, and the TCP/IP stack is essential.
The role involves triaging and resolving Severity 1 and 2 incidents under pressure using logs, metrics, and CLI tools, including failed changes and system or process failures that directly impact customers in a 24/7 operational environment.
Responsibilities
Hold authority for end-to-end performance and operability. Partner with global development teams to define and implement improvements in service architecture. Clearly articulate the technical characteristics of services and technology areas, guiding development teams to engineer and deliver premier capabilities within the Oracle Cloud service portfolio.
Develop and communicate a clear understanding of the scale, capacity, security, and performance attributes and requirements of the service and technology stack. Demonstrate a solid grasp of automation and orchestration principles.
Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Apply a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations. Understand and explain the impact of product architecture decisions on distributed systems.
Exhibit professional curiosity and a desire to develop a deep technical understanding of services and technologies.
Ensure that high-quality, accurate, and timely technical documentation of incidents, problems, changes, and standard operating procedures is maintained using tools such as Jira and Confluence.
Work is non-routine and highly complex, involving the application of advanced technical and business skills within the Virtual Networking specialisation of Oracle Cloud Infrastructure (OCI).
Qualifications
Career Level - IC4