BENGALURU, KARNATAKA, India
1 day ago
Senior Service Reliability Operator (SRO), Cloud Operations

Your Role/Opportunity:
An opportunity for Service Reliability Operator (SRO) who will ensure the availability and resiliency of our Cloud services 24x7x365. The ideal candidate will have a pulse on the Oracle’s SaaS services and be accountable for the troubleshooting and resolution of service issues.
Additionally, you will have the opportunity to create future automation and tooling that will allow us to continuously improve our service.  Your role in driving improvements in availability, effort and velocity will delight our customers with and while reducing costs of Operations.
You will leverage excellence in communication, technical/business analysis, problem solving and attention to detail to methodically resolve issues.  

 

Responsibilities

 

In this role you will need to:

Technical Resolution of Service Issues Automation of day-on-day operation work. Troubleshooting: have a deep understanding of our services and dependencies in order to respond quickly and efficiently to major incidents and minimize service disruptions when they occur Identify the processes which becomes bottlenecks in operations management and resolve them through process improvement, automation. Stay informed of new technologies, Innovate.       Ownership: understand internal team process and ensure compliance with them. Administer production servers/services and test system health. Offer mitigation paths to accelerate the process of system recovery. Work with system monitoring and alerting tools to identify trouble source. Execute defined SOPs to avoid or reduce event impact duration. Undertake Incident Command training and experience working on an on-call rotation. Contribute to Technical Resolution of Service Issues Contribute to automation of day-on-day operation work.


Our Ideal Candidate:

Bachelor's degree in CS, EE, or equivalent 5+ years’ work experience in supporting Production Services Excellent working experience in Unix/Linux/Windows OS Exposure to DevSecOps Tools, OCI. Excellent working /Troubleshooting experience in Application Middleware/Tomcat/Weblogic Servers. Demonstrable experience in one or more scripting/programming languages:  Python, Java, Perl, shell Strong communication and analytical skills Understanding of virtualization solutions and Cloud services Able to work as part of a shift in a 24x7x365 operations team. Understanding of monitoring / dashboards (e.g. Enterprise Manager, Grafana, Kibana or Equivalent, Splunk etc) Excellent problem-solving skills Technical background with an ability to troubleshoot issues impacting large scale service architectures and application stacks. Handles hard problems with a positive "can do" attitude. Team player and able to work with others all skill levels. Understanding AI from operation perspective.

Career Level - IC3

Confirm your E-mail: Send Email