About Saviynt:
Saviynt empowers organizations to secure and manage access to their critical assets in today's dynamic digital landscape. Our Enterprise Identity Cloud provides unparalleled visibility, control, and intelligence, enabling businesses to mitigate cyber risks while ensuring employees have the right access at the right time.
WHAT YOU WILL BE DOING
Lead and manage a 24x7 team of Cloud Operations engineers, ensuring adequate staffing and shift coverage to maintain continuous operations. Oversee the monitoring of our SaaS application and underlying infrastructure (Kubernetes on AWS and Azure, VPN connections, customer applications, Elastic Search, MySQL) for alerts and performance issues. Manage the full lifecycle of alerts, incidents, and service requests reported through FreshService, ensuring timely and accurate logging, prioritization, resolution, and escalation. Develop, implement, and maintain operational procedures, runbooks, and knowledge base articles to standardize incident resolution and service request fulfillment. Drive continuous improvement initiatives to optimize operational efficiency, reduce incident rates, and improve service request turnaround times. Collaborate with engineering, development, and SRE teams to troubleshoot complex issues, identify root causes, and implement preventative measures. Ensure adherence to defined SLAs (Service Level Agreements) and KPIs (Key Performance Indicators) for operational performance. Generate regular reports on operational metrics, incident trends, and service request performance for management review. Participate in on-call escalations as needed and provide leadership during critical incidents.Foster a collaborative and high-performing team environment, providing coaching, mentoring, and performance feedback to team members. Manage and maintain operational documentation, including system diagrams, contact lists, and escalation paths. Ensure compliance with relevant security and compliance policies within the operations center. Plan and coordinate scheduled maintenance activities with minimal impact to service availability. 
PI270399237