Cyberjaya, Malaysia| Posted on 10/09/2024
As a Manager of Disaster Recovery, you'll be based at Deriv's vibrant headquarters in Cyberjaya, Malaysia — the Silicon Valley of Southeast Asia. Our state-of-the-art office is already home to a diverse community of nearly 550 brilliant minds, and we're expanding to welcome even more talent. Imagine yourself at the core of Deriv's global operations, surrounded by innovation, cultural richness, and endless opportunities for growth.Cyberjaya offers a unique blend of modern living and lush tropical surroundings, perfect for those seeking a balanced lifestyle. As part of our HQ team, you'll be at the forefront of shaping Deriv's future, collaborating with professionals from around the world in a dynamic, cutting-edge environment.Your challenges and missionBe the guardian of our digital world. You'll be the architect of our resilience, ensuring Deriv's systems can weather any storm. It's not just about backup plans, it's about building a fortress that keeps us running smoothly, no matter what.Strategise and lead. You'll create comprehensive disaster recovery (DR) plans, mentor your team, and guide us through the ever-changing landscape of IT resilience. You'll provide leadership and direction for Business Impact Analysis (BIA) and DR planning.Build unbreakable systems. You'll design and deploy cutting-edge DR solutions tailored to our critical cloud applications and services (AWS, GCP, Azure), ensuring they're robust, scalable, and ready for anything.Anticipate and mitigate risks. You'll conduct deep-dive risk assessments, leverage machine learning, and lead exercises that prepare us for the unexpected. Ensure DR strategies meet or exceed RTO and RPO.Automate for speed and efficiency. You'll develop frameworks and orchestration tools (e.g., Jenkins, AWS Step Functions, Ansible, Terraform, AWS CloudFormation) that streamline recovery processes, minimising downtime and maximising our ability to bounce back quickly. Leverage IaC techniques to automate the deployment, configuration, and testing of disaster recovery environments.Test, validate, and improve. You'll design rigorous testing protocols, including disaster recovery drills and simulations, using observability tools like Grafana and AWS CloudWatch to ensure our plans are battle-tested and effective. Implement logging frameworks to ensure continuous monitoring, validation, and improvement of disaster recovery procedures. Integrate chaos engineering practices with automated testing tools to stress-test systems.Collaborate across boundaries. You'll partner with teams across the organisation, ensuring everyone understands their role in disaster recovery and is ready to act when needed. Work closely with architects, system engineers, and security specialists.Lead in the face of crisis. When disruptions occur, you'll take charge, coordinating recovery efforts and minimising impact with a calm and steady hand.Ensure compliance and readiness. You'll navigate the complex regulatory landscape, ensuring we're always prepared for audits and inspections, especially within the high-stakes financial sector. Provide detailed performance reports to senior leadership, regulatory bodies, and stakeholders.Never stop learning. You'll stay ahead of the curve, continuously improving our disaster recovery capabilities and sharing your knowledge with the team.Requirements10+ years in disaster recovery, business continuity, or a related field3+ years in a leadership role within a highly technical environmentIn-depth experience with AWS services critical to disaster recovery, such as AWS Backup, Amazon RDS Multi-AZ deployments, AWS Elastic Disaster Recovery, AWS CloudFormation, AWS Global Accelerator, and AWS Fault Injection Simulator (FIS)Proficiency in managing DR within cloud environments (AWS, GCP, Azure) and hybrid architecturesExtensive knowledge of modern architectures—microservices, serverless computing, containerisationExperience with tools like Docker and KubernetesProven track record of leading complex, high-impact DR projectsStrong familiarity with Agile methodologies and Business Continuity principlesExceptional analytical skillsAbility to craft innovative solutions for complex DR challengesAbility to communicate complex technical concepts to executive leadership and lead cross-functional teamsBachelor's degree in Computer Science, Information Technology, or related fieldMaster's degree or relevant certifications (e.g., CDRP, CISSP, ISO 22301 LI) are a plus#J-18808-Ljbffr