Staff Site Reliability Engineer - (RID-00185)
Introduced in July 2018, Setel is a mobile platform that aims to delight customers by innovating for better, inclusive mobility. Setel serves customers across Malaysia by powering one app as the constant companion to ease motorists' journey across fueling, parking, EV charging, motor insurance, road tax, auto assistance, general purchases, and more across an ecosystem of PETRONAS petrol stations, retail partners, and online merchants.
Role Purpose:
We're looking for an SRE to join the Setel Engineering team. We are obsessed with delivering a seamless and frictionless retail experience for our customers. If you live to solve hard problems, love proving out new technologies, and take pride in your deliverables, then we'd love to meet you!
In This Role You Will:
Be an expert in Setel infrastructure and develop best practices to help development teams utilize cloud-native infrastructure effectively.
Design, build, and test out tools/libraries to improve overall architecture in terms of performance, cost efficiency, reliability, and scalability.
Brainstorm new ideas to improve development quality and speed.
Automate all aspects of deployment with CI/CD pipelines and Infrastructure as Code (IaC).
Provide technical guidance and educate development teams on DevOps practice. Continuously improve pipelines and tools based on feedback.
Ensure all services are monitored with proper alerts. Provide support for production issues when required.
Monitor and optimize the cost of infrastructure and tooling.
Take the lead on capacity planning to help Setel teams anticipate and prepare for growth.
Ensure policies and process documentation are up to date.
Manage, enforce, and simulate the disaster recovery and backup policies.
Assist with any related tasks, projects, and other assigned duties as and when deemed necessary.
Ensure adherence to the compliance of company policies, industry regulations, and legal requirements.
You're A Great Fit If You Have:
5+ years of experience as a Software, DevOps, or Site Reliability Engineer.
Great verbal and written communication skills.
Experience in deploying containers on orchestrators such as ECS, Kubernetes, and Swarm.
Excellent knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform.
Excellent understanding of large-scale distributed systems in practice, especially in cloud-native architectures.
Experience in deploying micro-service architectures in production and understanding best practices.
Experience in building CI/CD pipelines using tools such as ArgoCD, Argo Workflows, GitLab CI, and CircleCI.
Good understanding of Linux OS, security, networking, and scripting (Bash, PowerShell, Python, or similar).
Able to multitask, prioritize, and manage time efficiently.
Able to work in distributed teams across multiple time zones.
Ability to handle sensitive information with confidentiality.
Excellent communication and interpersonal skills.
What Makes Working With Us Awesome:
Our people and culture: You will get to work with awesome and friendly colleagues with whom you can expect to collaborate well to deliver your work.
Availability of tools and applications: You will be provided with different tools to facilitate your work.
Development-focused: Your learning and growth matter most to us.
Relax and unwind in the leisure area with video games, board games, books, and more.
Wear your favourite jeans, or any cool OOTD so that you can work comfortably.
Coffee, tea, or snacks are available for consumption at the pantry.
A healthy body leads to a brilliant mind. Let's get moving with the inter-company sports team.
There will be workshops, talent shows, sports activities, and other events for sharing and bonding.