Current jobs related to Senior Site Reliability Engineer - Kuala Lumpur, Kuala Lumpur - Ant International

  • Reliability Engineer

    3 weeks ago


    Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full time

    About usAt ExxonMobil, our vision is to lead in energy innovations that advance modern living and a net-zero future. As one of the world's largest publicly traded energy and chemical companies, we are powered by a unique and diverse workforce fueled by the pride in what we do and what we stand for.The success of our Upstream, Product Solutions and Low Carbon...


  • Kuala Lumpur, Kuala Lumpur, Malaysia SWIFT Full time

    About the PositionWe are seeking a highly skilled Senior Site Reliability Engineer to lead our team responsible for ensuring the reliability, uptime, and performance of our mission-critical systems. As a Senior SRE Manager, you will be responsible for providing technical leadership, mentorship, and guidance to your team members, as well as collaborating with...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Businesslist Full time

    A successful Site Reliability Engineer will possess:Hands-on experience with Azure to manage and optimize cloud infrastructure.Exceptional problem-solving abilities, with the resilience to perform well under pressure.Proficiency in Infrastructure as Code (e.g., Terraform) to automate and streamline processes.Expertise in containerization technologies like...


  • Kuala Lumpur, Kuala Lumpur, Malaysia iSoftStone Full time

    iSoftStone WP. Kuala Lumpur, Federal Territory of Kuala Lumpur, MalaysiaSite Reliability EngineeriSoftStone WP. Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia2 weeks ago Be among the first 25 applicantsEnsure the stability of Alibaba apsara stack and cloud services running on it. Carry health check, operation & maintenance, troubleshooting tasks....


  • Kuala Lumpur, Kuala Lumpur, Malaysia Businesslist Full time

    Ensure high availability and performance of AWS infrastructure, applications, and services.Design, Architect & Implement auto-scaling, load balancing, and failover strategies in AWS environments.Automate repetitive tasks, workflows, and deployments using scripting or various technologies (e.g., PowerShell, Python, Terraform, Ansible, etc).Deploy...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Tetrasparks Sdn Bhd Full time

    1 week ago Be among the first 25 applicantsDirect message the job poster from Tetrasparks Sdn BhdHuman Resources Recruiter at Tetrasparks Sdn BhdWe are seeking an experienced and dynamic Site Reliability Engineering Lead to drive the reliability, scalability, and performance of our production systems in I-Gaming industry. In this role, you will lead a team...


  • Kuala Lumpur, Kuala Lumpur, Malaysia ServeDeck Innovation Sdn Bhd Full time

    Direct message the job poster from ServeDeck Innovation Sdn BhdHR Professional | Talent Acquisition | Labor Law SavvyResponsibilitiesDesign, manage and optimize Continuous Integration and Continuous Delivery.Monitor application and infrastructure (cloud) to ensure system security and availability using the real time dashboard and active alerting...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Unison Consulting Full time

    Site Reliability Engineer (Mandarin Speaker)5 days ago Be among the first 25 applicantsOverview: As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Randstad (Schweiz) AG Full time

    Site Reliability Engineer / SRE (Hybrid) | KLHybridOverview:As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Manulife Full time

    Senior Platform Reliability Engineer - Azure API Management (APIM)Manulife Kuala Lumpur, Federal Territory of Kuala Lumpur, MalaysiaJoin or sign in to find your next jobJoin to apply for the Senior Platform Reliability Engineer - Azure API Management (APIM) role at ManulifeSenior Platform Reliability Engineer - Azure API Management (APIM)Manulife Kuala...

Senior Site Reliability Engineer

1 month ago


Kuala Lumpur, Kuala Lumpur, Malaysia Ant International Full time
Senior Site Reliability Engineer (DevOps)

Ant International powers the future of global commerce with digital innovation for everyone and every business to thrive. In close collaboration with partners, we support merchants of all sizes worldwide to realize their growth aspirations through a comprehensive range of tech-driven digital payment and financial services solutions.

We are seeking a Senior Site Reliability Engineer for our Malaysia Tech Center to work on end-to-end solutions for cross-border payments for our global merchants and globalization business.

Key Responsibilities:

  • Collaborate with global teams to complete daily operations and alarm handling.
  • Identify and implement solutions for stability, scalability, and security of business infrastructure using frameworks and industry best practices.
  • Drive and manage technical and solution architecture discussions between global teams and partners to ensure timely delivery that meets customer needs.
  • Plan and execute a roadmap for strategic infrastructure improvement incorporating initiatives that align with the company goals.

Qualifications:

  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
  • 7+ years of experience in site reliability engineering.
  • Extensive experience in performing O&M activities, including security patching, version upgrades, alarm management, and handling in public cloud environments, especially Google Cloud or AWS services.
  • Advanced proficiency and understanding of the factors and scenarios that generate technology risks in public cloud infrastructure.
  • Ability to manage and prevent these risks, and design general technology risk solutions/systems/products through systematic abstraction.
  • Excellent communication and interpersonal skills with a very proactive attitude in solving difficult problems.
Seniority Level

Mid-Senior level

Employment Type

Full-time

Job Function

Engineering and Information Technology

#J-18808-Ljbffr