Platform Reliability Engineer

2 days ago


Kuala Lumpur, Kuala Lumpur, Malaysia Tata Consultancy Services (TCS) Full time 90,000 - 120,000 per year

Roles & Responsibilities:

Job Purpose:

The Platform Reliability Engineer (PRE) is responsible for engineering, operating, and maintaining GEL's internal container platform and its supporting infrastructure, with a strong focus on reliability, resiliency, and security. As a Senior PRE within GEL's Infrastructure team, you will play a pivotal role in designing, building, and operating distributed container hosting solutions using Broadcom's Tanzu product.

The Job:

  • As a Senior Platform Reliability Engineer, you will play a key role in maintaining the stability, reliability, and efficiency of GEL's internal container platform and its supporting infrastructure. Your responsibilities will include core operational tasks such as resource provisioning and management, responding to platform and application outages, capacity planning, monitoring, and driving reliability enhancements.
  • You will continuously evaluate the platform's technical architecture to ensure it scales effectively with evolving application demands. This includes proactively identifying and resolving reliability issues, analyzing product dependencies, pinpointing performance bottlenecks, and implementing optimization strategies to enhance platform availability and cost efficiency.
  • In this role, you will participate in a 24/7 on-call rotation, promptly addressing alerts from the global monitoring team and resolving production incidents to maintain platform and application uptime. Additionally, you will regularly review team workflows to identify manual processes and implement automation solutions that reduce effort and minimize human error.
  • Regularly review the security advisories issued by Broadcom for the Tanzu suite of products and deploy product updates as required to keep the platform free of vulnerabilities.
  • Work with open-source technologies, CI/CD and SCM tools as necessary, and source control such as Bitbucket, and implement the organization's containers (e.g., Docker and Kubernetes). Stay current with industry trends and propose new ways for our business to improve.
  • Take accountability for considering business and regulatory compliance risks and take appropriate steps to mitigate them.
  • Maintain awareness of industry trends in regulatory compliance, emerging threats, and technologies in order to understand the risks and better safeguard the company.
  • Highlight any potential concerns or risks and proactively share best risk management practices.

Location

Kuala Lumpur

Job Function

IT INFRASTRUCTURE SERVICES

Role

Engineer

Job Id

379349

Desired Skills

Microsoft Platform Architecture

Desired Candidate Profile

Qualifications: Undergraduate


