Site Reliability Engineer

1 day ago


Kuala Lumpur, Kuala Lumpur, Malaysia PERSOLKELLY Malaysia Full time

This job is for a Site Reliability Engineer focusing on network systems. You might like this job because you'll automate tasks, keep services running smoothly with minimal downtime, and work with languages such as Java or Python in a dynamic tech environment.

Job Description:

• Ability to debug scripts and automate routine tasks on OS, network, database, or application servers, with coding experience beyond simple scripts;

• Experience with DevOps processes and programming knowledge in at least one of the following languages: Java, Python, or Go;

• Scripting skills in at least one of the following: Shell, Terraform, Ansible, Chef or Puppet;

• Deep understanding of Unix/Linux operating systems, virtual machines, containers, container management systems, enterprise cloud platforms and data structures;

• Engage in and improve the whole lifecycle of services, from launch through deployment, operation, and optimization of reliability and user experience;

• Ensure services remain reliable once they are live by measuring and monitoring availability, latency, and overall system health, and practice sustainable incident response.

Site Reliability Engineer Responsibilities:


• The service reliability SLA is greater than or equal to 99.99% annual downtime = 95%. Major and critical alarms should be handled promptly and cleared within 24 hours;

• Dual-cloud drills are completed 100% as required (once every six months), and drill summary materials are archived as required;

• The average closure time of change flows meets the department's annual KPI (in 2022, the target is less than 4 days; it is updated every year);

• Provide on-call coverage to handle daily alerts, work orders, upgrades, etc.;

• Other triggered tasks, such as OS patch upgrades and security hardening, are completed according to the project schedule.
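
For reference, the 99.99% SLA figure in the responsibilities above can be turned into a yearly downtime budget. The following is a minimal Python sketch of that arithmetic, assuming a 365-day year; the function name `downtime_budget_minutes` is illustrative and does not come from the posting.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # assumption: a non-leap, 365-day year


def downtime_budget_minutes(sla_percent: float,
                            period_minutes: float = MINUTES_PER_YEAR) -> float:
    """Return the maximum downtime (in minutes) an availability SLA allows."""
    if not 0 <= sla_percent <= 100:
        raise ValueError("sla_percent must be between 0 and 100")
    # The unavailable fraction of the period is (100 - SLA) percent.
    return period_minutes * (100 - sla_percent) / 100


if __name__ == "__main__":
    print(f"{downtime_budget_minutes(99.99):.2f} minutes per year")
```

Under these assumptions, a 99.99% target leaves about 52.56 minutes of downtime per year, or roughly 4.4 minutes per month.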

Job Requirements

Requirements:

• Bachelor's degree or above in Computer Science/Electronics & Communication;

• Have in-depth knowledge of the SRE role and DevOps processes;

• Have strong observation and critical-thinking skills to handle business emergencies;

• Ability to adapt to a dynamic environment and apply problem-solving skills to resolve issues;

• Have excellent written and verbal communication skills.

Nice to have:

• Experience with Linux, Oracle, or any networking-related system.
