Site Reliability Engineer
1 day ago
This job is for a Site Reliability Engineer focusing on network systems. You might like this job because you'll automate tasks, ensure services run smoothly with minimal downtime, and work with coding languages like Java or Python in a dynamic tech environment.
Job Description:
• Ability to debug scripts and automate routine tasks on OS, network, database or application servers, with coding experience beyond simple scripts;
• Experience with DevOps processes and programming knowledge in at least one of the following languages: Java, Python, or Go;
• Scripting skills in at least one of the following: Shell, Terraform, Ansible, Chef or Puppet;
• Deep understanding of Unix/Linux operating systems, virtual machines, containers, container management systems, enterprise cloud platforms and data structures;
• Engage in and improve the lifecycle of services—from launch through to deployment, operation and optimization in reliability and user experience;
• Ensure services are reliable once they are live by measuring and monitoring availability, latency, and overall system health. Practice sustainable incident response;
• The service reliability SLA is greater than or equal to 99.99% annual downtime = 95%. Major and critical alarms should be handled promptly and cleared within 24 hours;
• Dual-cloud drills are 100% completed as required (once every six months), and the drill summary materials are archived as required;
• The average closure duration of change flows meets the department's annual KPI requirements (the 2022 target is less than 4 days, and it is updated every year);
• Provide on-call duty to handle daily alerts, work orders, upgrades, etc.;
• Other triggered tasks, such as OS patch upgrades and security hardening, are completed according to the project's planned schedule.
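For a sense of scale, the 99.99% availability figure above can be turned into a yearly downtime budget. The sketch below is only illustrative: it assumes a 365-day year and treats availability as uptime divided by total time. The function name `downtime_budget_minutes` is hypothetical, and the posting's own "annual downtime = 95%" figure is not interpreted here.

```python
# Illustrative only: converts an availability target into a downtime budget.
# Assumes a 365-day year; the helper name is hypothetical, not from the posting.
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes


def downtime_budget_minutes(availability_pct: float,
                            period_minutes: float = MINUTES_PER_YEAR) -> float:
    """Return the maximum downtime (in minutes) allowed by an availability target."""
    return period_minutes * (1 - availability_pct / 100)


budget = downtime_budget_minutes(99.99)
print(f"99.99% availability allows about {budget:.1f} minutes of downtime per year")
```

Under these assumptions the budget is a little under an hour per year, which is why the 24-hour alarm clearance window and the on-call duty appear as separate targets.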
Requirements:
• Bachelor's degree or above in Computer Science/Electronics & Communication;
• Have in-depth knowledge of the SRE role and DevOps processes;
• Have strong observation and critical-thinking skills to handle business emergencies;
• Ability to adapt to a dynamic environment and apply problem-solving skills to resolve issues;
• Have excellent written and verbal communication skills.
Nice to have:
- Experience with Linux, Oracle or any networking-related system.
-
Site Reliability Engineer
1 day ago
Kuala Lumpur, Kuala Lumpur, Malaysia Chinasoft International (CSI) Full time
Company Description: We suggest you enter details here. Role Description: This is a full-time on-site role for a Site Reliability Engineer based in WP. Kuala Lumpur. The Site Reliability Engineer will be responsible for maintaining system reliability and availability. Daily tasks will include troubleshooting issues, ensuring proper infrastructure setup, and...
-
Site Reliability Engineer
10 hours ago
Kuala Lumpur, Kuala Lumpur, Malaysia Glints Full time
Glints, Federal Territory of Kuala Lumpur, Malaysia. Site Reliability Engineer. Ready to elevate your career with a globally recognized professional services firm? We are seeking a skilled DevOps / SRE Specialist to join our team. You'll be at the forefront of transforming business challenges into cutting-edge technology solutions, working alongside diverse...
-
Site Reliability Expert
3 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia ServeDeck Innovation Sdn Bhd Full time
Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team at ServeDeck Innovation Sdn Bhd. As a key member of our engineering team, you will be responsible for designing, managing, and optimizing our continuous integration and continuous delivery pipelines. You will work closely with our development and support teams to ensure...
-
Site Reliability Engineer
24 hours ago
Kuala Lumpur, Kuala Lumpur, Malaysia PERSOLKELLY Malaysia Full time
Achieve service excellence as a Site Reliability Engineer specializing in network systems! You will join a team that aims to deliver high-quality services through automation, reliability and efficiency. Key Job Duties: Debug scripts and automate routine tasks in OS, network, database or application servers. Coding experience beyond simple scripts is required. Our...
-
Site Reliability Systems Manager
1 day ago
Kuala Lumpur, Kuala Lumpur, Malaysia WCC Full time
Job Description: Sysadmin, IT Operations, and DevOps Expertise; Distributed Production Load Management; Continuous Integration, Continuous Deployment, and Continuous Improvement. We are seeking a skilled Site Reliability Engineer to support our product owners and DevOps team in determining which new features can be launched and when, using service-level agreements...
-
Site Reliability Specialist
7 hours ago
Kuala Lumpur, Kuala Lumpur, Malaysia Glints Full time
We are seeking a skilled DevOps/SRE specialist to join our team at Glints. This role will be at the forefront of transforming business challenges into cutting-edge technology solutions. Key Responsibilities: Conduct research and analysis to develop innovative, technology-enabled business solutions. Support the design and delivery of digital solution...
-
Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Businesslist Full time
Responsibilities: The successful Reliability Engineer will be responsible for designing and implementing reliability solutions to improve product quality and reduce costs at Businesslist. Key responsibilities include developing and implementing reliability engineering strategies, conducting failure mode and effects analysis (FMEA), and identifying...
-
Cloud Reliability Engineer
7 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Unison Consulting Full time
We are seeking a talented DevOps/Site Reliability Engineer with a strong background in DevOps practices, Linux environments, and proficiency in scripting languages like Bash, Shell, Python, or Golang. As a Cloud Reliability Engineer at Unison Consulting, you will be responsible for managing and automating the deployment, monitoring, and reliability of our...
-
Reliability Strategist
6 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Elsa Talent Solutions Sdn. Bhd. Full time
At Elsa Talent Solutions Sdn. Bhd., we are seeking a talented Reliability & Integrity Management Engineer to join our team. As a key member of our reliability engineering team, you will be responsible for creating and executing strategies to enhance system performance and minimize downtime. You will utilize Six Sigma methodologies to identify and eliminate...
-
Reliability Engineering Specialist
3 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Elsa Energy Full time
Reliability Engineering Specialist. Elsa Energy is seeking a skilled Reliability Engineering Specialist to enhance the reliability and efficiency of our physical asset management systems and processes. The ideal candidate will possess a strong background in Six Sigma methodologies and a working understanding of artificial intelligence (AI) applications. Key...
-
System Reliability Specialist
4 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Ant International Full time
Senior Site Reliability Engineer. Ant International enables global commerce with digital innovation. Our goal is to empower businesses and individuals worldwide through comprehensive tech-driven services. We seek a Senior Site Reliability Engineer to work on end-to-end solutions for cross-border payments for our global merchants and globalization business at...
-
Software Reliability Specialist
24 hours ago
Kuala Lumpur, Kuala Lumpur, Malaysia Chinasoft International (CSI) Full time
We're looking for a skilled Software Reliability Specialist to join our team at Chinasoft International (CSI). This is a full-time on-site role based in Kuala Lumpur, where you'll play a crucial role in maintaining system reliability and availability. Your daily tasks will involve troubleshooting issues, setting up infrastructure, and performing system...
-
System Reliability Engineer
1 day ago
Kuala Lumpur, Kuala Lumpur, Malaysia Net2Source Inc. Full time
About the Position: We are seeking a System Reliability Engineer to join our IT team in Kuala Lumpur, Malaysia. In this role, you will be responsible for ensuring the reliability and performance of our systems and applications. Main Responsibilities: Designing and implementing system reliability solutions. Monitoring and analyzing system performance, identifying...
-
Reliability Systems Architect
3 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia ServeDeck Innovation Sdn Bhd Full time
Requirements: 3+ years experience in Site Reliability Engineering, DevOps, or related roles. Hands-on experience with AWS cloud infrastructure and services. Experience in infrastructure as code, containerization, and orchestration. Knowledge of scripting languages such as Bash, Python, and Go. Strong understanding of ISO 27001 policies and processes. Awareness of...
-
Project Site Engineer
22 hours ago
Kuala Lumpur, Kuala Lumpur, Malaysia SPECIFIC DIMENSION SDN BHD Full time
Job Overview: SPECIFIC DIMENSION SDN BHD is seeking a highly skilled Project Site Engineer to lead our project activities. The ideal candidate will have experience in managing engineering projects and ensuring timely completion. Key Responsibilities: Project Planning and Execution; Site Inspection and Quality Control; Team Management and Coordination; Reporting and...
-
Database Reliability Engineer
6 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Itjobsworldwide Full time
Job Description: We are seeking a skilled Database Reliability Engineer to join our dynamic team in Kuala Lumpur. As a key member of our database team, you will be responsible for designing, implementing, and maintaining high-availability MariaDB/Mongo clusters. The ideal candidate will have a strong understanding of SQL, application performance, and experience...
-
SaaS Support and Reliability Engineer
3 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Symphony Industrial AI, Inc. Full time
Symphony Industrial AI, Inc. is a leading provider of financial services software, delivering cutting-edge compliance and fraud solutions. We are seeking a highly skilled Customer Operations Engineer to join our team. As a Customer Operations Engineer, you will be responsible for ensuring the reliability, performance, and availability of our SaaS solutions....
-
Kuala Lumpur, Kuala Lumpur, Malaysia SBM Offshore Full time
Reliability Engineer for Piping and Pressure Systems. SBM Offshore is a leading player in the deepwater ocean-infrastructure industry. We are seeking a Reliability Engineer for Piping and Pressure Systems to join our team. The successful candidate will have a strong background in mechanical engineering and experience in risk-based inspection management of...
-
Operational Reliability Specialist
6 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia SBM Offshore Full time
Operational Reliability Specialist. The Operational Reliability Specialist is responsible for ensuring the operational reliability and safety compliance of electrical systems, instrumentation, and ICSS. This includes developing and implementing strategies for enhancing the reliability and efficiency of ICET systems, leading and managing the ICET team, and...
-
Reliability and Efficiency Expert
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia PEOPLE PROFILERS Full time
Are you a highly skilled Technical System Specialist looking for a new challenge? People Profilers is seeking a talented individual to join their team in Kuala Lumpur, Malaysia. About the Opportunity: This is an exciting opportunity to work with our clients' complex business problems, leveraging your training in technology and analytical abilities to support...