Site Reliability Engineer
2 weeks ago
Direct message the job poster from ServeDeck Innovation Sdn Bhd
HR Professional | Talent Acquisition | Labor Law SavvyResponsibilities
- Design, manage and optimize Continuous Integration and Continuous Delivery.
- Monitor application and infrastructure (cloud) to ensure system security and availability using the real time dashboard and active alerting mechanism.
- Manage and monitor archival, backup and housekeeping of the data & application resources.
- Make the logs and other information from production & test environment to be securely accessible by people who need it.
- Perform necessary updates to the application, database and infrastructure as required by the business/operation and security requirements.
- Run security scan on the system & source code, perform early assessment on reported security findings and escalate to development team if necessary.
- Strive to increase the service reliability through establishing guidance and methods of improvement.
- Collaborate and cultivate relationships with Development and Support teams to improve reliability, stability and scalability of services.
- Deliver data and analytics to provide insights for our team from a reliability and resilience perspective.
- Identify and resolve problems relating to critical service operations and to prevent their recurrence using automation.
- Improve the incident management lifecycle to identify, mitigate, and learn from reliability risks.
- Work closely with internal teams to ensure technical and operational compliance with ISO 27001 requirements.
Requirements
Skills and Qualifications
- 3+ years experience in Site Reliability Engineering, DevOps, or related roles.
- Hands-on experience with AWS cloud infrastructure and services (e.g. EC2, RDS, IAM and etc.)
- Experience in infrastructure as code e.g. Terraform.
- Experience with containerization and orchestration (e.g. Docker, AWS EKS).
- Experience in managing and improving the processes within CI/CD tools (e.g. Jenkins), BitBucket repository and code quality scanner (e.g. SonarQube).
- Experience in application and infrastructure monitoring and familiar with application logging and monitoring tools such as Datadog and Grafana-Loki-Promtail stack.
- Familiar with scripting e.g. bash, python, go for task automation.
- Experience in managing linux servers.
- Awareness of the security practices, standards and processes will be an advantage.
- Experience analyzing and resolving performance, scalability and reliability issues.
- Knowledge on web application environments, such as TCP/IP, SSL/TLS, HTTP, DNS, routing, load balancing, CDNs, Tomcat, Apache, etc.
Preferred Qualifications
- Experience in ISO 27001 policies and processes.
- Experience in SaaS environments with multi-tenant architectures.
- Experience with PDPA, GDPR, SOC 2 or other compliance frameworks.
- Experience with performance testing using JMeter.
Benefits of Joining our Team include:
- Opportunities working with both international and high profile clients
- Enjoy hybrid work, flexible hours, result oriented, and collaborate with a mission-driven team invested in your growth.
- Outpatient medical coverage for employees, their spouses, and children, in accordance with company policy.
- Provision for spectacles and dental expenses for employees.
- A range of allowances, including travel and mobile phone expenses, among others.
- Product and technical training are provided, both from internal & external sources
- Accelerate your career through hands-on challenges, mentorship from leadership, and opportunities to lead as the team scales.
- Partner closely with product, and operation teams while owning decisions in a flat hierarchy.
- Collaborate with a passionate team using the latest technologies and frameworks.
Mid-Senior level
Employment typeFull-time
Job functionEngineering and Information Technology
IndustriesInformation Services
#J-18808-Ljbffr-
Reliability Engineer
3 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia The Chemical Engineer Full timeAbout usAt ExxonMobil, our vision is to lead in energy innovations that advance modern living and a net-zero future. As one of the world's largest publicly traded energy and chemical companies, we are powered by a unique and diverse workforce fueled by the pride in what we do and what we stand for.The success of our Upstream, Product Solutions and Low Carbon...
-
Site Reliability Engineer
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Businesslist Full timeA successful Site Reliability Engineer will possess:Hands-on experience with Azure to manage and optimize cloud infrastructure.Exceptional problem-solving abilities, with the resilience to perform well under pressure.Proficiency in Infrastructure as Code (e.g., Terraform) to automate and streamline processes.Expertise in containerization technologies like...
-
Site Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Businesslist Full timeEnsure high availability and performance of AWS infrastructure, applications, and services.Design, Architect & Implement auto-scaling, load balancing, and failover strategies in AWS environments.Automate repetitive tasks, workflows, and deployments using scripting or various technologies (e.g., PowerShell, Python, Terraform, Ansible, etc).Deploy...
-
Site Reliability Engineer
3 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Unison Consulting Full timeSite Reliability Engineer (Mandarin Speaker)5 days ago Be among the first 25 applicantsOverview: As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive...
-
Site Reliability Engineer
3 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Randstad (Schweiz) AG Full timeSite Reliability Engineer / SRE (Hybrid) | KLHybridOverview:As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system...
-
Site Reliability Engineer Professional
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Businesslist Full timeA highly skilled Site Reliability Engineer will play a pivotal role in ensuring the smooth operation of our cloud infrastructure. They will be responsible for managing and optimizing our Azure environment to meet the evolving needs of our business.This is an exciting opportunity for a technical expert to join our team at Businesslist, where we are dedicated...
-
Site Reliability Engineering Director
7 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia SWIFT Full timeAbout the PositionWe are seeking a highly skilled Senior Site Reliability Engineer to lead our team responsible for ensuring the reliability, uptime, and performance of our mission-critical systems. As a Senior SRE Manager, you will be responsible for providing technical leadership, mentorship, and guidance to your team members, as well as collaborating with...
-
Site Reliability Engineer
3 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Hunters International Sdn Bhd Full timeOverview:As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on...
-
Site Reliability Engineer
4 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia iSoftStone Full timeiSoftStone WP. Kuala Lumpur, Federal Territory of Kuala Lumpur, MalaysiaSite Reliability EngineeriSoftStone WP. Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia2 weeks ago Be among the first 25 applicantsEnsure the stability of Alibaba apsara stack and cloud services running on it. Carry health check, operation & maintenance, troubleshooting tasks....
-
Site Reliability Engineering Lead
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Tetrasparks Sdn Bhd Full time1 week ago Be among the first 25 applicantsDirect message the job poster from Tetrasparks Sdn BhdHuman Resources Recruiter at Tetrasparks Sdn BhdWe are seeking an experienced and dynamic Site Reliability Engineering Lead to drive the reliability, scalability, and performance of our production systems in I-Gaming industry. In this role, you will lead a team...