Site Reliability/DevOps Developer

7 days ago


Kuala Lumpur, Malaysia amIT Global Solutions Sdn Bhd Full time

**Requirements**:

- You will be responsible for managing and delivering one or more systems within a platform, drawing on your existing experience of working in an agile environment.
- **The right person will have 8 to 12 years of relevant experience in DevOps and SRE**.
- **Should be well versed in the concepts of DevOps and have a full understanding of Site Reliability Engineering (SRE) principles**:

- Knowledge of the correlation between **SLIs and SLOs** when measuring service reliability
- Must be familiar with well-known system monitoring and configuration management tools such as Elasticsearch, Grafana, Prometheus, Ansible and SaltStack
- Must be familiar with Linux system administration and Linux shell programming (Bash)
- Possesses programming skills in one or more of these languages: Java, Python
- Experience in coordinating with development teams to streamline code deployment with CI/CD and IaC pipelines, and the ability to build automated solutions through code.
- Familiar with message queue systems (e.g., Kafka, RabbitMQ) and other distributed systems (e.g., Consul, ZooKeeper, MongoDB, Redis)
- Experience in conducting system tests for security, performance, availability, and reliability.
- Demonstrated skills in communication (oral, written, presentation), analysis, problem solving, and short-term and long-term planning.
- Demonstrated portfolio of work showcasing technical competence
- An appreciation of the consulting lifestyle and the ability to travel (both locally and abroad) are prerequisites for our short-term and long-term project assignments.

**Job Types**: Full-time, Permanent, Contract
Contract length: 24 months

Pay: RM10,000.00 - RM25,000.00 per month



  • Kuala Lumpur, Kuala Lumpur, Malaysia PeopleScope Full time 60,000 - 120,000 per year

    Site Reliability Engineer. Job Description: Ability to debug scripts and automate routine tasks in OS, network, database or application servers. Coding experience beyond simple scripts; experience in DevOps processes; programming knowledge in at least one of the following languages: Java, Python, or Go; scripting skills in at least one of the following:...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Kneat Full time

    Site Reliability Engineer – Kuala Lumpur, Malaysia. Kneat enables regulated organizations to move from paper-based validation to intelligent, digitized, paperless solutions. And we do it through the ongoing development of a powerful, purpose-built software platform. In 2014, after eight years of intensive software development, we launched Kneat Gx, the...


  • Kuala Lumpur, Kuala Lumpur, Malaysia VCB Malaysia Berhad Full time 144,000 - 156,000 per year

    Overview: As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Abhidi Solution Private Limited Full time 120,000 - 180,000 per year

    Job Title: Site Reliability Engineer (SRE). Job Type: Permanent position. Work Location: Kuala Lumpur. Responsibilities: Strong hands-on experience with VMware solutions; strong experience with patch management for OS & middleware; experience in VMware server templating/blueprints (RedHat & Windows); experience with Infrastructure-as-Code, orchestration, configuration...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Hunters International Full time 19,000 per year

    Overview: As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Unison Consulting Full time 120,000 - 240,000 per year

    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on key SRE...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Unison Group Full time 120,000 - 240,000 per year

    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on key SRE...


  • Kuala Lumpur, Kuala Lumpur, Malaysia Career Wise Full time 120,000 - 240,000 per year

    As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on key SRE...


  • Kuala Lumpur, Kuala Lumpur, Malaysia FPT Software Malaysia Sdn. Bhd. Full time

    Key Responsibilities: Disaster Recovery Planning (DRP): Design and maintain scalable failover systems, backup strategies, and redundancy mechanisms across cloud and on-prem environments. Develop and update DR documentation, runbooks, and recovery playbooks for infrastructure and application layers. Business Continuity Testing: Plan, coordinate, and execute...