SRE Senior Engineering Manager

3 weeks ago


Kuala Lumpur, Kuala Lumpur, Malaysia | Swift | Full time
About the Role

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Swift's services, both internally critical and externally visible, have reliability and uptime appropriate to users' needs and a fast rate of improvement. SREs also keep close watch on system capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating manual work through automation.

As a Senior Site Reliability Engineering Manager, you will lead a team responsible for providing the platform for mission-critical systems to maintain constant uptime, scale seamlessly, and enable new applications and services to flourish. The successful candidate will be highly self-motivated with a passion for excellence, quality, and detail. The SRE Manager will support operations, collaborate with developers and architects to design systems, and assist in implementation to improve stability, security, and scalability.

What to Expect?

Team Building and Mentorship:

  • Recruit and retain engineers with diverse perspectives.
  • Provide coaching, mentorship, and career development support to ensure team members excel both technically and personally.

Collaboration and Alignment:

  • Partner with Product Owners and Engineering Leads to align SRE members with cross-functional squads.
  • Foster effective collaboration across teams and functions.

Technical Leadership:

  • Guide software design patterns, architecture, and engineering best practices.
  • Drive design-focused software delivery to enhance quality and scalability.

Continuous Learning and Knowledge Sharing:

  • Promote a culture of learning, knowledge sharing, and excellence across the organization.
  • Encourage adoption of consistent practices across teams.

Driving Innovation:

  • Stay updated on technology trends and facilitate the adoption of new tools and methodologies.
  • Encourage innovative thinking to drive team and organizational growth.

Performance Management:

  • Set annual objectives for team members.
  • Conduct performance appraisals and provide constructive feedback to support career development.

What Will Make You Successful?

Professional Skills:

  • Bachelor's or higher degree in Computer Science, Engineering, or related disciplines.
  • Strong communication and leadership skills, promoting a diverse and collaborative culture.
  • Passion for people development and commitment to creating an inclusive work environment.
  • Customer-oriented and quality-focused mindset with a drive to deliver true customer value.
  • Open-minded, solutions-oriented team player energized by collaboration.
  • Familiarity with Agile and DevOps practices.
  • Fluency in English (spoken and written).
  • Experience in observability and/or anomaly detection is a plus.

Key Qualifications:

  • 8+ years of experience in software development using one or more programming languages.
  • Expertise in designing, analyzing, and troubleshooting distributed systems.
  • 5+ years of leadership experience managing and mentoring technical teams.
  • Skilled in cross-functional collaboration to achieve project success.
  • Strong passion for automation and reducing manual workloads.
  • Proven ability to encourage a culture of visibility and transparency across teams.
  • Experience managing enterprise services in large-scale Linux environments.
  • Expertise with Kubernetes and configuration management tools like Puppet, Chef, or Ansible.
  • Proficiency in troubleshooting issues across the entire software stack.
  • Hands-on experience operating large-scale multi-tenant infrastructure as a managed service.
  • Strong verbal and written communication skills.

Additional Requirements:

  • Advocacy for automation to minimize operational workloads.
  • Strong sense of ownership, coupled with a collaborative and transparent communication style.
  • Self-motivated and inquisitive, always eager to learn and improve systems and processes.

About the Team:

On the SRE team, you'll tackle the complex challenges of scale unique to Swift, leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. Our culture values diversity, intellectual curiosity, problem-solving, and openness. We encourage collaboration, big thinking, and risk-taking in a supportive, blame-free environment. SRE promotes self-direction to work on meaningful projects while fostering a learning environment that provides the mentorship needed to grow and succeed.

What we offer:

  • We put you in control of your career.
  • We give you a competitive package.
  • We help you perform at your best.
  • We help you make a difference.
  • We give you the freedom to be yourself.

We give you the freedom to be yourself. We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. An environment in which everyone's voice counts and where you can reach your full potential regardless of age, background, culture, colour, disability, gender, nationality, race, religion, or veteran/military status.
