SRE Senior Engineering Manager
3 weeks ago
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Swift's services—both our internally critical and externally visible systems—have reliability, uptime appropriate to users' needs, and a fast rate of improvement. Additionally, SREs maintain vigilant oversight of system capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating manual work through automation.
As a Senior Site Reliability Engineering Manager, you will lead a team responsible for providing the platform for mission-critical systems to maintain constant uptime, scale seamlessly, and enable new applications and services to flourish. The successful candidate will be highly self-motivated with a passion for excellence, quality, and detail. The SRE Manager will support operations, collaborate with developers and architects to design systems, and assist in implementation to improve stability, security, and scalability.
What to Expect?
Team Building and Mentorship:
- Recruit and retain engineers with diverse perspectives.
- Provide coaching, mentorship, and career development support to ensure team members excel both technically and personally.
Collaboration and Alignment:
- Partner with Product Owners and Engineering Leads to align SRE members with cross-functional squads.
- Foster effective collaboration across teams and functions.
Technical Leadership:
- Guide software design patterns, architecture, and engineering best practices.
- Drive design-focused software delivery to enhance quality and scalability.
Continuous Learning and Knowledge Sharing:
- Promote a culture of learning, knowledge sharing, and excellence across the organization.
- Encourage adoption of consistent practices across teams.
Driving Innovation:
- Stay updated on technology trends and facilitate the adoption of new tools and methodologies.
- Encourage innovative thinking to drive team and organizational growth.
Performance Management:
- Set annual objectives for team members.
- Conduct performance appraisals and provide constructive feedback to support career development.
What Will Make You Successful?
Professional Skills:
- Bachelor's or higher degree in Computer Science, Engineering, or related disciplines.
- Strong communication and leadership skills, promoting a diverse and collaborative culture.
- Passion for people development and commitment to creating an inclusive work environment.
- Customer-oriented and quality-focused mindset with a drive to deliver true customer value.
- Open-minded, solutions-oriented team player energized by collaboration.
- Familiarity with Agile and DevOps practices.
- Fluency in English (spoken and written).
- Experience in observability and/or anomaly detection is a plus.
Key Qualifications:
- 8+ years of experience in software development using one or more programming languages.
- Expertise in designing, analyzing, and troubleshooting distributed systems.
- 5+ years of leadership experience managing and mentoring technical teams.
- Skilled in cross-functional collaboration to achieve project success.
- Strong passion for automation and reducing manual workloads.
- Proven ability to encourage a culture of visibility and transparency across teams.
- Experience managing enterprise services in large-scale Linux environments.
- Expertise with Kubernetes and configuration management tools like Puppet, Chef, or Ansible.
- Proficiency in troubleshooting issues across the entire software stack.
- Hands-on experience operating large-scale multi-tenant infrastructure as a managed service.
- Strong verbal and written communication skills.
Additional Requirements:
- Advocacy for automation to minimize operational workloads.
- Strong sense of ownership, coupled with a collaborative and transparent communication style.
- Self-motivated and inquisitive, always eager to learn and improve systems and processes.
About the Team:
On the SRE team, you'll tackle the complex challenges of scale unique to Swift, leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. Our culture values diversity, intellectual curiosity, problem-solving, and openness. We encourage collaboration, big thinking, and risk-taking in a supportive, blame-free environment. SRE promotes self-direction to work on meaningful projects while fostering a learning environment that provides the mentorship needed to grow and succeed.
What we offer:
- We put you in control of your career.
- We give you a competitive package.
- We help you perform at your best.
- We help you make a difference.
- We give you the freedom to be yourself.
We give you the freedom to be yourself. We are creating an environment of unique individuals – like you – with different perspectives on the financial industry and the world. An environment in which everyone's voice counts and where you can reach your full potential regardless of age, background, culture, colour, disability, gender, nationality, race, religion, or veteran/military status.
#J-18808-Ljbffr-
SRE Senior Engineering Manager
3 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Swift Software Full timeSRE Senior Engineering Manager page is loadedSRE Senior Engineering ManagerApply locations Kuala Lumpur, Malaysia time type Full time posted on Posted Today job requisition id 2024-13914About the RoleSite Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE...
-
DevOps/SRE Engineer
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Unison Consulting Full timeWe are seeking a talented DevOps/Site Reliability Engineer (SRE) with a strong background in DevOps practices, Linux environments, and proficiency in scripting and programming languages like Bash, Shell, Python, or Golang. You will be responsible for managing and automating the deployment, monitoring, and reliability of our services. You will work closely...
-
SRE/DevOps Engineer
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Ibroad Solutions Full timeJob Title: SRE/DevOps EngineerEmployment Type: Full-timeResponsibilitiesDesign and implement resilient system architectures that support high availability and scalabilityDevelop automation tools and scripts to enhance operational efficiency and reduce manual effortDefine, track, and analyze SLOs and SLIs to ensure reliability and performance meet business...
-
SRE/ Devops Engineer
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Unison Consulting Full timeWe are seeking a talented DevOps/Site Reliability Engineer (SRE) with a strong background in DevOps practices, Linux environments, and proficiency in scripting and programming languages like Bash, Shell, Python, or Golang. You will be responsible for managing and automating the deployment, monitoring, and reliability of our services. You will work closely...
-
DevOps and SRE Professional
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Smart Teq Solution Sdn Bhd Full timeSmart Teq Solution Sdn Bhd is seeking a skilled DevOps and SRE Professional to join our team.About the RoleThis role involves ensuring all our infrastructure runs at optimal condition.Provide deployment, patches, and updates on all services running on public cloud and on-premise.Work closely with developers to provide complete, up-to-date, and readable...
-
Senior Site Reliability Engineering Manager
1 week ago
Kuala Lumpur, Kuala Lumpur, Malaysia Swift Software Full timeAbout the Role:As a Senior Site Reliability Engineering Manager at Swift Software, you will lead a team responsible for providing the platform for mission-critical systems to maintain constant uptime, scale seamlessly, and enable new applications and services to flourish. The successful candidate will be highly self-motivated with a passion for excellence,...
-
Engineering Operations Manager
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Ibroad Solutions Full timeEngineering Operations Manager Job OverviewIbroad Solutions is seeking an experienced Engineering Operations Manager to lead our engineering operations team. This role involves overseeing the design and implementation of resilient system architectures that support high availability and scalability.The ideal candidate will have a strong understanding of SRE...
-
Engineering Manager for System Reliability
1 week ago
Kuala Lumpur, Kuala Lumpur, Malaysia Swift Software Full timeJob Description:We are seeking a Senior Site Reliability Engineering Manager to lead our team in providing high-availability platforms that support our mission-critical systems. The ideal candidate will have experience in designing, analyzing, and troubleshooting distributed systems, as well as collaborating with developers and architects to improve...
-
Cloud Engineering Manager
1 week ago
Kuala Lumpur, Kuala Lumpur, Malaysia Hytech Empire Full timeHytech Empire is seeking a highly experienced Cloud Engineering Manager to lead our cloud engineering efforts.You will be responsible for managing a team of engineers and ensuring that we deliver high-quality cloud solutions to our clients.We are looking for a professional with expertise in scripting languages, strong understanding of networking concepts,...
-
Reliability Engineering Specialist
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia PEOPLE PROFILERS Full timeJob Description:We are seeking an experienced Reliability Engineering Specialist to join our team. In this role, you will be responsible for the management and delivery of a system(s) within a platform leveraging agile practices.The ideal candidate will have at least 3 - 6 years of relevant experience in DevOps, SRE, and a full understanding of Site...
-
Site Reliability Engineer
4 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Aarorn Technologies Sdn Bhd Full time- ONSITE - Language proficiency - Mandarin - 5+ years exp as SREOverview:Site Reliability Engineer (SRE)As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive...
-
Site Reliability Engineer
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia KTH HR CONSULTING ZONE Full timeAs a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on key SRE...
-
Site Reliability Engineer Lead
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Smart Teq Solution Sdn Bhd Full timeWe're seeking an experienced Site Reliability Engineer Lead to join our team at Smart Teq Solution Sdn Bhd.About the RoleThis role involves ensuring all our infrastructure runs at optimal condition.Provide deployment, patches, and updates on all services running on public cloud and on-premise.Work closely with developers to provide complete, up-to-date, and...
-
Site Reliability Engineer
3 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Glints Full timeGlints Federal Territory of Kuala Lumpur, MalaysiaSite Reliability EngineerReady to elevate your career with a globally recognized professional services firm? We are seeking a skilled DevOps / SRE Specialist to join our team. You'll be at the forefront of transforming business challenges into cutting-edge technology solutions, working alongside diverse...
-
Site Reliability Engineer
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Teknowiz Full timeSite Reliability Engineer (DevOps Consultant)We are urgently hiring for one of our Big4 clients in Malaysia.Job Title: Site Reliability Engineer (DevOps)Location: KL/Johor/Penang (Onsite)Job Overview: As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help...
-
System Architect and Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia KTH HR CONSULTING ZONE Full timeAbout KTH HR CONSULTING ZONEWe are a leading provider of human resources consulting services, dedicated to delivering exceptional results to our clients. Our mission is to empower organizations to achieve their goals by leveraging innovative solutions and best practices.Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our...
-
Site Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Smart Teq Solution Sdn Bhd Full timeGet AI-powered advice on this job and more exclusive features.ResponsibilitiesEnsure all our infrastructure are running at optimal condition.Provide deployment, patches and updates on all services that are running on public cloud and on premise.Identify and resolve support tickets that are related to our infrastructure and services.Work closely with...
-
Site Reliability Engineer
3 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Randstad (Schweiz) AG Full timeSite Reliability Engineer / SRE (Hybrid) | KLHybridOverview:As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system...
-
IT Recruiter
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia PEOPLE PROFILERS Full timeKuala Lumpur, Federal Territory of Kuala Lumpur, MalaysiaIT Recruiter (RPO)Job Description:Job Ref: QVXY3W46People Profilers is hiring on behalf of a leading global consulting firm for a Senior IT Recruiter (RPO) in KL, Malaysia.The ideal candidate will possess extensive experience in recruiting Senior SRE, DevOps, NOC Engineers, Network Engineers, and Linux...
-
Senior Structural Engineer
4 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Trees Engineering - Services Marketplace for the Energy Sector Full timeHuman Resources Business Partner at Trees EngineeringAbout The JobPerform as Leads / Sub-lead / Senior Engineer in project execution and takes the ownership of the specific tasks in a projectTechnical responsibility of Material RequisitionPerforming Design CalculationsCoordination with other disciplinesForecasting of MTO's for design areaCorrectly interpret...