Senior Site Reliability Engineer
2 days ago
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of global distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution.
The company is founder-led, profitable and growing.
We are hiring a Senior Site Reliability Engineer
Next-gen operations at scale, with pure Python infra-as-code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops.
We run hundreds of private cloud, Kubernetes, and application clusters for customers across physical and public cloud estate, and we are raising the bar on what's possible with automation by embracing a universal operator pattern and model-driven operations.
To succeed in this role you need to believe in automation as a pure software engineering problem, not a hack-it-till-it-works-for-me problem. You need to be interested in the scientific approach to operations at scale, driven by metrics and code, and you need to be able to learn the entire stack, from bare metal networking and kernel up to serverless and open source applications.
Location: Globally remote role
The role entails
Our cloud operations engineers bring Python software-engineering skills and rigour to the operations domain. We practise devsecops from bare metal to application. We architect and run OpenStack, Kubernetes and software-defined storage, and we enable devsecops for applications running on that infrastructure too.
To become a member of this team, you need to be a software engineer fluent in Python, with a genuine interest in the full open source infrastructure stack from metal to containers and the ability to work in a high-pressure operations environment with mission-critical services for global brand-name customers.
As a member of the team you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure. We drive upgrades to keep our customers on the latest, best solutions.
What we are looking for in you
- Degree in Software Engineering or Computer Science
- Experience with Linux and familiarity with Linux networking and storage
- Python software development expertise
- Operational experience
- Excellent interpersonal skills, curiosity, flexibility, and accountability
- Ability to travel internationally twice a year, for company events up to two weeks long
- Experience with OpenStack or Kubernetes deployment or operations
- Familiarity with public or private cloud management
We consider geographical location, experience, and performance in shaping compensation worldwide. We revisit compensation annually (and more often for graduates and associates) to ensure we recognise outstanding performance. In addition to base pay, we offer a performance-driven annual bonus or commission. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programme
- Opportunity to travel to new locations to meet colleagues
- Priority Pass, and travel upgrades for long haul company events
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.
Canonical is an equal opportunity employer
We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.
-
Site Reliability Engineer
2 weeks ago
Kuala Lumpur, Kuala Lumpur, Malaysia Kneat Full time 80,000 - 120,000 per year
Site Reliability Engineer – Kuala Lumpur, Malaysia
Kneat enables regulated organizations to move from paper-based validation to intelligent, digitized, paperless solutions. And we do it through the ongoing development of a powerful, purpose-built software platform. In 2014, after eight years of intensive software development, we launched Kneat Gx, the...
-
Senior Site Reliability
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Canonical - Jobs Full time 120,000 - 240,000 per year
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...
-
Site Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia VCB Malaysia Berhad Full time 144,000 - 156,000 per year
Overview:
As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on...
-
Site Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia PeopleScope Full time 60,000 - 120,000 per year
Site Reliability Engineer
Job Description:
Ability to debug scripts and automate routine tasks in OS, network, database or application servers. Coding experience beyond simple scripts; experience in DevOps processes; programming knowledge in at least one of the following languages: Java, Python, or Go; scripting skills in at least one of the following:...
-
Site Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Abhidi Solution Private Limited Full time 120,000 - 180,000 per year
Job Title: Site Reliability Engineer (SRE)
Job Type: Permanent position
Work Location: Kuala Lumpur
Responsibilities:
- Strong hands-on experience with VMware solutions
- Strong experience with patch management for OS & middleware
- Experience in VMware server templating/blueprints (RedHat & Windows)
- Experience with Infrastructure-as-Code, orchestration, configuration...
-
Site Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Hunters International Full time 19,000 per year
Overview:
As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on...
-
Site Reliability Engineer
4 hours ago
Kuala Lumpur, Kuala Lumpur, Malaysia Unison Consulting Full time 120,000 - 240,000 per year
As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on key SRE...
-
Senior Site Reliability Engineer
3 hours ago
Kuala Lumpur, Kuala Lumpur, Malaysia Huawei Consumer Business Group Full time 80,000 - 120,000 per year
This role is responsible for reliability, availability, user experience, capacity planning, AIOps, process enhancement and digitalization of the cloud-based internet services.
Main responsibilities:
Handle SRE role for assigned cloud services owning the KPIs for service reliability, issue to resolution, service deployment, business continuity management,...
-
Site Reliability Engineer
2 days ago
Kuala Lumpur, Kuala Lumpur, Malaysia Unison Group Full time 120,000 - 240,000 per year
As a Site Reliability Engineer (SRE), you will play a key role in maintaining the reliability and performance of critical services. Your expertise will help bridge the gap between development and operations, ensuring robust, scalable, and responsive infrastructure. This role emphasizes strong system architecture and design principles, focusing on key SRE...